front 1 How do slicers enhance an Excel dashboard? | back 1 By allowing users to filter data visually and interactively |
front 2 One-way ANOVA is most appropriate when you want to: | back 2 Compare means across three or more independent groups |
front 3 Which AI tool is best for direct data analysis — allowing users to upload CSV/Excel files and ask questions in plain English to receive regressions, forecasts, and visualizations? | back 3 Julius AI |
front 4 If df.shape returns (1000, 15), this means: | back 4 The dataset has 1000 rows and 15 columns |
front 5 The operation df.groupby('category')['rating'].mean() is used to: | back 5 Calculate the average rating for each product category |
front 6 How does Apple's decision-making approach differ from that of most companies? | back 6 It encourages collaborative debate between functional experts rather than top-down directives |
front 7 Which of the following metrics measures forecast accuracy? | back 7 MAD |
front 8 What does a business 'Agent' do that distinguishes it from a traditional Chatbot? | back 8 It lets the LLM decide the path at runtime and loops between thought, action, and observation |
front 9 One of the goal of data architecture is to manage data and its flow through the enterprise. It follows a set of principles. Which of the following is a principle of data architecture? | back 9 Provide user interfaces for consuming data. |
front 10 Why is business analytics described as 'uniquely well-suited' for agentic AI? | back 10 Because analytics tasks are tool-rich (SQL, Python, sheets) and goal-driven — exactly the conditions under which agents outperform single-shot LLM calls |
front 11 Which SQL function is used to count the number of rows in a table? | back 11 COUNT() |
front 12 What is the benefit of using a PivotTable in a dashboard? | back 12 It allows users to create dynamic charts and summaries easily |
front 13 Which tool is best for 'long, parallelizable analytics tasks'? | back 13 Kimi K2.5 |
front 14 Which of the following statements is False? | back 14 Euclidean distance does not allow for diagonal movement, which may lead to inaccurate similarity assessments in high-dimensional spaces. |
front 15 What does "Generalization" mean in the context of Machine Learning? | back 15 The ability of a model to make accurate predictions on unseen data |
front 16 Which of the following is a common mistake when choosing a chart type for data visualization? | back 16 Using a pie chart with too many variables |
front 17 Which of the following statement is INCORRECT in the perspective of data architecture and data modeling? | back 17 Data architecture is about the micro view that seeks to understand and support the relationships between an organization’s functions, technology, and data types. |
front 18 In a linear programming problem, the objective function represents: | back 18 The quantity to be maximized or minimized |
front 19 Which of the following best describes “phishing” as a cybersecurity threat? | back 19 Trickery used to obtain sensitive information by impersonating trusted entities |
front 20 What does a lower MAPE value indicate about a forecasting model? | back 20 The forecast is more accurate |
front 21 What is an example of disruptive technology? | back 21 Computers with word processors replacing typewriters |
front 22 Which of the following describes Porter’s Generic Competitive Strategies? | back 22 Strategies for gaining a competitive advantage in the market |
front 23 What is the primary function of similarity measures in data mining? | back 23 To quantify how data points are alike based on specific mathematical functions. |
front 24 Which of the following is not one of the three core characteristics of cloud computing? | back 24 Fully managed by the user |
front 25 Which regulation is specifically designed to protect the personal data of European Union citizens? | back 25 GDPR |
front 26 Which algorithm is used for dimensionality reduction in Machine Learning? | back 26 Principal Component Analysis (PCA) |
front 27 Which Excel feature is commonly used to filter data interactively on a dashboard? | back 27 Slicer |
front 28 Pivot Tables are essential in dashboard creation because they: | back 28 Summarize and aggregate large datasets dynamically without changing source data |
front 29 What is the primary purpose of Google's NotebookLM | back 29 To summarize long files and websites, and generate structured notes and study guides |
front 30 In data inspection, conditional formatting is most useful for: | back 30 Visually identifying patterns such as missing values or outliers |
front 31 In the context of linear programming, 'slack' refers to: | back 31 The unused portion of a resource constraint |
front 32 Excel Solver is most appropriately used to: | back 32 Find optimal values for decision variables subject to constraints |
front 33 Which of the following is a benefit of data literacy within an organization? | back 33 Improved cross-functional collaboration |
front 34 What was the primary organizational change Steve Jobs implemented upon returning to Apple in 1997? | back 34 Eliminating general managers and restructuring into a functional organization |
front 35 Why does Apple avoid tying senior R&D executives’ bonuses to individual product performance? | back 35 To encourage long-term innovation without short-term financial pressure |
front 36 Each organization strives to be ethical, to be legally, and morally strong. Which of the following is NOT one of the principles of data ethics in the organization? | back 36 Foresightedness |
front 37 Which of the following best describes strategy risks according to Kaplan and Mikes’ framework? | back 37 Risks taken voluntarily in pursuit of superior returns |
front 38 In the context of a confusion matrix, what does a high false negative (FN) rate indicate in a fraud detection model? | back 38 The model is misclassifying a large number of fraudulent transactions as non-fraudulent. |
front 39 Which algorithm is commonly used for clustering in unsupervised learning? | back 39 K-Means |
front 40 Which library is commonly used in Google Colab for handling Excel files? | back 40 Pandas |
front 41 Which of the following scenarios would most likely result in an underfitted model? | back 41 A decision tree with minimal depth that fails to capture patterns in the data. |
front 42 A retail chain randomly selects entire cities for a sales performance survey, and all stores within those cities are included. Which sampling method is this, and what advantage does it provide? | back 42 Cluster sampling – simplifies data collection from large, dispersed populations |
front 43 Which SQL clause is used to sort the result set of a query? | back 43 ORDER BY |
front 44 Which Excel function would be most appropriate to count how many houses in the dataset have exactly 3 bedrooms AND are located in Seattle? | back 44 COUNTIFS() |
front 45 A firm identifies that 25% of its records contain missing values for “Customer Income.” It chooses to compute averages from the same geographic area and fill these gaps. This practice best exemplifies: | back 45 Imputation |
front 46 In an adjacency matrix for an undirected graph, which of the following is always true? | back 46 The matrix is symmetric (entry i,j equals entry j,i) |
front 47 Which of the following best describes Platform-as-a-Service (PaaS) in cloud computing? | back 47 It delivers tools and infrastructure for developers to build applications over the Internet |
front 48 What is the main challenge of reinforcement learning? | back 48 Balancing exploration and exploitation |
front 49 What is the purpose of the PRIMARY KEY in a table? | back 49 To uniquely identify each record in a table |
front 50 Converting a data range into an Excel Table (using Insert > Table) provides which primary advantage for data analysis? | back 50 It enables structured references and dynamic formulas that adjust as data changes |
front 51 What does ERP stand for? | back 51 Enterprise Resource Planning |
front 52 In network analysis, what do 'nodes' and 'edges' represent respectively? | back 52 Entities (e.g., people, companies) and the relationships or connections between them |
front 53 What keyword is used to filter records in SQL? | back 53 WHERE |
front 54 Microsoft’s acquisition of LinkedIn illustrates which core dimension of data monetization strategy? | back 54 Transformation of user engagement into revenue through enhanced contextual targeting |
front 55 What lesson can other companies learn from Apple's approach to innovation? | back 55 Aligning decision-making authority with technical expertise can lead to sustained innovation |
front 56 A logistics company wants to place a new distribution center so it can reach all retail locations with the fewest delivery hops. Which centrality measure should they prioritize? | back 56 Closeness Centrality — to find the location closest to all others |
front 57 Which of the following is a common mistake in database design? | back 57 Ignoring Normalization, Communication issues between the business and tech, Redundant tables and fields |
front 58 In regression output, a p-value less than 0.05 for a predictor variable (e.g., square footage) indicates: | back 58 The variable has a statistically significant relationship with the outcome |
front 59 Based on the MIT finding that 83.3% of users could not recall sentences they wrote minutes earlier when using AI, what is a key concern for business analytics professionals? | back 59 Over-reliance on AI may reduce critical thinking |
front 60 The R-squared value in regression analysis tells you: | back 60 The proportion of variance in the dependent variable explained by the independent variables |
front 61 Which of the following statements is NOT correct? | back 61 Discovery Bank in South Africa believes there’s a correlation between fiscal responsibility and family size. |
front 62 According to Gartner’s definition, which of the following is NOT a component of data literacy? | back 62 The capability to modify database schemas. |
front 63 What is the primary function of data governance in an organization? | back 63 Controlling how data is stored, accessed, and maintained. |
front 64 Which of the following best describes a decision variable in an optimization model? | back 64 A controllable factor that directly influences the outcome of an optimization problem. |
front 65 What is the primary purpose of a Data as a Service (DaaS) model? | back 65 To provide on-demand access to data without managing infrastructure. |
front 66 What is an example of predictive analytics in business? | back 66 None of the option |
front 67 Which of the following statements is not a key principle of quality data? | back 67 Management success and quality data are inextricably linked. |
front 68 Which of the following cloud computing service models provides a complete development and deployment environment without managing the underlying infrastructure? | back 68 Platform as a Service (PaaS) |
front 69 What is the primary objective of data management in business analytics? | back 69 To organize, access, and govern data for improved decision-making |
front 70 Effective data monetization can be a source of competitive advantage. Which of the following approaches is NOT related to data monetization? | back 70 Connecting suppliers to customers in the value chain |
front 71 How does federated learning enhance data privacy in Big Data applications? | back 71 By training on decentralized devices without sharing raw data. |
front 72 Which of the following is an example of semi-structured data? | back 72 JSON |
front 73 Which of the following statements is NOT correct? | back 73 Data brokers' data can not be used for harmful purposes, such as identity theft, fraud, or targeted discrimination |
front 74 Why is fixing data errors through cleanup considered ineffective? | back 74 It is expensive and time-consuming, and new errors keep emerging if the root cause is not addressed. |
front 75 What is the primary role of business analytics in organizations? | back 75 To support data-driven decision-making using statistical and quantitative methods |
front 76 Which of the following is False? | back 76 Cloud computing increases data processing latency by performing computations closer to data sources. |
front 77 If you plan to start a real state business, which data brokers should you ask for data analytics? | back 77 CoreLogic |
front 78 Which type of decision model assumes that all input parameters are known with certainty? | back 78 Deterministic Model |
front 79 Which of the following statements is NOT correct? | back 79 Cloud computing is public, meaning that your data is also public. |
front 80 What is the primary goal of start-ups like Trifacta and Paxata in the big data industry? | back 80 To automate the processes of data gathering, cleaning, and preparation. |
front 81 Which of the following state is NOT correct for data brokerage? | back 81 They regulate the internet and monitor online activity for security purposes. |
front 82 Which of the following tools provide the capability of data preparation with analytics? | back 82 Rocket |
front 83 Which of the following options is correct? | back 83 McKinsey Triangle - Analyzing, Managing, and Presenting |
front 84 Which of the following is NOT a real-world application of optimization in business? | back 84 Predicting stock market trends using historical data. |
front 85 What kind of data does Apple collect when users send messages via iMessage? | back 85 encrypted text and anonymized metadata |
front 86 Which of the following statements is correct? | back 86 Unlike physical assets, data increases in value when used multiple times. |
front 87 How do data brokers collect consumer information? | back 87 By tracking social media posts, online transactions |
front 88 In the McKinsey Global Institute study, what was identified as the primary economic impact of Big Data? | back 88 Transformation of economies by increasing productivity growth and consumer surplus. |
front 89 Why is prescriptive analytics considered more advanced than predictive analytics? | back 89 It predicts future events with specific actions to achieve desired outcomes. |
front 90 Which of the following tools is NOT used in Excel’s What-If Analysis? | back 90 Monte Carlo Simulation |
front 91 Which of the following is NOT a key task in data wrangling? | back 91 Encryption |
front 92 Which of the following statement is NOT correct? | back 92 Structured data is stored in databases with undefined schemas, while unstructured data follows some kind of tagging or metadata. |
front 93 Why do companies like Amazon, and Capital One dominate their industries? | back 93 They use analytics across multiple business functions. |
front 94 According to research, what percentage of business contacts experience some form of data change within a year? | back 94 70.8% |
front 95 What is the primary reason organizations implement data localization laws? | back 95 None of the options |
front 96 What was one of the major incentives for Microsoft's acquisition of LinkedIn in 2016? | back 96 To leverage LinkedIn’s user data for monetization through targeted advertising and professional insights. |
front 97 Which of the following statements is NOT correct? | back 97 None of the options |
front 98 What is digital mindset? | back 98 The attitudes and behaviors that enable people to see opportunities in data, AI, and algorithms. |
front 99 Which of the following statements is NOT correct? | back 99 With PaaS, an organization shifts its hardware to the cloud by renting servers and data storage instead of buying and managing its own. |