_______ is an interactive website kept constantly updated and
relevant to the needs of its customers using a database.
a. data warehouse
b. data mart
c. data driven website
d. web blog
The process of analyzing data to extract information not offered by
the raw data alone.
a. information scrubbing
c. data governance
d. data mining
_______ provides details about data
a. data dictionary
b. data element
d. foreign key
_______ stores information about a person, place, thing, transaction,
c. data element
_______ helps users graphically design the answer to a question
against a database.
a. query by example tool
b. structured query language
c. relational database model
d. relational database management system
An example of transactional information would be:
a. product statistics
b. sales projections
c. future growth projections
d. sales receipt
The smallest or basic unit of information is referred to as:
a. database model
c. data element
Fixing or discarding inconsistent or incorrect data is known as:
a. data mining
b. data organizing
c. data cleaning
d. data exploring
Revealing the relationship between variables with nature and
frequency of relationship is known as:
a. statistical analysis
b. cluster analysis
c. association detection
d. data cleaning
T/F A representation of multidimensional information can be referred to as a "cube".
Which of the following are both examples of analytical information that might be stored in a database?
A. Sales receipts and packing slips
B. Sales receipts and future growth
C. Trends and packing slips
D. Product statistics and sales projections
Which of the following is NOT one of the five common characteristics of high quality information?
Give at least one advantage of a data driven website.
Answer: (1) The content is easy to manage. (2) It is easy to store large amounts of data. (3) It is easy to eliminate human errors.
Time-series information is used in association detection.
Which of the following is NOT a way by which data warehouses increase effectiveness?
A. Analyzing trends
B. Identifying product waste
C. Understanding competitors
D. Developing customer profiles
T/F Data mining is one of the four aspects of Data we need to consider?
T/F All Data systems have the same format?
Can Databases be linked together and if so what is it called?
They can be linked together, even multiple databases at a time and this is called a relational database.
T/F Metadata is data about data?
Companies use ______________ techniques to analyze particular data in
order to compile a complete picture of their operations allowing
identification of trends and improved forecasts.
A. Data mining
B. Unstructured Data
C. Market Basket
D. Business Intelligence
The primary purpose of a data warehouse is to combine information.
If a grocery store is trying to analyze data regarding what
percentage of the time customers buying bread are also buying peanut
butter they would use the data mining technique known as:
a. Cluster Analysis
b. Statistical Analysis
c. Time-series Analysis
d. Market Basket Analysis
It is crucial for businesses to understand digest, analyze and filter
high-quality information in order to see growth and success in their
industry. Information that has a primary purpose to support daily
operational tasks is known as _______________.
a. Analytical Information
b. Transactional Information
d. Information granularity
There are common characteristics of high-quality information to make
certain that systems do not suffer from data integrity issues. Which
of these characteristics is not one of them?
True/False. Information cleansing or scrubbing is a process that weeds out and fixes or discards incorrect or incomplete information.
True/False. Web mining analyzes unstructured data to find trends and patterns in words and sentences.
What is a technique used to divide information sets into manually
exclusive groups so that members of each group are close together and
different groups are far away?
a) Association Detection
b) Cluster Analysis
c) Market Basket Analysis
d) Statistical Analysis
a) assigns records to one of a predefined set of classes
b) determines which things go together
c) segments a heterogeneous population of records into a number of more homogeneous subgroups
d) analyzes unstructured data to find trends and patterns in words and senternces
The data warehouse enables business users typically managers, to be
more effective in all these ways EXCEPT:
a) grouping similar products together
b) developing customer profiles
c) identifying financial issues
d) understanding competitors
Data governance refers to the overall management of the availability usability, integrity, and security of company data.
A data dictionary:
a. compiles all of the metadata about the element in the data model
b. stores information in the form of logically related two-dimensional tables
c. allows users to create read, update, and delete data in a relational database
d. none of the above
A physical view of information focuses on how individual users
logically access information to meet their own particular business
Another word for information cleansing is:
The three common forms for mining structured and unstructured data
a. cluster analysis statistical analysis, and multidimensional analysis
b. data analysis, cluster analysis, and configuration analysis
c. cluster analysis, association detection, and statistical analysis
d. cluster analysis, alignment analysis, and statistical analysis
Explain the difference between transactional and analytical information.
Transactional information encompasses all of the information contained within a single business process or unit of work, and its primary purpose is to support daily operational tasks. Analytical information encompasses all organizational information, and its purpose is to support the performing of managerial analysis tasks. Analytical information is useful when making important decisions such as whether a company should build a new plant of hire more sales personnel.
What is a major pitfall of real-time information?
a. It is constantly changing.
b. It is often inaccurate.
c. It is often incomplete.
d. All of the above
Which of the following is NOT a serious consequence that businesses
face when using low-quality information?
a. Difficulty tracking revenue because of inaccurate invoices.
b. Inability to build strong relationships with customers.
c. Difficulty identifying the business's most valuable customers
d. Lost revenue opportunities from marketing to valuable customers.
Name two defined policies that a company that supports a data governance program specifies.
2 of these:
-who is accountable for various aspects of the data
-the process concerning how to store, archive, back up, and secure the data
-procedures identifying accessibility levels for employees
Columns and fields such as the primary key and foreign key are
examples of what?
d. Data models
T/F The four aspects of data are data type data timelines, data quality, and data governance
A _____ key is one or more columns that can be used to identify a unique row in a table?
_____ keys are keys from a different table than the one in which they reside?
T/F Market basket analysis is the items people tend to buy separate
T/F A statistical analysis requires high quality data a sturdy DBMS, and a solid statistical skills
The four primary traits that help determine the value of information
a. Information type: transactional and analytical
b. Information timeliness
c. Information quality
d. Information governance
e. All of the above
Of the five common characteristics of high quality information which
of the following has to do with each transaction and event only being
represented once in the information?
Which of the following is not true?
a. Metadata provides details about data.
b. A data element is the largest or most broadest unit of information.
c. Data models are logical data structures that detail the relationships among data elements using graphics or pictures.
d. A data dictionary is a compilation of all the metadata about the data elements in the data model.
List some examples of serious business consequences that can occur due to using low quality information to make decisions.
-Inability to accurately track customers.
-Difficulty identifying the organization's most valuable customers
-Inability to identify selling opportunities
-Lost revenue opportunities from marketing to nonexistent customers
-The cost of sending undeliverable mail
-Difficulty tracking revenue because of inaccurate invoices
-Inability to build strong relationships with customers
Which of the common forms for mining structures and unstructured data
is a technique used to divide information sets into mutually exclusive
groups where members of each group are closely grouped together while
the different groups are as far apart as possible?
a. Association detection
b. Statistical Analysis
c. Cluster Analysis
d. Separation Analysis
What are the four aspects of data?
Type, timeliness, quality, and governance
What are the two types of data?
Transactional and analytical
Which of the following is NOT a characteristic of Data quality?
E. All are characteristics of Data quality.
A field that or group of fields that uniquely identifies a given
record in a table is __________.
A. Foreign Key
B. Primary Key
Which of the following is the most inclusive?
A. Data mart
B. Data field
C. Data record
D. Data warehouse
What are five common characteristics of high-quality information? Name two or three
Accurate, complete, consistent, timely, unique
What is a primary key of one table that appears as an attribute in
another table and acts to provide a logical relationship between the
a. primary key
b. foreign key
What are some data-driven website advantages?
- Easy to manage content
- Easy to store large amounts of data
- Easy to eliminate human errors
What is the primary purpose of a data warehouse?
a. combine information
b. store information
c. analyze trends
Association detection reveals the relationship between variables along with the nature and frequency of the relationships.
Data governance refers to the overall management of the availablility usability, integrity, and security of company data.
________ is a process that weeds out and fixes or discards
inconsistent incorrect, or incomplete information.
B. Data redundancy
C. Information cleansing or scrubbing
D. Data quality audits
What is data mining?
The process of analyzing data to extract information not offered by the raw data alone.
_______ are predictions based on time-series information.
A. Association detection
B. Market basket analysis
D. Association detection
Cluster analysis is a technique used to divide information sets into
mutually exclusive groups such that the members of each group are as
close together as possible to one another and the different groups are
as far apart as possible.