SOLUTIONS 2026 #11
Example of discrete, qualitative, ordinal attributes - correct answer Bronze, Silver, Gold
medals as awarded at the Olympics
Text files are easier to inspect than binary files by loading the file or viewing it with a text
editor - correct answer True
Structured data - correct answer Data that data mining algorithms use; it can be classified as
categorical or numeric
The data scientist can use programming languages such as Python and R to visualize
the data - correct answer True
Noise objects are always outliers - correct answer False (random distortion can produce an object or value that looks much like a normal one)
Business understanding phase in CRISP-DM - correct answer The data scientist first
identifies the business problem and business objectives
Predictive analytics - correct answer Seeks to determine what is likely to happen in the
future
Data processing - correct answer Data discretization and cleaning are part of data
preprocessing
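A minimal sketch of the two preprocessing steps named above, cleaning and discretization. The ages, the choice to drop missing values, and the bin edges are all illustrative assumptions, not part of the original answer.

```python
# Hypothetical raw ages; None marks a missing value.
raw_ages = [23, None, 37, 61, 45, None, 18]

# Cleaning: drop missing values (one of several possible strategies).
clean_ages = [a for a in raw_ages if a is not None]


def discretize(age):
    """Map a numeric age to an ordinal bin (bin edges are illustrative)."""
    if age < 30:
        return "young"
    if age < 60:
        return "middle"
    return "senior"


# Discretization: replace each numeric value with its bin label.
age_bins = [discretize(a) for a in clean_ages]
print(age_bins)
```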
Foundation of Relational Databases - correct answer A collection of Excel-type data
tables with rows as data lines and columns as attributes
OLAP - correct answer Is ideal for long-term decision making
NoSQL - correct answer Document databases
Business rules help set up the __ - correct answer Conceptual data model
Data Mining - correct answer The process of extracting usable data from a larger set of
any raw data in order to identify patterns
Unsupervised Learning - correct answer Using unlabeled data, allows a model to
discover patterns and information that was previously undetected
Types of Machine Learning algorithms - correct answer Clustering
Classification
Regression
Association
Market Basket Analysis Example - correct answer Results in a retailer deciding to
locate 6-packs of beer at the end of the infant diapers aisle
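A decision like the beer-and-diapers placement is usually backed by association measures such as support and confidence. The sketch below computes both for a diapers -> beer rule; the baskets and the helper names `support` and `confidence` are made up for illustration.

```python
# Hypothetical transactions; each set is one shopping basket.
baskets = [
    {"diapers", "beer", "milk"},
    {"diapers", "beer"},
    {"diapers", "bread"},
    {"beer", "chips"},
    {"diapers", "beer", "chips"},
]


def support(itemset):
    """Fraction of baskets that contain every item in itemset."""
    hits = sum(1 for b in baskets if itemset <= b)
    return hits / len(baskets)


def confidence(antecedent, consequent):
    """Estimated P(consequent | antecedent) from the baskets."""
    return support(antecedent | consequent) / support(antecedent)


rule_support = support({"diapers", "beer"})
rule_confidence = confidence({"diapers"}, {"beer"})
print(rule_support, rule_confidence)
```

A high confidence for diapers -> beer suggests that shoppers who buy diapers often buy beer too, which is the kind of pattern that could motivate placing the two products near each other.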
Sampling Frame - correct answer A list of all units in a target population
Construct Validity - correct answer The extent to which a measurement matches its associated
construct; a mismatch between the two indicates poor construct validity
Error - correct answer Deviations in the sample or methods from the true measures of
the population
Cochran (1961) on Stratified Sampling - correct answer Adding strata can decrease
sampling variance, but there are diminishing returns for every stratum added to the
design. It is generally better to use fewer strata
Stratified Sampling - correct answer Minimize design effects
Cluster Sampling - correct answer Maximize design effects, because of the risk of
homogeneity within the clusters
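The link between within-cluster homogeneity and design effects can be sketched with the common approximation deff = 1 + (m - 1) * rho, where m is the cluster size and rho is the intraclass correlation. This formula is not stated in the original answer; it assumes equal-sized clusters, and the function names are illustrative.

```python
def design_effect(cluster_size, icc):
    """Approximate design effect for equal-sized clusters.

    cluster_size: number of sampled units per cluster (m)
    icc: intraclass correlation (rho), i.e. within-cluster homogeneity
    """
    return 1 + (cluster_size - 1) * icc


def effective_sample_size(n, cluster_size, icc):
    """Size of a simple random sample with roughly the same precision."""
    return n / design_effect(cluster_size, icc)


# Assumed example values: 1000 respondents, 20 per cluster, rho = 0.05.
print(design_effect(20, 0.05))
print(effective_sample_size(1000, 20, 0.05))
```

With rho = 0 the design effect is 1 and clustering costs nothing; as homogeneity grows, the same number of respondents carries less information.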
Estimated Sample Statistics - correct answer Previous studies, statistics from a pilot study,
educated best guess
Non-Response Error - correct answer The degree to which statistics are off because
respondents can't be reached
Measurement Error - correct answer The degree to which a survey statistic differs from
the "true" value due to the way the statistic is collected
Sampling Error - correct answer An estimate of how much a survey statistic differs from
the "true" statistic because of the sample that was selected
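As a rough illustration of how sampling error is estimated, the sketch below computes the standard error and a 95% margin of error for a sample proportion. It assumes simple random sampling and a normal approximation (z = 1.96); neither assumption comes from the original answer, and the function names are illustrative.

```python
import math


def standard_error(p, n):
    """Standard error of a sample proportion p from a simple random sample of size n."""
    return math.sqrt(p * (1 - p) / n)


def margin_of_error(p, n, z=1.96):
    """Margin of error; z = 1.96 corresponds to roughly 95% confidence."""
    return z * standard_error(p, n)


# Assumed example: 50% of 100 respondents answer "yes".
print(standard_error(0.5, 100))
print(margin_of_error(0.5, 100))
```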
Coverage error - correct answer The degree to which statistics are off because the
sample doesn't properly represent the underlying population
Basic elements of a properly done survey sample - correct answer Randomly selected
elements from a list
A list of the units or elements of the population
A method to assure that key elements of the population are represented in the sample
The Kish Method - correct answer Method for randomly selecting which household
member to sample in a household with multiple members
Starts with a single question about the number of eligible persons in the household so they
can be listed
Rare population - correct answer Less than 3% of a total population