QBA Final Exam Study Guide
________ are used in the pharmaceutical industry to assess the risk of introducing a
new drug - ANS-Simulations
_________ analytics are techniques that use models, constructed from past data, to
predict the future or to ascertain the impact of one variable on another. - ANS-Predictive
_________ attempts to classify a categorical outcome as a linear function of
explanatory variables. - ANS-Logistic regression
__________ is a category of data mining techniques in which an algorithm learns how
to classify or estimate an outcome variable of interest. - ANS-Supervised Learning
__________ is a method of calculating dissimilarity between clusters by calculating the
distance between the centroids of the two clusters. - ANS-Centroid Linkage
__________ is a statistical procedure used to develop an equation showing how two
variables are related. - ANS-Regression analysis
__________ is dividing the sample data into three sets for training, validation, and
testing of the data mining algorithm performance. - ANS-Data Partitioning
__________ is the data set used to build the candidate models. - ANS-Training set
__________ refers to the scenario in which the relationship between the dependent
variable and one independent variable is different at different values of a second
independent variable. - ANS-Interaction
______Is a measure of the heterogeneity of observations in a classification tree - ANS-
Impurity
______refers to the scenario in which the analyst builds a model that does a great job of
explaining the sample of data on which it is based but fails to accurately predict outside
the sample data. - ANS-Overfitting
A __________ refers to a constraint that can be expressed as an equality at the optimal
solution. - ANS-Binding constraint
A _______refers to a model input that can be controlled in a spreadsheet model - ANS-
Decision variable
, A better understanding of consumer behavior through analytics leads to - ANS-Better
pricing strategies
A characteristic or quantity of interest that can take on different values - ANS-Variable
A chart that is recommended as an alternative to a pie chart is a - ANS-Bar chart
A cluster's __________ can be measured by the difference between the distance value
at which a cluster is originally formed and the distance value at which it is merged with
another cluster in a dendrogram. - ANS-Durability
A disadvantage of stacked - column charts and stacked- bar charts is that - ANS-it can
be difficult to perceive small differences in areas.
A forecast that helps direct police officers to areas where crimes are likely to occur
based on past data is an example of - ANS-Predictive Analytics
A light bulb manufacturer uses descriptive analytics - ANS-To present supply chain to
managers visually
A line chart that has no axes but is used to provide information on overall trends for time
series data is called a - ANS-Sparkline
A one-tailed test is a hypothesis test in which the rejection region is - ANS-In one tail of
the sampling distribution
A procedure for using sample data to find the estimated regression equation is - ANS-
The least squares method
A set of values for the random variables is called a(n) - ANS-trial
A simple random sample of 31 observations was taken from a large population. The
sample mean equals 5. Five is a - ANS-Point estimate
A test set is the data set used to - ANS-estimate performance of the final model on
unseen data.
A tree diagram used to illustrate the sequence of nested clusters produced by
hierarchical clustering is known as a - ANS-Dendrogram
A________decision is concerned with how the organization should achieve the goals
and objectives set by its strategy - ANS-Tactical
A_____classifies a categorical outcome variable by splitting observations into groups
via a sequence of hierarchical rules - ANS-Classification tree
________ are used in the pharmaceutical industry to assess the risk of introducing a
new drug - ANS-Simulations
_________ analytics are techniques that use models, constructed from past data, to
predict the future or to ascertain the impact of one variable on another. - ANS-Predictive
_________ attempts to classify a categorical outcome as a linear function of
explanatory variables. - ANS-Logistic regression
__________ is a category of data mining techniques in which an algorithm learns how
to classify or estimate an outcome variable of interest. - ANS-Supervised Learning
__________ is a method of calculating dissimilarity between clusters by calculating the
distance between the centroids of the two clusters. - ANS-Centroid Linkage
__________ is a statistical procedure used to develop an equation showing how two
variables are related. - ANS-Regression analysis
__________ is dividing the sample data into three sets for training, validation, and
testing of the data mining algorithm performance. - ANS-Data Partitioning
__________ is the data set used to build the candidate models. - ANS-Training set
__________ refers to the scenario in which the relationship between the dependent
variable and one independent variable is different at different values of a second
independent variable. - ANS-Interaction
______Is a measure of the heterogeneity of observations in a classification tree - ANS-
Impurity
______refers to the scenario in which the analyst builds a model that does a great job of
explaining the sample of data on which it is based but fails to accurately predict outside
the sample data. - ANS-Overfitting
A __________ refers to a constraint that can be expressed as an equality at the optimal
solution. - ANS-Binding constraint
A _______refers to a model input that can be controlled in a spreadsheet model - ANS-
Decision variable
, A better understanding of consumer behavior through analytics leads to - ANS-Better
pricing strategies
A characteristic or quantity of interest that can take on different values - ANS-Variable
A chart that is recommended as an alternative to a pie chart is a - ANS-Bar chart
A cluster's __________ can be measured by the difference between the distance value
at which a cluster is originally formed and the distance value at which it is merged with
another cluster in a dendrogram. - ANS-Durability
A disadvantage of stacked - column charts and stacked- bar charts is that - ANS-it can
be difficult to perceive small differences in areas.
A forecast that helps direct police officers to areas where crimes are likely to occur
based on past data is an example of - ANS-Predictive Analytics
A light bulb manufacturer uses descriptive analytics - ANS-To present supply chain to
managers visually
A line chart that has no axes but is used to provide information on overall trends for time
series data is called a - ANS-Sparkline
A one-tailed test is a hypothesis test in which the rejection region is - ANS-In one tail of
the sampling distribution
A procedure for using sample data to find the estimated regression equation is - ANS-
The least squares method
A set of values for the random variables is called a(n) - ANS-trial
A simple random sample of 31 observations was taken from a large population. The
sample mean equals 5. Five is a - ANS-Point estimate
A test set is the data set used to - ANS-estimate performance of the final model on
unseen data.
A tree diagram used to illustrate the sequence of nested clusters produced by
hierarchical clustering is known as a - ANS-Dendrogram
A________decision is concerned with how the organization should achieve the goals
and objectives set by its strategy - ANS-Tactical
A_____classifies a categorical outcome variable by splitting observations into groups
via a sequence of hierarchical rules - ANS-Classification tree