and All Correct Answers 2025-2026
Updated.
Which of the following is NOT included when using "summary ()" on a dataset?
a. Mean
b. Median
c. Mode
d. 1st Quartile
e. 3rd Quartile - Answer c. Mode
What is the difference between head(boston.housing.df) and boston.housing.df[1:6,] - Answer
These are two ways of getting the same output.
True or False.
Predictive Analytics, as opposed to Statistics, is involved with macro-decisioning when dealing
with data-driven problems. - Answer False
What would the function boston.housing.df[1:3, 10] return? - Answer The first three rows
and the tenth column of the boston.housing.df data frame
Match the following situations with the methods we should use (supervised or unsupervised).
Chase bank tries to decide whether to give loan to a company based on its credit and
transaction history. - Answer supervised
Match the following situations with the methods we should use (supervised or unsupervised).
Kroger provides product coupon to the customers who purchase the bundles of shampoo and
body wash together. - Answer unsupervised
Match the following situations with the methods we should use (supervised or unsupervised).
, Amazon provides purchase suggestions by stating that "customers like you also want the
following products." - Answer unsupervised
Match the following situations with the methods we should use (supervised or unsupervised).
Liberty Mutual designed a new insurance product and want to know the future sales and
profitability of it. - Answer supervised
Which of the following counts the number of missing values of the "RM" variable in a dataset?
A. count(is.na(dataset$RM)
B. sum(is.na(dataset$RM))
C. sum(is.na(NA))
D. count(dataset$RM$NA) - Answer B. sum(is.na(dataset$RM))
Which of the following is NOT a type of data mining?
a. Prediction
b. Classification
c. Machine learning
d. Association rules - Answer c. Machine learning
Fill in the code to generate 5 normal random variables with variance 16 and mean 3.
rnorm(__________, ______________, ____________) - Answer n=5, mean=3, sd=4
If I wanted to display rows 1-10, and columns 2,6,8,9,10 from the Boston housing data set -
which of the following R scripts would give me these specified rows and columns? Select all
options that apply.
A. boston.housing.df[1:10, c(2, 6, 8:10)]
B. boston.housing.df[1:10, (2,6,8,9,10)]
C. boston.housing.df[(1:10), (2,6,8,9,10)]
D. boston.housing.df[c(1:10), c(2,6, 8, 9, 10)]
E. boston.housing.df[c(1:10), c =2,6,8,9,10]
F. boston.housing.df[1:10, c(2,6, 8, 9, 10)] - Answer A. boston.housing.df[1:10, c(2, 6, 8:10)]
and