complete solution
Select the correct statement.
A methodology is an application for a computer program.
A methodology is a set of instructions.
A methodology is a system of methods used in a particular area of study or activity.
All of the above statements are correct.
A methodology is a system of methods used in a particular area of study or activity.
Select the correct statement.
The data science methodology described in this course is only used by certified data
scientists.
The data science methodology described in this course is outlined by John Rollins from
IBM.
The data science methodology described in this course is limited to IBM.
None of the above statements are correct.
The data science methodology described in this course is outlined by John Rollins from
IBM.
Select the correct statement.
The first stage of the data science methodology is data understanding.
The first stage of the data science methodology is modeling.
The first stage of the data science methodology is business understanding.
The first stage of the data science methodology is data collection.
The first stage of the data science methodology is business understanding.
Select the correct statement.
If a problem is a dish, then data is an answer.
If a problem is a dish, then data is an ingredient.
If a problem is a dish, then data is a list of information.
None of the above statements are correct.
If a problem is a dish, then data is an ingredient.
Select the correct statement.
A data requirement is never refined.
A data requirement is set in stone.
A data requirement is the initial set of ingredients.
None of the above statements are correct.
A data requirement is the initial set of ingredients.
Select the correct statement.
Data scientists determine how to prepare the data.
Data scientists identify the data that is required for data modeling.
Data scientists determine how to collect the data.
All of the above.
All of the above.
Select the correct statement about data preparation.
Data preparation involves properly formatting the data.
Data preparation involves correcting invalid values and addressing outliers.
Data preparation involves removing duplicate data.
Data preparation involves addressing missing values.
All of the above statements are correct.
All of the above statements are correct.
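The four preparation tasks named in this question can be illustrated with a minimal sketch. The records, the field names (`id`, `age`), the 0 to 120 validity range, and the choice of median imputation are all assumptions made up for illustration; they are not taken from the course.

```python
from statistics import median

# Hypothetical raw records; field names and values are invented for this sketch.
raw = [
    {"id": "1", "age": "34"},
    {"id": "2", "age": ""},      # missing value
    {"id": "2", "age": ""},      # duplicate of the previous record
    {"id": "3", "age": "-5"},    # invalid value (negative age)
    {"id": "4", "age": " 41 "},  # needs formatting (stray whitespace)
]


def prepare(records):
    # Remove duplicate records while keeping first-seen order.
    seen, unique = set(), []
    for r in records:
        key = (r["id"], r["age"])
        if key not in seen:
            seen.add(key)
            unique.append(r)

    # Format the data: strip whitespace and convert text to integers.
    cleaned = []
    for r in unique:
        text = r["age"].strip()
        age = int(text) if text else None
        # Treat values outside an assumed valid range as missing.
        if age is not None and not (0 <= age <= 120):
            age = None
        cleaned.append({"id": int(r["id"]), "age": age})

    # Address missing values by imputing the median of the known ages
    # (one possible strategy among many).
    fill = median(r["age"] for r in cleaned if r["age"] is not None)
    for r in cleaned:
        if r["age"] is None:
            r["age"] = fill
    return cleaned


prepared = prepare(raw)
```

In practice the right handling of invalid and missing values depends on the problem; imputation is only one option.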
Select the correct statement about data understanding.
Data understanding encompasses removing redundant data.
Data understanding encompasses all activities related to constructing the dataset.
Data understanding encompasses sorting the data.
All of the above statements about data understanding are correct.
Data understanding encompasses all activities related to constructing the dataset.
Select the correct statement about what data scientists and database administrators
(DBAs) do during data preparation.
During data preparation, data scientists and DBAs identify missing data.
During data preparation, data scientists and DBAs determine the timing of events.
During data preparation, data scientists and DBAs aggregate the data and merge them
from different sources.
During data preparation, data scientists and DBAs define the variables to be used in the
model.
All of the above statements are correct.
All of the above statements are correct.
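Aggregating data and merging it from different sources might look like the sketch below. The two sources (`patients`, `admissions`) and their fields are hypothetical and chosen only to show the idea of rolling detail rows up to one row per entity before joining.

```python
from collections import defaultdict

# Hypothetical sources; names and values are invented for this sketch.
patients = [
    {"patient_id": 1, "age": 70},
    {"patient_id": 2, "age": 58},
]
admissions = [
    {"patient_id": 1, "days": 3},
    {"patient_id": 1, "days": 5},
    {"patient_id": 2, "days": 2},
]

# Aggregate: roll the admission rows up to one summary per patient.
total_days = defaultdict(int)
visits = defaultdict(int)
for a in admissions:
    total_days[a["patient_id"]] += a["days"]
    visits[a["patient_id"]] += 1

# Merge: attach the aggregated values to each patient record.
merged = [
    {
        **p,
        "visits": visits[p["patient_id"]],
        "total_days": total_days[p["patient_id"]],
    }
    for p in patients
]
```

The resulting `visits` and `total_days` fields are examples of variables that could then be defined for use in a model.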
Select the correct statement.
A training set is used for data visualization.
A training set is used for predictive modeling.
A training set is used for statistical analysis.
A training set is used for descriptive modeling.
None of the above statements are correct.
A training set is used for predictive modeling.
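A training set is typically carved out of the available historical data. The sketch below shows one common way to do that, a shuffled split into a training portion and a held-out portion. The function name `split`, the 80/20 ratio, and the fixed seed are assumptions for illustration, not something the course prescribes.

```python
import random


def split(records, train_fraction=0.8, seed=0):
    """Shuffle a copy of the records and split it into (train, test)."""
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


records = list(range(10))  # stand-in for rows with known outcomes
train, test = split(records)
```

A predictive model would be fitted on `train`, and `test` would be kept aside to check how well it generalizes.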
A statistician calls a false negative a type I error and a false positive a type II error.
True
False
False
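The statement is false because the labels are swapped: in standard statistical usage a false positive is a type I error and a false negative is a type II error. A small sketch that counts both from hypothetical labels (the `actual` and `predicted` lists are invented for illustration):

```python
def error_counts(actual, predicted):
    """Return (type I errors, type II errors) for boolean labels."""
    # Type I error: false positive (predicted True, actually False).
    type_1 = sum(1 for a, p in zip(actual, predicted) if not a and p)
    # Type II error: false negative (predicted False, actually True).
    type_2 = sum(1 for a, p in zip(actual, predicted) if a and not p)
    return type_1, type_2


actual = [True, True, False, False, True]
predicted = [True, False, True, False, False]
type_1, type_2 = error_counts(actual, predicted)
```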
Select the correct statement about model evaluation.
Model evaluation can include statistical significance testing.
Model evaluation includes ensuring that the data are properly handled and interpreted.