True or False: Data Mining can be said to be a process designed to detect patterns in data sets.
correct answers True
True or False: In unsupervised learning, the learning algorithm must be trained using data
attributes that have been paired with an outcome variable. correct answers False
True or False: Unsupervised learning involves building a statistical model for predicting, or
estimating an output based upon one or more inputs. correct answers False
Regression analysis involves developing a model where one or more inputs are used to predict an
output variable. Regression, in this context, represents what kind of learning? correct answers
Supervised learning
Assume that we have a data set that includes sales data for every customer over the course of
several years and we want to use this data to predict future sales. Which technique would be the
most appropriate to investigate? correct answers Regression
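As a rough illustration of why regression fits this question, the sketch below fits a straight line to past sales and extrapolates one year ahead. The yearly totals and the name `fit_line` are hypothetical, made up for this example, and a real forecast would likely need more inputs than the year alone.

```python
# Hypothetical yearly sales totals for one customer (illustrative data only).
years = [2019, 2020, 2021, 2022, 2023]
sales = [100.0, 120.0, 140.0, 160.0, 180.0]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(years, sales)

# The output variable (sales) is numeric, which is what makes this regression
# rather than classification.
forecast_2024 = slope * 2024 + intercept
print(round(forecast_2024, 2))
```

The model is trained on inputs paired with a known outcome (past sales), which is why regression counts as supervised learning.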
Assume that you had a variety of data, including medical history, diet, and hereditary factors, on
individuals who developed cancer and you wanted to use this data to determine whether a person
is likely to develop cancer. Which technique would be the most promising to start with? correct
answers Classification
Which of the following is an example of an unsupervised learning algorithm? correct answers
K-Means
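To see why K-Means counts as unsupervised, note that the minimal sketch below never receives an outcome label: it only groups points by distance. The sample points, the starting centroids, and the helper names (`dist2`, `mean_point`, `kmeans`) are assumptions chosen for illustration, and production implementations usually pick starting centroids more carefully.

```python
def dist2(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def kmeans(points, centroids, iterations=10):
    """Plain K-Means: alternate assignment and centroid update steps."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist2(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [mean_point(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


# Hypothetical unlabeled data: two visually separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.6),
          (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, centroids=[points[0], points[3]])
print(clusters)
```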
True or False: A prediction outcome variable must be categorical. correct answers False
Which of the following is NOT a machine learning technique? correct answers Linear
Components Analytics
True or False: In a supervised learning model, Bias refers to the error that is introduced from the
assumptions of the data analyst. correct answers False
The objective of ________ is to identify valid, novel, potentially useful, and understandable
correlations and patterns in existing data. correct answers data mining
Which of the following is an example of a NoSQL Analytics database? correct answers
Cassandra
What does ETL stand for? correct answers Extract, Transform, Load
True or False: In a data warehouse, unidimensional data is stored in a star schema format. correct
answers False
The term OLAP stands for? correct answers Online Analytical Processing
A database where all of the values for a particular column are stored contiguously is called?
correct answers Column-oriented storage
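The difference between row-oriented and column-oriented layouts can be sketched with plain Python lists. This is only an in-memory analogy with hypothetical sample rows, not how any particular database engine lays out data on disk.

```python
# Row-oriented layout: each record keeps all of its fields together.
rows = [
    {"id": 1, "region": "east", "amount": 10.0},
    {"id": 2, "region": "west", "amount": 25.0},
    {"id": 3, "region": "east", "amount": 7.5},
]

# Column-oriented layout: all values of one column sit next to each other.
columns = {name: [row[name] for row in rows] for name in rows[0]}

# An aggregate over a single column only has to touch that column's values,
# which is why analytical queries tend to favor this layout.
total_amount = sum(columns["amount"])
print(columns["amount"], total_amount)
```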
True or False: The snowflake schema differs from the star schema in that the tables holding the
dimensional data are normalized. correct answers True
True or False: Map/Reduce refers to an optimized approach to processing SQL queries. correct
answers False
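Map/Reduce is a general programming model for processing large data sets in parallel, not an SQL optimization. The single-process word count below is a minimal sketch of its three stages. The function names and sample documents are hypothetical, and a real framework would spread the map and reduce calls across many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


# Hypothetical input split into independent documents.
documents = ["data mining finds patterns", "mining data at scale"]

mapped = [pair for doc in documents for pair in map_phase(doc)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
print(counts)
```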
True or False: Information Retrieval or text analytics is NOT a form of data mining. correct
answers False