ANSWERS ALL PASSED FOR SUCCESS 2024 UPDATE
Define: Data Exploration
The analyst begins to understand the basic nature of data and the relationships within it
What does data exploration rely on? (2)
Use of data visualization tools and numerical summaries
Define: Predictive Modeling
Allow the analyst to move beyond describing the data to creating models that enable
predictions of outcomes of interest
What tools became popular with the ability of computers to look for patterns in
large amounts of data?
Data Mining Tools. Python, R
What is the last step of the data analytics journey?
Reporting and Visualization
What is the first step of the data analytics journey?
Discovery Phase / Business Understanding
What is the second step of the data analytics journey?
Data Acquisition / Data Collecting
What is the third step of the data analytics journey?
Data Cleaning / Data Wrangling / Feature Engineering / Data Munging
What is the fourth step of the data analytics journey?
Data Exploration
What is the fifth step of the data analytics journey?
Predictive Modeling
What tools are used for predictive modeling?
Python, R
What are the 7 steps of the data analytics journey?
Business Understanding (Discovery Phase)
Data Acquisition (Data Collecting)
Data Cleaning
Data Exploration
Predictive Modelling
Data Mining
Reporting & Visualization
Define: discovery phase
Analyst defines the major questions of interest that need to be answered, needs of the
stakeholders and assesses the resource constraints of the project
Define: data acquisition (data collecting)
Frequently retrieved from a database
What can an analyst do if data is not available?
use tools like web scraping or surveys to get it.
What happens if the data cleaning step is ignored?
The results from the analysis become irrelevant
, What tools are used for data cleansing?
SQL, Python, R, Excel
What is data quality measured in terms of? (2)
uniqueness, relevance
What is the sixth step of the data analytics journey?
Data Mining
What tools are used for reporting and visualization?
Tableau
What are the goals of the reporting and visualization phase?
To provide actionable insights for various stakeholders
Define: Reporting & Visualization
The analyst tells the story of the data and uses graphs or interactive dashboards to
inform others of the findings from the analyses
Data analysis is often seen as a combination of ______ and ______.
Value & Difficulty
What are the four types of analytics?
Descriptive
Diagnostic
Predictive
Prescriptive
What is the main question that descriptive analytics aims to answer?
What happened?
What is the main question that diagnostic analytics aims to answer?
Why did it happen?
What is the main question that predictive analytics aims to answer?
What will happen in the future?
What is the main question that prescriptive analytics aims to answer?
How can we make it happen? How can we change it? What action will achieve a
particular outcome?
T/F: Correlation does not imply Causation.
True
T/F: In data analytics, it is harder to show correctional results.
False. It is easier to show correctional results.
What does correlation imply?
That a relationship exists, but it stops short of proving causation.
A conclusion based on ____ is a common mistake on both ______ & -____
models.
A conclusion based on correlation is a common mistake on both predictive and data
mining models.
Define: Correlation
A relationship based on two variables.
T/F: According to correlation, when one variable changes, you know the degree in
which the other variable changes.
True
Define: Causation