Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Exam (elaborations)

Data Analytics Journey

Rating
-
Sold
-
Pages
9
Grade
A+
Uploaded on
27-06-2024
Written in
2023/2024

Data Analytics Journey

Institution
Course

Content preview

Data Analytics Journey

Business Understanding
Planning, Discovery - ANS-Scope Project /Identify stakeholders and research
questions/KPIs Identify timeline, budget, and participants
problems - Lack of clear focus on stakeholders, timeline, limitations, and budget could
potentially derail an analysis

Data acquisition
Extraction, Data gathering, Data query, Data collection ETL (extract, transform, load)
Web scraping - ANS-Gather/collect data from a variety of sources, Provide structure to
data accessible via relational databases (SQL), Build data pipeline (ETL), Use of API to
download data from an external source
problems - Quality and type of data may make access more difficult

Data cleaning
Wrangling, Scrubbing, Munging - ANS-Fixing improperly formatted values, Dealing with
duplicates, missing data, and outliers, Data reduction
problems - Some cleaning techniques could dramatically change data/outcomes,
Outliers not dealt with can cause problems with statistical models due to excessive
variability.

Data exploration
Exploratory Data Analysis (EDA), Descriptive Statistics - ANS-Central Tendency/
Measures of center (e.g., mean, median, mode), variability (e.g., standard deviations
and quartiles) and distributions (e.g., normal, skewed, etc.), Identify basic correlations
between variables, Pattern discovery
problems - Skipping this step could enable faulty perceptions of the data which hurt
advanced analytics.

Predictive Modeling
Data Modeling, Correlation based models, Regression models, Time series -
ANS-Estimate/project future values or likelihood of an event. Extend correlations found
in EDA to mathematical models. Predict/determine output values based on input values.
Cross-validation of predictive models to ensure accuracy.
problems - Too many input variables (predictors) can cause problems. Correlation does
not imply causation. Time series models often need sufficient time data to offer precise

, trending. Predictive model accuracy should be assessed using cross-validation.
Data mining
Machine Learning, Deep Learning, AI (artificial intelligence), Supervised/ Unsupervised
Models - ANS-Creating training and testing datasets to build models from.
Identify/detect patterns. Determine if groups (clusters) exist in data. Classify data into
groups. Create models that "learn" and improve (e.g., machine/deep learning, AI, etc.)
problems - Running on entire data is problematic; need to subset data into training and
testing datasets to build models.

Reporting and visualization
Dashboards - ANS-Tell a story with data. Provide a summary of analytic analysis.
Provide insights to stakeholders. Create insightful graphs that showcase trends and
forecasts
problems - Due to potential large audience consumption, mistakes can cause bad
business decisions and loss of revenue. Improper scales used in graphs could push for
interpretations of the story that is inaccurate

Descriptive - ANS-Key focus: Observation
Main question: What happened?
Example: In a healthcare setting, an unusually high number of people are admitted to
the emergency room in a short period of time. Descriptive analytics tells you that this is
happening and provides real-time data with all the corresponding statistics (date of
occurrence, volume, patient details, etc.).

Diagnostic - ANS-Key focus: Explained reason
Main question: Why did it happen?
Example: In the healthcare example mentioned earlier, diagnostic analytics would
explore the data and make correlations. For instance, it may help you determine that all
of the patients' symptoms — high fever, dry cough, and fatigue — point to the same
infectious agent. You now have an explanation for the sudden spike in volume at the
ER.

Predictive - ANS-Key focus: Correlation
Main question: What will happen in the future?
Example: Back in our hospital example, predictive analytics may forecast a surge in
patients admitted to the ER in the next several weeks. Based on patterns in the data,
the illness is spreading at a rapid rate.

Prescriptive - ANS-Key focus: Causal/manipulate

Written for

Course

Document information

Uploaded on
June 27, 2024
Number of pages
9
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

$8.49
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
scholartutor Chamberlain College Of Nursing
Follow You need to be logged in order to follow users or courses
Sold
2770
Member since
1 year
Number of followers
3
Documents
10727
Last sold
1 day ago

4.8

923 reviews

5
813
4
79
3
20
2
7
1
4

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions