Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Exam (elaborations)

DATA DRIVEN DECISION MAKING FINAL PRACTICE SOLUTION 2026 VIEW AHEAD TESTED SET

Rating
-
Sold
-
Pages
16
Grade
A+
Uploaded on
20-05-2026
Written in
2025/2026

DATA DRIVEN DECISION MAKING FINAL PRACTICE SOLUTION 2026 VIEW AHEAD TESTED SET

Institution
DATA
Course
DATA

Content preview

DATA MINING AND STAT LEARN STUDY GUIDE
2026 COMPREHENSIVE ANSWERS ALREADY
PASSED

◉ SVM Pros/Cons. Answer: Pros: It works really well with a clear
margin of separation
It is effective in high dimensional spaces.
It is effective in cases where the number of dimensions is greater
than the number of samples.
It uses a subset of training points in the decision function (called
support vectors), so it is also memory efficient.
Cons: Not good for very large data sets
Not good for when the data set has more noise i.e. target classes are
overlapping
Doesn't directly provide probability estimates.


◉ K-nearest neighbor (K-NN). Answer: An unsupervised
classification algorithm. Looks at the X number of closest points to
the new one and classifies as whichever is most common.


◉ K-nearest neighbor (K-NN) Pros/Cons. Answer: Pros: No
assumptions about data
Easy to understand/Interpret

,Varsatile


Cons: Computationally expensive because algorithm stores all
training data
Sensitive to irrelevant features and scale of data


◉ k-fold cross validation. Answer: Validation Technique where data
is divided into X number of data subsets. Each subset is then used as
a for testing while the rest are used for training. The algorithm then
rotates through each subset and averages the results


◉ K Fold cross Validation Pros/Cons. Answer: Pros: Validates
Performance of model
Can create balance across predicted features classes
Cons: Doesn't work well with time series data
The aggregate scores of your model could miss some important
extreme values or overpower them so theyre harder to pick up on


◉ k-means clustering. Answer: Unsupervised learning heuristic that
sets x starts by assigning x number of cluster centers, then clusters
all data points into each of them based on distance. The center point
of each cluster is then calculated and all data points are again re
clustered. Repeat process until no-data points change clusters. Ideal
number of clusters can be identified via elbow diagram.

, ◉ k-means pros and cons. Answer: Pros: Simple to implement
Scales well to large data sets
Easily adaptable
Cons: Choosing K manually can bias it towards initial values
sensitive to outliers


◉ Grubbs Outlier Test. Answer: A formula that uses an outlier's
value, the mean of the data, and the standard deviation to determine
whether or not the data point is within the confidence interval for a
normal distribution or should be thrown out


◉ CUSUM. Answer: Change detection model that keeps a running
total of the amount that observations vary above the expected value.
The running total exceeds a preset threshold value, it indicates there
has been a change.


◉ CUSUM Pros/Cons. Answer: Pros: Best way to detect the small
shifts of process mean especially 0.5 to 2 SD from the target mean
Easy to identify visually the shifts in process mean
Cons: Cumbersome to establish and maintain
Tough to interpret the patterns.
Choosing C and T values is a pro and con as it can cause bias but
creates more flexibility

Written for

Institution
DATA
Course
DATA

Document information

Uploaded on
May 20, 2026
Number of pages
16
Written in
2025/2026
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

$10.99
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
alcorbgeneralstore Havard School
Follow You need to be logged in order to follow users or courses
Sold
16
Member since
4 months
Number of followers
0
Documents
13595
Last sold
3 days ago
ALCORB STORES

ALCORB STORES

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions