Exam (elaborations)

DSCI 4520 - Data Mining Exam I Review Questions and Answers Solved Correctly

Pages: 2
Grade: A+
Uploaded on: 14-06-2025
Written in: 2024/2025

Institution: DSCI 4520
Course: DSCI 4520


Dimension - Answers ________ refers to the number of input variables (degrees of freedom).

Over-fitting - Answers ________ is caused by an overly complex model that is too flexible; it can also
refer to a situation in which there are too many variables in your model.

"Which split is best?" - Answers ___________ refers to the Splitting Criteria method.

Splitting Criteria methods - Answers The ___________ that are most widely used are based on the Pearson
Chi-Square test, the Gini index, and entropy; all three measure the difference in class distributions
across the child nodes and usually end with similar results.
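
As a rough illustration of how two of these criteria score a split, the sketch below computes the Gini index and entropy of a parent node and of its child nodes, then reports the reduction in impurity. The class counts and the names `gini`, `entropy`, and `impurity_reduction` are made up for this example; they do not come from the course material or from any particular software package.

```python
import math

def gini(counts):
    """Gini index of a node, given its class counts."""
    n = sum(counts)
    return 1.0 - sum((c / n) ** 2 for c in counts)

def entropy(counts):
    """Entropy (in bits) of a node, given its class counts."""
    n = sum(counts)
    return -sum((c / n) * math.log2(c / n) for c in counts if c > 0)

def impurity_reduction(parent, children, impurity):
    """Parent impurity minus the size-weighted impurity of the child nodes."""
    n = sum(parent)
    weighted = sum(sum(child) / n * impurity(child) for child in children)
    return impurity(parent) - weighted

# Hypothetical counts in the form [class 0, class 1], invented for illustration.
parent = [50, 50]
split_a = [[40, 10], [10, 40]]  # child nodes differ strongly in class mix
split_b = [[30, 20], [20, 30]]  # child nodes differ only slightly

gini_a = impurity_reduction(parent, split_a, gini)
gini_b = impurity_reduction(parent, split_b, gini)
entropy_a = impurity_reduction(parent, split_a, entropy)
entropy_b = impurity_reduction(parent, split_b, entropy)

print(f"Gini reduction:    A={gini_a:.3f}  B={gini_b:.3f}")
print(f"Entropy reduction: A={entropy_a:.3f}  B={entropy_b:.3f}")
```

On these counts both measures rank split A above split B, which matches the statement that the criteria usually end with similar results.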

Pure child node - Answers If a split ends in a __________, the split is undisputedly best.

Parent node - Answers If the expected target is the same in the child nodes as in the ____________ then
the split is WORTHLESS.
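
The two extremes can be checked with a small Pearson chi-square calculation. This is a minimal sketch on invented counts: `pearson_chi_square` is a hypothetical helper written for this example, and it assumes every child node and every target class has at least one observation, so that no expected count is zero.

```python
def pearson_chi_square(children):
    """Pearson chi-square statistic for a table with one row per child node
    and one column per target class."""
    row_totals = [sum(row) for row in children]
    col_totals = [sum(col) for col in zip(*children)]
    n = sum(row_totals)
    stat = 0.0
    for row, row_total in zip(children, row_totals):
        for observed, col_total in zip(row, col_totals):
            # Count expected if the child had the same class mix as the parent.
            expected = row_total * col_total / n
            stat += (observed - expected) ** 2 / expected
    return stat

# Hypothetical counts in the form [class 0, class 1], invented for illustration.
pure_split = [[10, 0], [0, 10]]      # each child node holds a single class
worthless_split = [[5, 5], [5, 5]]   # each child node mirrors the parent

chi_pure = pearson_chi_square(pure_split)
chi_worthless = pearson_chi_square(worthless_split)

print(chi_pure)       # 20.0
print(chi_worthless)  # 0.0
```

The pure split reaches 20, which equals the sample size and is the largest value a 2x2 table of 20 cases can produce. The split whose child nodes repeat the parent's class mix scores exactly zero.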

Pruning - Answers _______ creates a sequence of trees of increasing complexity.

Assessment criterion - Answers _______ is needed for deciding the best (sub) tree.

Benefits of trees - Answers _______ include interpretability and the ability to handle mixed measurement
scales and missing values.

Interpretability - Answers _______ is a tree-structured presentation.

Binary outcome - Answers We CANNOT predict a __________ with linear regression, because its fitted values are not limited to 0 and 1.

Binary - Answers When a response variable is ______, its values in the data set (from past information) are zeroes and ones.

Simple Linear Regression - Answers A model with one predictor variable is called __________.

Residual - Answers The _______ is the unexplained part of Y in the Simple Linear Regression formula.
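
To make the residual concrete, here is a minimal sketch that fits a one-predictor model by ordinary least squares and then subtracts the fitted values from the observed Y. The data points and the function name `fit_simple_linear_regression` are invented for this example.

```python
def fit_simple_linear_regression(x, y):
    """Least-squares intercept and slope for a model with one predictor."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Hypothetical data, invented for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

intercept, slope = fit_simple_linear_regression(x, y)
fitted = [intercept + slope * xi for xi in x]

# Residual = observed Y minus fitted Y, the part the line does not explain.
residuals = [yi - fi for yi, fi in zip(y, fitted)]

print(f"intercept={intercept:.3f}, slope={slope:.3f}")
print([round(r, 3) for r in residuals])
```

Because the model includes an intercept, the residuals sum to zero up to rounding error.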

Dummy - Answers Indicator variables are also known as _________ variables.

Indicator variables - Answers _________ allow for the inclusion of qualitative variables in the model.
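
One common way to build indicator variables is sketched below: a qualitative variable with k levels becomes k - 1 columns of zeroes and ones, with one level held back as the reference. The reference-level convention, the example values, and the function name `dummy_code` are assumptions made for this illustration, not details taken from the course material.

```python
def dummy_code(values, reference):
    """Turn a qualitative variable into 0/1 indicator columns.

    One column is created per level other than the reference level,
    so a variable with k levels yields k - 1 columns.
    """
    levels = sorted(set(values) - {reference})
    rows = [[1 if value == level else 0 for level in levels] for value in values]
    return levels, rows

# Hypothetical qualitative input, invented for illustration.
colors = ["red", "green", "blue", "green"]
levels, rows = dummy_code(colors, reference="blue")

print(levels)  # ['green', 'red']
for color, row in zip(colors, rows):
    print(color, row)
```

A row of all zeroes identifies the reference level, which is why only k - 1 columns are needed.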

Model - Answers In Logistic Regression, the _______ is trying to predict the probability of an event.

Logistic Regression assumption - Answers The idea that "the logit transformation of the probabilities of
the target value results in a linear relationship with the input variables" is called __________.
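
The assumption can be illustrated numerically. In the sketch below the intercept and slope are made-up values for a single input. The predicted probabilities follow an S-shaped curve, but their logits rise by the same amount for every equal step in the input, which is the linear relationship the assumption describes.

```python
import math

def logit(p):
    """Log-odds of a probability p, for 0 < p < 1."""
    return math.log(p / (1.0 - p))

def inverse_logit(z):
    """Map a value on the logit scale back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical coefficients for one input variable, invented for illustration.
intercept, slope = -3.0, 0.5

xs = [0, 2, 4, 6, 8, 10]
probabilities = [inverse_logit(intercept + slope * x) for x in xs]
logits = [logit(p) for p in probabilities]

# Equal steps in x give equal steps on the logit scale.
steps = [b - a for a, b in zip(logits, logits[1:])]

for x, p, z in zip(xs, probabilities, logits):
    print(f"x={x:2d}  probability={p:.3f}  logit={z:+.2f}")
```

Each step of 2 in the input moves the logit by 2 * 0.5 = 1, while the probabilities stay strictly between 0 and 1.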

Logistic Regression - Answers In __________, the target is a discrete (binary or ordinal) variable.

Input variables - Answers In both Logistic and Linear Regression, the _________ can have any
measurement level.

