Summary Data Mining Model Selection and Regularization

Pages: 18
Uploaded on: 12-08-2023
Written in: 2022/2023

This document contains a summary of the theory seen in the lab sessions. In addition, the document contains the solutions to the exercises of the lab sessions.


Model selection and regularization
is.na(): identifies missing observations (TRUE = missing, FALSE = non-missing); wrapping it in sum() counts all
missing elements




Salary is missing for 59 players
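As a minimal sketch of the counting step above, the following uses a small made-up data frame (the name `players` and all values are hypothetical, chosen only to mimic the Salary column of the lab data):

```r
# Toy data frame with hypothetical values, standing in for the players data
players <- data.frame(
  Hits   = c(81, 130, 141, 87, 169),
  CRBI   = c(414, 266, 838, 46, 336),
  Salary = c(NA, 475, 480, NA, 500)
)

# is.na() returns a logical vector: TRUE where Salary is missing
missing_salary <- is.na(players$Salary)
print(missing_salary)

# TRUE counts as 1 in sum(), so this is the number of missing salaries
n_missing <- sum(missing_salary)
print(n_missing)
```

In the lab data the same call is what yields the count of 59 players without a salary.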

na.omit(): removes all rows that have missing values in any variable
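A short sketch of this behaviour, again on made-up data (the data frame `players` and its values are hypothetical). Note that a row is dropped as soon as any one of its variables is missing:

```r
# Toy data frame with missing values in two different columns (hypothetical values)
players <- data.frame(
  Hits   = c(81, NA, 141, 87, 169),
  Salary = c(NA, 475, 480, NA, 500)
)

# na.omit() keeps only the rows that are complete in every variable
complete_players <- na.omit(players)

print(dim(players))                  # dimensions before removal
print(dim(complete_players))         # dimensions after removal
print(sum(is.na(complete_players)))  # no missing values should remain
```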




regsubsets() (from the leaps package): performs best subset selection by identifying the best model that contains a given
number of predictors, where best is quantified using RSS; the syntax is the same as for lm()
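To make "best is quantified using RSS" concrete, here is a hand-written sketch of the idea in base R. It is not how regsubsets() is implemented; it simply fits every combination of predictors with lm() on simulated data (the names `sim`, `x1` to `x3`, and `best_by_size` are hypothetical) and keeps the lowest-RSS model of each size:

```r
set.seed(1)
n <- 100
# Simulated data: y depends on x1 and x2 only; x3 is pure noise
sim <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n))
sim$y <- 3 * sim$x1 - 2 * sim$x2 + rnorm(n, sd = 0.5)

predictors <- c("x1", "x2", "x3")

# For each model size k, fit every combination of k predictors
# and keep the combination with the smallest residual sum of squares
best_by_size <- lapply(seq_along(predictors), function(k) {
  combos <- combn(predictors, k, simplify = FALSE)
  rss <- sapply(combos, function(vars) {
    fit <- lm(reformulate(vars, response = "y"), data = sim)
    sum(resid(fit)^2)
  })
  list(vars = combos[[which.min(rss)]], rss = min(rss))
})

for (b in best_by_size) {
  cat(length(b$vars), "variable(s):", paste(b$vars, collapse = ", "),
      " RSS =", round(b$rss, 2), "\n")
}
```

This brute-force loop becomes expensive quickly as the number of predictors grows, which is one reason to rely on a dedicated function instead.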

summary(): outputs the best set of variables for each model size

"*": the variable is included in the corresponding model (here: the best two-variable model contains only Hits and
CRBI)
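The following sketch shows what the summary output looks like. It assumes the leaps package is installed and uses simulated data rather than the lab data (the names `sim`, `x1` to `x4`, `fit` and `fit_summary` are hypothetical):

```r
library(leaps)  # assumption: the leaps package is installed

set.seed(1)
n <- 100
# Simulated data: only x1 and x2 influence y
sim <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n), x4 = rnorm(n))
sim$y <- 3 * sim$x1 - 2 * sim$x2 + rnorm(n, sd = 0.5)

# Same formula syntax as lm()
fit <- regsubsets(y ~ ., data = sim)
fit_summary <- summary(fit)

# Character matrix: one row per model size, "*" marks included variables
print(fit_summary$outmat)

# The same information as a logical matrix (TRUE = variable included)
print(fit_summary$which)
```

Reading along row 2 of either matrix gives the variables in the best two-variable model.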

regsubsets(): by default, reports results only up to the best eight-variable model

nvmax: option that sets the maximum model size, so that results are returned for as many variables as desired
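A small sketch of the effect of nvmax, assuming the leaps package is installed. The data are simulated with ten predictors (the names `X`, `fit_default` and `fit_all` are hypothetical), so the default cut-off becomes visible:

```r
library(leaps)  # assumption: the leaps package is installed

set.seed(1)
n <- 200
# Ten simulated predictors, named V1..V10 by as.data.frame()
X <- as.data.frame(matrix(rnorm(n * 10), nrow = n))
X$y <- X$V1 + 0.5 * X$V2 + rnorm(n)

# Default: results stop at the best eight-variable model
fit_default <- regsubsets(y ~ ., data = X)

# nvmax = 10: results for every model size up to all ten predictors
fit_all <- regsubsets(y ~ ., data = X, nvmax = 10)

n_default <- nrow(summary(fit_default)$which)
n_all     <- nrow(summary(fit_all)$which)
print(c(default = n_default, nvmax10 = n_all))
```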




reg.summary: the stored result of summary() on the regsubsets() fit; it contains R², RSS, adjusted R², Cp and BIC




R² increases from 32% (1 variable) to almost 55% (all variables)
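As a sketch of how these statistics can be inspected, assuming the leaps package is installed and using simulated data (the names `sim` and `reg_summary` are hypothetical, and the R² values will differ from the 32% and 55% of the lab data):

```r
library(leaps)  # assumption: the leaps package is installed

set.seed(1)
n <- 100
sim <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n), x4 = rnorm(n))
sim$y <- 3 * sim$x1 - 2 * sim$x2 + rnorm(n, sd = 0.5)

# Store the summary so its components can be reused
reg_summary <- summary(regsubsets(y ~ ., data = sim))

# Components include rsq, rss, adjr2, cp and bic (one value per model size)
print(names(reg_summary))

# R-squared never decreases as more variables are allowed into the model
print(round(reg_summary$rsq, 3))
```

Because R² can only go up with model size, the adjusted R², Cp and BIC components are the ones to compare across sizes.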




type = "l" (lowercase letter L, not the digit 1): connect plotted points with lines

points(): like plot(), except it puts points on a plot that has already been created, instead of creating a
new plot

which.max(): identifies the location (index) of the maximum element of a vector




Red dot = model with the largest adjusted R² statistic
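The three plotting pieces above can be combined as in the sketch below. It uses base R only; the vector `adjr2` holds made-up adjusted R² values for illustration, not results from the lab data:

```r
# Hypothetical adjusted R-squared values for models with 1 to 8 predictors
adjr2 <- c(0.32, 0.42, 0.45, 0.47, 0.48, 0.50, 0.49, 0.48)

# type = "l" draws a line through the points instead of separate markers
plot(adjr2, xlab = "Number of variables", ylab = "Adjusted R-squared",
     type = "l")

# which.max() gives the position of the largest value, i.e. the model size
best <- which.max(adjr2)
print(best)

# points() adds to the existing plot: mark the best model with a red dot
points(best, adjr2[best], col = "red", cex = 2, pch = 20)
```

For Cp and BIC, where smaller is better, which.min() would take the place of which.max().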

Seller: Worstje2021, Universiteit Gent