Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Summary

JADS Premaster - Statistics for Data Scientists Summary

Rating
-
Sold
-
Pages
28
Uploaded on
10-10-2021
Written in
2021/2022

Summary for the Statistics for Data Scientists course of the Premaster Data Science and Entrepreneurship.

Institution
Course

Content preview

1. Chapter 1: Basics
Measurement Levels
Type Distinction Order Unit Origin

Nominal x

Ordinal x x

Interval x x x

Ratio x x x x
Able to distinguish
Can be ordered Has a unit Has a origin
categories



Statistics Nominal/Ordinal Data
● Frequency: amount of times a category occurs
● Mode: the value that is observed the most
● Median: the middle value

Plots Nominal/Ordinal Data
● Bar chart
● Pie chart

Statistics For Interval/Ratio Data
● Mean: the average
● Mode: the value that is observed the most
● Median: the middle value
● Quantile: points that divide data in intervals (i.e. 25%, 50%, 75%, 100%)
● Range: the difference between the largest and smallest value
● Inter Quartile Range (IQR): the difference between the first and the third quartile
● Mean Absolute Deviation (MAD): the average of how far every observation is from the
mean
● Mean Squared Deviation (MSD): the average of how far every observation is from the
mean squared
● Variance ( S 2 ): how far observation are spread out from their average
● Standard Deviation (SD): the amount of variation
● Skewness: the amount that the data is distributed to the left or the right
○ > 0 : skewed to the right
○ < 0 : skewed to the left
● Kurtosis: how heavy the tails of a distribution differ from the tails of a normal distribution
○ > 0 : heavy tails
○ < 0 : light tails

1

, Plots For Interval/Ratio Data
● Box plot
● Histogram plot
● Density plot
● Scatter plot




2. Chapter 2: Sampling




Representative Sample
A sample that has approximately the same distribution characteristics as the population.
● Simple Random Sampling: each unit in the population has the same probability of
ending up in a sample
● Systematic Sampling: the population is divided into n groups, then one random number
is used to draw a unit from each group at the same index
● Stratified Sampling: the population is divided into n groups, then a percentage of units is
taken from each group
● Cluster Sampling: the population is divided into clusters, then a random sample from
each of the clusters is taken
○ Single-stage: when a random sample is taken from the clusters and from every
sampled cluster all units are taken
○ Multi-stage: when units from the sampled clusters are also randomly sampled

Non-representative Sample
A sample that doesn’t have approximately the same distribution characteristics as the
population.
● Convenience Sampling: a sample that is easy to obtain i.e. in psychology studies
samples are generalized to fit the entire population
● Haphazard Sampling: a sample that may look like a random sample but actually is not
truly a random sample
● Purposive Sampling: a sample that is picked for a specific purpose i.e. customer
satisfaction


2

Connected book

Written for

Institution
Study
Course

Document information

Summarized whole book?
No
Which chapters are summarized?
Chapter 1 to 9
Uploaded on
October 10, 2021
Number of pages
28
Written in
2021/2022
Type
SUMMARY

Subjects

$7.27
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF


Also available in package deal

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
tomdewildt Jheronimus Academy of Data Science
Follow You need to be logged in order to follow users or courses
Sold
29
Member since
4 year
Number of followers
13
Documents
22
Last sold
11 months ago

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions