Geschreven door studenten die geslaagd zijn Direct beschikbaar na je betaling Online lezen of als PDF Verkeerd document? Gratis ruilen 4,6 TrustPilot
logo-home
Samenvatting

Full readings summary for Advanced Statistics

Beoordeling
-
Verkocht
1
Pagina's
8
Geüpload op
13-10-2021
Geschreven in
2019/2020

A full sumary of the literature of the course Advanced Statistics

Voorbeeld van de inhoud

Summary Advanced Statistics 2019-2020 Lieve Bastiaan




Summary: Advanced Statistics
Index
Agresti chapter 9: Linear regression and correlation....................................................................................1
9.1 Linear Relationships...........................................................................................................................1
9.2 Least Squares Prediction Equation.....................................................................................................1
Allison Chapter 1: What Is Multiple Regression?........................................................................................2
Allison Chapter 2: How Do I Interpret Multiple Regression Results?..........................................................2
Allison Chapter 3: What Can Go Wrong With Multiple Regression?..........................................................3
Allison Chapter 5: How Does Bivariate Regression Work?.........................................................................4
Allison Chapter 6: What Are The Assumptions Of Multiple Regression?....................................................5
Allison Chapter 7: What Can Be Done About Multicollinearity?.................................................................6
Allison Chapter 8: How Can Multiple Regression Handle Nonlinear Relationships?..................................7



AGRESTI CHAPTER 9: LINEAR REGRESSION AND CORRELATION

9.1 LINEAR RELATIONSHIPS

x→ Explanatory variable y→ Response variable
Linear function → y = α +βx → Expresses observations on y as a linear function of observations on x. The
formula has a straight line graph with slope β (beta) and y-intercept α (alpha). In the context of a
regression analysis α and β are called regression coefficients.

9.2 LEAST SQUARES PREDICTION EQUATION

When a scatterplot suggests that the model y= α + βx may be appropriate, we use the data to estimate this
line. The notation ^y =a+bx represents a sample equation that estimates the linear model. The sample
equation is called the prediction equation, because it provides a prediction for the response variable at
every value of x.
The formulas to calculate a and b are:
∑( x−x )( y− y )
b= 2 and a= y−b x
∑( x−x)

When is an observation a regression outlier?

 When it falls quite far from the trend that the rest of the data follow.
 If it is influential; meaning that removing it results in a large change in the prediction equation
 Unless the sample size is larger, an observation can have a strong influence on the slope, if its x-
value is low or high compared to the rest of the data


1

, Summary Advanced Statistics 2019-2020 Lieve Bastiaan



The prediction errors are called residuals, for an observation the difference between an observed value and
the predicted value of the response variable y− ^y , is called the residual.
We summarize the size of the residuals by the sum of their squared values. This quantity, denoted by SSE
(Sum of squared errors) is SSE=∑ ( y− ^y )2 .
The least squares estimates a and b are the values that provide the prediction equation for which the
residual sum of squares (SSE) is a minimum.

ALLISON CHAPTER 1: WHAT IS MULTIPLE REGRESSION?

Chapter highlights:

1. Multiples regression is used both for predicting outcomes and for investigating the causes of
outcomes
2. The most popular kind of regression is ordinary least squares but there are other, more
complicated regression methods
3. Ordinary multiple regression is called linear because it can be represented graphically by a
straight line
4. A linear relationship between two variables is usually described by two numbers, the slope and
the intercept.
5. Researchers typically assume that relationships are linear because it’s the simplest kind of
relationship and there’s usually no good reason to consider something more difficult.
6. To do a regression, you need more cases than variables, ideally lots more.
7. Ordinal variables are not well represented by linear regression equations. (Ordinal= An ordinal
variable is similar to a categorical variable. The difference between the two is that there is a clear
ordering of the variables. Ex. socio-economic status; low, middle, high)
8. Ordinary least squares chooses the regression coefficients (slopes and intercept) to minimize the
sum of squared prediction errors.
9. The R2 is the statistic most often used to measure how well the dependent variable can be
predicted from knowledge of the independent variables.
10. To evaluate the least squares estimates of the regression coefficients, we usually rely on
confidence intervals and hypothesis tests.
11. Multiples regression allows us to statistically control for measured variables, but this control is
never as good as a randomized experiment.

ALLISON CHAPTER 2: HOW DO I INTERPRET MULTIPLE REGRESSION RESULTS?

Chapter highlights:

1. Asterisks after a regression coefficient usually indicated that the coefficient is significantly
different from 0. The most common convention is one star for a p value below 0.05 and two stars
for a p value below 0.01. (This is not universal)
2. To interpret the numeric value of a regression coefficient, it’s essential to understand the metrics
of the dependent and independent variables
3. Coefficients for dummy (0,1) variables usually can be interpreted as differences in means on the
dependent variables for the two categories of the independent variable, controlling for other
variables in the regression model.

2

Documentinformatie

Geüpload op
13 oktober 2021
Aantal pagina's
8
Geschreven in
2019/2020
Type
SAMENVATTING
€6,69
Krijg toegang tot het volledige document:

Verkeerd document? Gratis ruilen Binnen 14 dagen na aankoop en voor het downloaden kun je een ander document kiezen. Je kunt het bedrag gewoon opnieuw besteden.
Geschreven door studenten die geslaagd zijn
Direct beschikbaar na je betaling
Online lezen of als PDF

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
SociologyEconomics2 Universiteit van Amsterdam
Bekijk profiel
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
79
Lid sinds
7 jaar
Aantal volgers
57
Documenten
19
Laatst verkocht
2 maanden geleden

4,0

11 beoordelingen

5
4
4
3
3
4
2
0
1
0

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Bezig met je bronvermelding?

Maak nauwkeurige citaten in APA, MLA en Harvard met onze gratis bronnengenerator.

Bezig met je bronvermelding?

Veelgestelde vragen