Geschreven door studenten die geslaagd zijn Direct beschikbaar na je betaling Online lezen of als PDF Verkeerd document? Gratis ruilen 4,6 TrustPilot
logo-home
Tentamen (uitwerkingen)

ISYE 6501 Introduction to Analytics / ISYE 6501 MIDTERM EXAM 1

Beoordeling
-
Verkocht
-
Pagina's
35
Cijfer
A
Geüpload op
19-03-2022
Geschreven in
2022/2023

ISYE 6501 Introduction to Analytics / ISYE 6501 MIDTERM EXAM 1|ISYE 6501 Introduction to Analytics / ISYE 6501 MIDTERM EXAM 1

Instelling
Vak

Voorbeeld van de inhoud

ISYE 6501 MIDTERM EXAM 1

Why would we want to estimate the variance?

- Knowing the variance can help us estimate the amount of error


Why is GARCH different from ARIMA and exponential smoothing?

- GARCH estimates variance
- ARIMA and exponential smoothing both estimate the value of an attribute; GARCH
estimates the variance


When would regression be used instead of a time series model?

- When there are other factors or predictors that affect the response.

- Regression helps show the relationships between factors and a response


If two models are approximately equally good, measures like AIC and BIC will favor the simpler
model. Simpler models are often better because...

- Simpler models are less likely to be over-fit, easier to understand, and easier to explain


What is not a common use of regression?

- Prescriptive analytics: Determining the best course of action
- Regression is often good for describing and predicting, but is not as helpful for suggesting a
course of action


True or false: regression is a way to determine whether one thing causes another.

- False. Regression can show relationships between observations, but it doesn't show whether
one thing causes another

,Suppose our regression model to estimate how tall a 2-year-old will be as an adult has the
following coefficients:
0.56xFatherHeight + 0.51xMotherHeight - 0.02xFatherHeightxMotherHeight
The negative sign on the coefficient of FatherHeightxMotherHeight means:

- People with two taller-than-average parents won't be as tall as the individual effects of
father's height and mother's height add up to

- The negative coefficient for the interaction term brings down the overall estimate


What does "heteroscedasticity" mean?




- The variance is different in different ranges of the data


You might want to de-trend data before...

- ...using time-series data in a regression model

Factor-based models like regression generally don't account for time-based effects like trend.


Which of the following does principal component analysis (PCA) do?

- Transform data so there's no correlation between dimensions and rank the new dimensions
in likely order of importance.


If you use principal component analysis (PCA) to transform your data and then you run a
regression model on it, how can you interpret the regression coefficients in terms of the
original attributes?

- Each original attribute's implied regression coefficient is equal to a linear combination of the
principal components' regression coefficients.

,This is equivalent to using the inverse transformation.


True or false: In a regression tree, every leaf of the tree has a different regression model that
might use different attributes, have different coefficients, etc.

- True. Each leaf's individual model is tailored to the subset of data points that follow all of
the branches leading to the leaf.


Tree-based approaches can be used for other models besides regression.

- True. For example, a classification tree might have a different SVM or KNN model at each
leaf. It might even use SVM at some leaves and KNN at others (though that's probably rare).


A common rule of thumb is to stop branching if a leaf would contain less than 5% of the data
points. Why not keep branching and allow models to find very close fits to each very small
subset of data?

- Fitting to very small subsets of data will cause overfitting. With too few data points, the
models will fit to random patterns as well as real ones


True or False: When using a random forest model, it's easy to interpret how its results are
determined.

- False. Unlike a model like regression where we can show the result as a simple linear
combination of each attribute times its regression coefficient, in a random forest model there
are so many different trees used simultaneously that it's difficult to interpret exactly how any
factor or factors affect the result.


A logistic regression model can be especially useful when the response...

- ...is a probability (a number between zero and one) or is binary (either zero or one).


A model is built to determine whether data points belong to a category or not. A "true
negative" result is:

- A data point that is not in the category, and the model correctly says so. True' and 'false'
refer to whether the model is correct or not, and 'positive' and 'negative' refer to whether the
model says the point is in the category.

, True or False: The most useful classification models are the ones that correctly classify the
highest fraction of data points.

- False. Sometimes the cost of a false positive is so high that it's worth accepting more false
negatives, or vice versa.


What do descriptive questions ask?

- What happened? (e.g., which customers are most alike)


What do predictive questions ask?

- What will happen? (e.g., what will Google's stock price be?)


What do prescriptive questions ask?

- What action(s) would be best? (e.g., where to put traffic lights)


What is a model?

- Real-life situation expressed as math.


What do classifiers help you do?

- differentiate


What is a soft classifier and when is it used?

- In some cases, there won't be a line that separates all of the labeled examples. So we use a
classifier that minimizes the number of mistakes.


What does it mean when the classifier/decision boundary is almost parallel to the vertical x-
axis?

- The horizontal attribute is all that is needed.

Geschreven voor

Instelling
Vak

Documentinformatie

Geüpload op
19 maart 2022
Aantal pagina's
35
Geschreven in
2022/2023
Type
Tentamen (uitwerkingen)
Bevat
Vragen en antwoorden

Onderwerpen

$15.99
Krijg toegang tot het volledige document:

Verkeerd document? Gratis ruilen Binnen 14 dagen na aankoop en voor het downloaden kun je een ander document kiezen. Je kunt het bedrag gewoon opnieuw besteden.
Geschreven door studenten die geslaagd zijn
Direct beschikbaar na je betaling
Online lezen of als PDF

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
Bri254 Rasmussen College
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
918
Lid sinds
5 jaar
Aantal volgers
738
Documenten
3524
Laatst verkocht
1 week geleden
Best Tutorials, Exam guides, Homework help.

When assignments start weighing you down, take a break. I'm here to create a hassle-free experience by providing up-to-date and recent study materials. Kindly message me if you can't find your tutorial and I will help.

4.0

181 beoordelingen

5
106
4
20
3
25
2
6
1
24

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Bezig met je bronvermelding?

Maak nauwkeurige citaten in APA, MLA en Harvard met onze gratis bronnengenerator.

Bezig met je bronvermelding?

Veelgestelde vragen