C784 STATISTICS FULL COURSE
REVIEW EXAM WITH CORRECT
QUESTIONS AND ANSWERS
Lurking Variable - CORRECT-ANSWERS: A variable that is not directly being studied but is associated with both the explanatory and the response variables.
Simpson's Paradox - CORRECT-ANSWERS: Occurs when a trend or result that appears in separate groups of data disappears or reverses when we combine the data.
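A minimal sketch of the paradox, using made-up counts (the groups, options, and numbers are hypothetical, chosen only to make the reversal visible): option A has the higher success rate inside each group, yet option B has the higher rate once the groups are combined.

```python
# Hypothetical (made-up) counts: (successes, trials) for each option per group.
counts = {
    "Group 1": {"A": (9, 10), "B": (80, 100)},
    "Group 2": {"A": (30, 100), "B": (2, 10)},
}

def rate(successes, trials):
    """Success rate as a fraction between 0 and 1."""
    return successes / trials

# Within each group, option A has the higher success rate.
for group, options in counts.items():
    a = rate(*options["A"])
    b = rate(*options["B"])
    print(f"{group}: A = {a:.0%}, B = {b:.0%}")

def combined_rate(option):
    """Success rate for one option after pooling all groups."""
    successes = sum(g[option][0] for g in counts.values())
    trials = sum(g[option][1] for g in counts.values())
    return rate(successes, trials)

overall_a = combined_rate("A")
overall_b = combined_rate("B")
# Combined, the ordering reverses: B is now ahead of A.
print(f"Combined: A = {overall_a:.1%}, B = {overall_b:.1%}")
```

The reversal happens because the two options have very different numbers of trials in each group, so the pooled rates are weighted differently.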
Regression Analysis - CORRECT-ANSWERS: A process for estimating and analyzing the relationship between variables.
Least-Squares Regression Line - CORRECT-ANSWERS: AKA the line of best fit.
y = mx + b, where m is the slope and b is the y-intercept
The slope m is "rise over run"
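As a rough illustration of how the slope and intercept of a line of best fit can be computed, the sketch below uses the standard least-squares formulas. The function name and the small data set are hypothetical and only serve the example.

```python
def least_squares_line(xs, ys):
    """Return (m, b) for the least-squares line y = m*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sum of squared x deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx            # slope: "rise over run"
    b = mean_y - m * mean_x  # intercept: the line passes through the means
    return m, b

# Made-up data for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
m, b = least_squares_line(xs, ys)
print(f"y = {m:.1f}x + {b:.1f}")  # prints: y = 0.6x + 2.2
```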
Interpolation - CORRECT-ANSWERS: Using existing data along with the Least-Squares Regression Line to make a prediction WITHIN the range of known data points.
Extrapolation - CORRECT-ANSWERS: Estimation by projecting OUTSIDE the range of known data points.
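The difference between the two kinds of prediction can be sketched as a check on whether the x value falls inside the range of observed data. The function name, the line (m = 0.6, b = 2.2), and the observed range 1 to 5 are assumptions made for the example.

```python
def predict(x, m, b, x_min, x_max):
    """Predict y from the line y = m*x + b and label the kind of prediction."""
    inside = x_min <= x <= x_max
    kind = "interpolation" if inside else "extrapolation"
    return m * x + b, kind

# Hypothetical regression line fitted on x values from 1 to 5.
m, b = 0.6, 2.2
y_in, kind_in = predict(3, m, b, x_min=1, x_max=5)
y_out, kind_out = predict(10, m, b, x_min=1, x_max=5)
print(f"x = 3:  y = {y_in:.1f} ({kind_in})")
print(f"x = 10: y = {y_out:.1f} ({kind_out})")
```

Extrapolated predictions are generally less trustworthy, because nothing in the data confirms that the linear pattern continues outside the observed range.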
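p-value - CORRECT-ANSWERS: the probability of observing a result at least as extreme as the one measured, assuming the null hypothesis is true (informally, how likely the result is to have occurred by chance alone)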
Significance Levels in Hypothesis Testing - CORRECT-ANSWERS:
Statistically Significant: If a measured p-value is LESS than the significance level.
NOT Statistically Significant: If a measured p-value is MORE than the significance level.
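To make the decision rule concrete, here is a small sketch that computes a one-sided p-value for a coin-flip experiment and compares it with a significance level. The scenario (16 heads in 20 flips, null hypothesis of a fair coin), the function names, and the 0.05 level are assumptions for illustration.

```python
from math import comb

def binomial_p_value_upper(successes, trials, p_null=0.5):
    """P(X >= successes) when X ~ Binomial(trials, p_null)."""
    return sum(
        comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )

def is_significant(p_value, alpha=0.05):
    """Statistically significant when the p-value is below alpha."""
    return p_value < alpha

# Hypothetical experiment: 16 heads in 20 flips of a supposedly fair coin.
p_value = binomial_p_value_upper(16, 20)
print(f"p-value = {p_value:.4f}")  # prints: p-value = 0.0059
print("significant" if is_significant(p_value) else "not significant")
```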
Empirical Rule - CORRECT-ANSWERS: 68 - 95 - 99.7 (for approximately normal, bell-shaped data)
About 68% is w/in 1 standard deviation of the mean
About 95% is w/in 2 standard deviations of the mean
About 99.7% is w/in 3 standard deviations of the mean
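The 68 - 95 - 99.7 figures can be checked against the normal distribution itself. The sketch below uses Python's standard-library NormalDist to compute the share of values within 1, 2, and 3 standard deviations of the mean; the helper name share_within is made up for the example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Share of a normal distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

shares = {k: share_within(k) for k in (1, 2, 3)}
for k, share in shares.items():
    print(f"within {k} SD: {share:.1%}")
```

The printed values (68.3%, 95.4%, 99.7%) are why the rule is stated as an approximation.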
Used for Categorical (Qual) Data - CORRECT-ANSWERS:
Bar Chart
Pie Chart
Mean - CORRECT-ANSWERS: Average - add all the values in the series, then divide by the total number of values in that series
Center & Spread Measures - CORRECT-ANSWERS:
SYMMETRIC - Mean (center) / Standard Deviation (spread)
SKEWED - Median (center) / IQR (spread)
CATEGORICAL - Mode (center) / no spread
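A short sketch of why the median and IQR are preferred for skewed data: one extreme value pulls the mean and standard deviation a long way, while the median and IQR barely notice it. The data set is made up, and the "inclusive" quartile method is only one of several common conventions.

```python
from statistics import mean, median, quantiles, stdev

# Made-up, strongly right-skewed data (one extreme value).
skewed = [1, 2, 3, 4, 100]

q1, _, q3 = quantiles(skewed, n=4, method="inclusive")
iqr = q3 - q1

print(f"mean   = {mean(skewed)}")       # pulled upward by the value 100
print(f"median = {median(skewed)}")     # stays near the bulk of the data
print(f"stdev  = {stdev(skewed):.1f}")  # inflated by the extreme value
print(f"IQR    = {iqr}")                # based only on the middle 50%
```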
What is the difference between a bar chart and a histogram? - CORRECT-ANSWERS:
BAR CHART - Categorical data distributed over groups of categories
HISTOGRAM - Quantitative data distributed over various intervals
Outlier Formula - CORRECT-ANSWERS: A value is an outlier if it is more than 1.5 x IQR above Q3 or more than 1.5 x IQR below Q1
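The 1.5 x IQR rule can be sketched as two "fences" around the quartiles. The function name and data are hypothetical, and the quartiles use the standard library's "inclusive" method; other textbooks and tools compute quartiles slightly differently, which can move the fences.

```python
from statistics import quantiles

def find_outliers(data):
    """Return (outliers, (lower_fence, upper_fence)) using the 1.5 x IQR rule."""
    q1, _, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    outliers = [x for x in data if x < lower_fence or x > upper_fence]
    return outliers, (lower_fence, upper_fence)

# Made-up data: Q1 = 4, Q3 = 8, so IQR = 4 and the fences are -2 and 14.
data = [2, 4, 4, 5, 6, 7, 8, 9, 30]
outliers, fences = find_outliers(data)
print(f"fences = {fences}, outliers = {outliers}")
```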
Graphical Displays for Quantitative Variables - CORRECT-ANSWERS:
HISTOGRAMS - large data sets
STEM PLOTS - keeps individual data points
DOT PLOTS - smaller data sets
BOX PLOTS - center, spread and 5 number summary
SCATTER PLOTS - x,y axis
5 Number Summary - CORRECT-ANSWERS: Min - Q1 - Median - Q3 - Max
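A minimal sketch of computing the five values, assuming the standard library's "inclusive" quartile method (one convention among several). The function name and data are made up for the example.

```python
from statistics import quantiles

def five_number_summary(data):
    """Return (min, Q1, median, Q3, max) for a list of numbers."""
    q1, med, q3 = quantiles(data, n=4, method="inclusive")
    return min(data), q1, med, q3, max(data)

# Made-up data for illustration only.
summary = five_number_summary([1, 3, 5, 7, 9])
print(summary)  # prints: (1, 3.0, 5.0, 7.0, 9)
```

These are the same five values a box plot draws: the whisker ends, the box edges, and the line inside the box.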
Graphical Displays and Corresponding Numerical Analysis Tools for Examining Data Relationships - CORRECT-ANSWERS: (C = categorical, Q = quantitative; explanatory > response)
C > C - Two-way Frequency Tables (aka Contingency Tables) use conditional percentages and relative frequencies
C > Q - Side-by-Side Boxplots use 5 number summaries
Q > Q - Scatterplots use correlation coefficients
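For the Q > Q case, the correlation coefficient can be sketched as follows. This computes Pearson's r from its definition; the function name and the data are hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Made-up data for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = pearson_r(xs, ys)
print(f"r = {r:.3f}")  # prints: r = 0.775
```

Values of r near +1 or -1 indicate a strong linear relationship, and values near 0 indicate a weak one.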
InterQuartile Range (IQR) - CORRECT-ANSWERS: Q3 - Q1