_______ show(s) the shape of box plots using the same data but different visualizations **
Answ** histograms
________ determines the _______ and the ______ ** Answ** shape, center and spread
_________ (spread measure) IS inflated by outliers ** Answ** mean
_________ (spread measure) is NOT inflated by outliers ** Answ** median
_________ are necessary in situations where exact answers or too complex or impossible to
determine or predict ** Answ** simulations
5 number summary components ** Answ** Max, Q3, median, Q1, min
a coin flip is a _______ event that is ___________ ** Answ** random, equally likely
A p-value for correlation which is statistically significant implies the correlation is due to
random chance. (t/f) ** Answ** false
A strong relationship can have correlation equal to zero. (t/f) ** Answ** true, it would just
be non-linear
a user ID is an example of a _____ variable ** Answ** identifier
all data needs ____ ** Answ** context
association ** Answ** knowing one thing gives you information about another
, boxplots display ______ variables ** Answ** quantitative
can you remove outliers? ** Answ** only if they are ridiculously crazy for the data set and
you have a reason for it, then you must disclose that in the data set
categorical variables ** Answ** one whose possible values are given by short descriptors
conditional distribution ** Answ** shows the distribution of one variable for just the
individuals who satisfy some condition on another variable
ex: "Of the crew of the titanic, what % survived?"
continuous ** Answ** can continue with decimals
correlation ** Answ** a specific kind of association that is reserved for 2 quantitative
variables, is straight enough, and has no outliers
correlation coefficient ** Answ** based on the relationship between the z scores of x and y,
IS UNITLESS, cannot be used if data has no variation
data can be ____ ** Answ** anything!
discrete ** Answ** whole numbers
each quartile on a box plot has (the same/different) amount of data contained? ** Answ**
the same, no matter if one section is bigger or smaller!
explain "left to the low, right to the high" ** Answ** Left to the low = low numbers on the
box plot... if the length of the box plot fence is down low= left skew; right to the high = high
numbers