Which of these is generally a good reason to remove an outlier from your data set?
A. The outlier is incorrectly entered data, not real data.
B. Outliers like this only happen occasionally. - ANSWER A. If the data point isn't
real data, you should remove it from your data set.
What is an outlier? - ANSWER A data point that is very different from the rest
What graph or plot can we use to find outliers? - ANSWER box and whisker plot
What are the parts of a box and whisker plot? - ANSWER The bottom and top of the
box are the 25th and 75th percentiles. The line inside the box is the median. The
whiskers stretch up and down to a reasonable range of values (e.g., the 10th and 90th
or the 5th and 95th percentiles).
Where would outliers exist in a box and whisker plot? - ANSWER Outside of the
whiskers
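As a rough illustration of the box and whisker cards above, the sketch below computes the five box-plot values and lists the points that fall outside the whiskers. The function names (`percentile`, `box_plot_summary`, `outside_whiskers`) and the sample data are made up for this example, and the 5th/95th percentile whiskers are just one of the choices mentioned above.

```python
def percentile(sorted_vals, pct):
    """Linearly interpolated percentile of an already-sorted list."""
    pos = (len(sorted_vals) - 1) * pct / 100
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def box_plot_summary(values, lower_pct=5, upper_pct=95):
    """Return the whisker ends, box edges, and median for the data."""
    s = sorted(values)
    return {
        "whisker_low": percentile(s, lower_pct),
        "q1": percentile(s, 25),        # bottom of the box
        "median": percentile(s, 50),    # line inside the box
        "q3": percentile(s, 75),        # top of the box
        "whisker_high": percentile(s, upper_pct),
    }


def outside_whiskers(values, lower_pct=5, upper_pct=95):
    """Return the points that lie below the low whisker or above the high one."""
    summary = box_plot_summary(values, lower_pct, upper_pct)
    return [v for v in values
            if v < summary["whisker_low"] or v > summary["whisker_high"]]


# Hypothetical sample: nine ordinary readings and one suspicious one.
sample = [10, 11, 12, 12, 13, 13, 14, 14, 15, 95]
summary = box_plot_summary(sample)
candidates = outside_whiskers(sample)
```

With percentile-based whiskers some points always land outside, so the returned values are candidates to inspect rather than confirmed bad data.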
What are some ways to deal with outliers that are bad data? - ANSWER Omit them or
use imputation
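The two options above can be sketched as follows. This is a minimal example, assuming the bad points can be identified by a simple rule; the helper names (`omit`, `impute_median`), the age data, and the "over 120" rule are all hypothetical, and the median is only one possible fill-in value for imputation.

```python
from statistics import median


def omit(values, is_bad):
    """Drop every point that the is_bad rule flags."""
    return [v for v in values if not is_bad(v)]


def impute_median(values, is_bad):
    """Replace each flagged point with the median of the unflagged points."""
    good = [v for v in values if not is_bad(v)]
    fill = median(good)
    return [fill if is_bad(v) else v for v in values]


# Hypothetical example: an age of 530 is almost certainly a typing error.
ages = [34, 29, 41, 530, 38]


def impossible_age(age):
    return age > 120


ages_omitted = omit(ages, impossible_age)
ages_imputed = impute_median(ages, impossible_age)
```

Omitting shrinks the data set, while imputation keeps every row at the cost of inserting a value that was never observed.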
What can change detection be used for? - ANSWER Determining whether action might
be needed, determining the impact of a past action, and identifying changes to help
with planning
What is Cumulative Sum (CUSUM) used for? - ANSWER Detecting when a value has
increased, decreased, or both
What is C used for in the CUSUM formula? - ANSWER Since we expect some randomness
in the data, we include a value C that pulls the running total down at each step
If we have a larger C ... - ANSWER the harder it is for S_t to get large and the less
sensitive the method will be
If we have a smaller C ... - ANSWER the more sensitive the method is because S_t can
get larger faster
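To make the effect of C concrete, here is a minimal sketch of CUSUM for detecting an increase, using the usual running total S_t = max(0, S_(t-1) + (x_t - mean - C)) and flagging a change once S_t reaches a threshold. The function name `cusum_increase`, the threshold of 5, and the sample readings are assumptions made for illustration.

```python
def cusum_increase(values, mean, c, threshold):
    """Run CUSUM for an upward change.

    Returns (index of the first point where S_t >= threshold, or None,
             list of all S_t values).
    """
    s = 0.0
    history = []
    detected = None
    for t, x in enumerate(values):
        # Add how far x is above the expected mean, minus C, never going below 0.
        s = max(0.0, s + (x - mean - c))
        history.append(s)
        if detected is None and s >= threshold:
            detected = t
    return detected, history


# Hypothetical readings: around 10 at first, then shifting up to about 13-14.
readings = [10, 11, 9, 10, 10, 13, 14, 13, 14, 13]

small_c_index, small_c_history = cusum_increase(readings, mean=10, c=0.5, threshold=5)
large_c_index, large_c_history = cusum_increase(readings, mean=10, c=2, threshold=5)
```

On this data the smaller C crosses the threshold two points earlier than the larger C, which matches the sensitivity trade-off described above. To detect a decrease instead, the same loop can be used with (mean - x - c) in place of (x - mean - c).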