What is classification? - Putting things into categories; differentiating
To avoid error/misclassification in classification models, we maximize _____. - the
margin between the two groups of data points
What's the difference between hard and soft classifiers? - Soft classifiers give as good a
separation (between data classes) as possible, while hard classifiers separate perfectly.
What does a completely vertical or horizontal classifier tell us about the attributes? -
One of the attributes is not needed
When referring to table-formatted data, name 4 synonyms for a column - Attribute,
feature, covariate, predictor
What is unstructured data? Provide an example. - Data that is not easily described and
stored. Example: written text
What is quantitative data? - Numbers with meaning
What is categorical data? - Numbers without meaning OR non-numeric data OR binary
data
What's the difference between unrelated data and time series data? - Unrelated data =
no relationships between data points. Time series data = the same data recorded over
time, often at equal intervals.
Soft classifiers involve a trade off between what two things? - Minimizing the
classification errors we make with maximizing the margin between the support vectors