GTx: ISYE6501x Introduction to Analytics Modeling Midterm
Quiz 1 & QUIZ 2 - GT Students and Verified MM Learners latest
2023
Raw Data - ANSWER: Pertains to the collected data before it's processed or ranked.
Statistics - ANSWER: Defined as the science of collecting, analyzing, presenting, and
interpreting data, as well as of making decisions based on such analysis,.
Descriptive statistics - ANSWER: Consists of techniques for organizing, displaying, and
describing data through labels, graphs, and summary measures.
Inferential statistics - ANSWER: Consists of methods that use sample results to help
make decisions or predictions about a population.
Data set - ANSWER: A collection of observations on one or more variables.
Variable - ANSWER: A characteristic under study that assures different values for
different elements.
Quantitative Variables - ANSWER: Pertains to a variable that can be measured
numerically, divided into two types: discrete variables and continuous variables.
Discrete variable - ANSWER: A variable whose values are countable and can assume
only certain values.
Continuous variable - ANSWER: Data that can take on any interval values, also known
as float, interval, or numeric data.
Qualitative variable - ANSWER: Variables whose values cannot be measured.
Ordinal variables - ANSWER: Categorical data with an explicit ordering, also known as
an ordered factor.
Binary variables aka dichotomous, logical, indicator - ANSWER: A special case of
categorical data with just two categories, such as 0/1 or True/False.
Exploratory Data Analysis (EDA) - ANSWER: It refers to the critical process of
performing initial investigations on the data so as to discover patterns, to spot
anomalies, to test hypothesis and to check assumptions with the help of summary
statistics and graphical representations.
Measure of central tendencies - ANSWER: An estimated of where most of the data
are located
, Arithmetic Mean - ANSWER: Calculated by adding together the values in a sample
and dividing the sum by the number of items in the sample.
Median - ANSWER: The middle value when numbers are arranged in order, dividing
the ranked data set into two equal parts.
Mode - ANSWER: The most frequent value in a sample, calculated by determining
the value with the highest frequency.
Bimodal - ANSWER: The data set contains 2 modes
Multimodal - ANSWER: If the distribution has more than two modes.
Measures of Dispersion - ANSWER: Measures whether the data values are tightly
clustered or spread out.
Range (largest value - smallest value) - ANSWER: The difference between the largest
and smallest value in a data set.
Standard Deviation (small standard dev. = close to the mean, large standard dev. =
farther away from the mean,) - ANSWER: Used to measure the average deviation
from the mean in a normally distributed data set, can never be a negative number,
can be 0.
Graphically - ANSWER: Way to explore distribution using frequency histograms or
tally plots draws a picture of the sample shape.
Shape statistics - ANSWER: Way to explore distribution such as skewness and
kurtosis.
Boxplot - ANSWER: A plot introduced by Tukey as a quick way to visualize the
distribution of data.
Skewness - ANSWER: A measure of how central the average is in relation to the
overall spread of values, a measure of how central the average is in the distribution.
Positively Skewed - ANSWER: Indicates that the average is skewed to the left with a
long tail of more positive values.
Negatively Skewed - ANSWER: Indicates that the average is skewed to the right with
a long tail of more positive values.
Descriptive Statistics - ANSWER: Describing and summarizing data sets using pictures
and statistical quantities.
Inferential Statistics - ANSWER: Analyzing data sets and drawing conclusions from
them.
Quiz 1 & QUIZ 2 - GT Students and Verified MM Learners latest
2023
Raw Data - ANSWER: Pertains to the collected data before it's processed or ranked.
Statistics - ANSWER: Defined as the science of collecting, analyzing, presenting, and
interpreting data, as well as of making decisions based on such analysis,.
Descriptive statistics - ANSWER: Consists of techniques for organizing, displaying, and
describing data through labels, graphs, and summary measures.
Inferential statistics - ANSWER: Consists of methods that use sample results to help
make decisions or predictions about a population.
Data set - ANSWER: A collection of observations on one or more variables.
Variable - ANSWER: A characteristic under study that assures different values for
different elements.
Quantitative Variables - ANSWER: Pertains to a variable that can be measured
numerically, divided into two types: discrete variables and continuous variables.
Discrete variable - ANSWER: A variable whose values are countable and can assume
only certain values.
Continuous variable - ANSWER: Data that can take on any interval values, also known
as float, interval, or numeric data.
Qualitative variable - ANSWER: Variables whose values cannot be measured.
Ordinal variables - ANSWER: Categorical data with an explicit ordering, also known as
an ordered factor.
Binary variables aka dichotomous, logical, indicator - ANSWER: A special case of
categorical data with just two categories, such as 0/1 or True/False.
Exploratory Data Analysis (EDA) - ANSWER: It refers to the critical process of
performing initial investigations on the data so as to discover patterns, to spot
anomalies, to test hypothesis and to check assumptions with the help of summary
statistics and graphical representations.
Measure of central tendencies - ANSWER: An estimated of where most of the data
are located
, Arithmetic Mean - ANSWER: Calculated by adding together the values in a sample
and dividing the sum by the number of items in the sample.
Median - ANSWER: The middle value when numbers are arranged in order, dividing
the ranked data set into two equal parts.
Mode - ANSWER: The most frequent value in a sample, calculated by determining
the value with the highest frequency.
Bimodal - ANSWER: The data set contains 2 modes
Multimodal - ANSWER: If the distribution has more than two modes.
Measures of Dispersion - ANSWER: Measures whether the data values are tightly
clustered or spread out.
Range (largest value - smallest value) - ANSWER: The difference between the largest
and smallest value in a data set.
Standard Deviation (small standard dev. = close to the mean, large standard dev. =
farther away from the mean,) - ANSWER: Used to measure the average deviation
from the mean in a normally distributed data set, can never be a negative number,
can be 0.
Graphically - ANSWER: Way to explore distribution using frequency histograms or
tally plots draws a picture of the sample shape.
Shape statistics - ANSWER: Way to explore distribution such as skewness and
kurtosis.
Boxplot - ANSWER: A plot introduced by Tukey as a quick way to visualize the
distribution of data.
Skewness - ANSWER: A measure of how central the average is in relation to the
overall spread of values, a measure of how central the average is in the distribution.
Positively Skewed - ANSWER: Indicates that the average is skewed to the left with a
long tail of more positive values.
Negatively Skewed - ANSWER: Indicates that the average is skewed to the right with
a long tail of more positive values.
Descriptive Statistics - ANSWER: Describing and summarizing data sets using pictures
and statistical quantities.
Inferential Statistics - ANSWER: Analyzing data sets and drawing conclusions from
them.