BUSN 5000 CERTIFICATION EVALUATION
2026 COMPLETE REVIEW GRADED A+
⩥ A data set is made up of _____ that contain information on a specific
entity.
Answer: Records
⩥ Each record is made of _____ that contain measurements of known
types.
Answer: Fields
⩥ A data table is made up of rows containing _____ and columns
containing _____.
Answer: Observations, variables
⩥ We say that data are tidy if each variable corresponds to a _____, each
row an _____, and each cell a _____.
Answer: column, observation, single value
⩥ A quick-serve restaurant chain records sales, staffing and customer
traffic every day for each store. You recognize this as a _____ data set
where the unit of observation is the store-day.
Answer: panel
,⩥ We distinguish 4 stages of data analysis and refer to them compactly
as _____ (in all caps).
Answer: ATAC
⩥ Name the stages of ATAC
Answer: acquisition, transformation, analysis, communication
⩥ The second stage involves, among other things, making sure the data
are _____ (as the Posit folks would say).
Answer: Tidy
⩥ In the third stage, the workhorse will be the _____.
Answer: CEF
⩥ A variable will not have _____ if it does not measure what it is
supposed to.
Answer: Validity
⩥ How to handle missing data depends on whether they are missing
_____.
Answer: Endogenously
,⩥ A national company has developed a new product and is offering it for
sale at a discount to introduce it to the market. Randomly surveying
customer who purchased the product in the initial discount period
(would/would not) ______ generate a sample representing the
population of typical customers.
Answer: Would not
⩥ It is advisable to _____ the acquisition, transformation and analysis
tasks.
Answer: Separate
⩥ One reason reproducibility matters is to protect and support your
_____ self.
Answer: Future
⩥ Another reason reproducibility matters to guard against _____ and
_____.
Answer: error, fraud
⩥ One important component is describing the exact _____ of your raw
input data.
Answer: source
, ⩥ You should view a reproducible analysis as a _____ that you should
be able to produce again and again.
Answer: product
⩥ A _____ is a representation of the data structure comprising all of the
attributes of the data and their types.
Answer: data schema
⩥ This representation of the data structure identifies the _____ to which
each observation pertains.
Answer: unit of record
⩥ This representation of the data also makes clear what are the _____
that identify an observation.
Answer: key variables
⩥ A terabyte is equal to a _____ bytes.
Answer: 1 trillion
⩥ R stores real numbers as a _____ data type and allocates _____ bytes
of data to each number.
Answer: numeric, 8
2026 COMPLETE REVIEW GRADED A+
⩥ A data set is made up of _____ that contain information on a specific
entity.
Answer: Records
⩥ Each record is made of _____ that contain measurements of known
types.
Answer: Fields
⩥ A data table is made up of rows containing _____ and columns
containing _____.
Answer: Observations, variables
⩥ We say that data are tidy if each variable corresponds to a _____, each
row an _____, and each cell a _____.
Answer: column, observation, single value
⩥ A quick-serve restaurant chain records sales, staffing and customer
traffic every day for each store. You recognize this as a _____ data set
where the unit of observation is the store-day.
Answer: panel
,⩥ We distinguish 4 stages of data analysis and refer to them compactly
as _____ (in all caps).
Answer: ATAC
⩥ Name the stages of ATAC
Answer: acquisition, transformation, analysis, communication
⩥ The second stage involves, among other things, making sure the data
are _____ (as the Posit folks would say).
Answer: Tidy
⩥ In the third stage, the workhorse will be the _____.
Answer: CEF
⩥ A variable will not have _____ if it does not measure what it is
supposed to.
Answer: Validity
⩥ How to handle missing data depends on whether they are missing
_____.
Answer: Endogenously
,⩥ A national company has developed a new product and is offering it for
sale at a discount to introduce it to the market. Randomly surveying
customer who purchased the product in the initial discount period
(would/would not) ______ generate a sample representing the
population of typical customers.
Answer: Would not
⩥ It is advisable to _____ the acquisition, transformation and analysis
tasks.
Answer: Separate
⩥ One reason reproducibility matters is to protect and support your
_____ self.
Answer: Future
⩥ Another reason reproducibility matters to guard against _____ and
_____.
Answer: error, fraud
⩥ One important component is describing the exact _____ of your raw
input data.
Answer: source
, ⩥ You should view a reproducible analysis as a _____ that you should
be able to produce again and again.
Answer: product
⩥ A _____ is a representation of the data structure comprising all of the
attributes of the data and their types.
Answer: data schema
⩥ This representation of the data structure identifies the _____ to which
each observation pertains.
Answer: unit of record
⩥ This representation of the data also makes clear what are the _____
that identify an observation.
Answer: key variables
⩥ A terabyte is equal to a _____ bytes.
Answer: 1 trillion
⩥ R stores real numbers as a _____ data type and allocates _____ bytes
of data to each number.
Answer: numeric, 8