Busn 5000 Cornwell UGA Midterm
1. The term data is (singular/plural) _____.
   Answer: Plural
2. A data set is made up of _____ that contain information on a specific entity.
   Answer: Records
3. Each record is made of _____ that contain measurements of known types.
   Answer: Fields
4. A data table is made up of rows containing _____ and columns containing _____.
   Answer: Observations, Variables
5. We say that data are tidy if each variable corresponds to a _____, each row
   an _____, and each cell a _____.
   Answer: Column, Observation, Single value
6. A quick-serve restaurant chain records sales, staffing and customer traffic
   every day for each store. You recognize this as a _____ data set where the
   unit of observation is the store-day.
   Answer: Panel
7. We distinguish 4 stages of data analysis and refer to them compactly as
   _____ (in all caps).
   Answer: ATAC
8. Name the stages.
   Answer: Acquisition, Transformation, Analysis, Communication
9. The second stage involves, among other things, making sure the data are
   _____ (as the Posit folks would say).
   Answer: Tidy
10. In the third stage, the workhorse will be the _____.
    Answer: CEF
11. A variable will not have _____ if it does not measure what it is supposed to.
    Answer: Validity
12. How to handle missing data depends on whether they are missing _____.
    Answer: Endogenously
13. A national company has developed a new product and is offering it for sale
    at a discount to introduce it to the market. Randomly surveying customers
    who purchased the product in the initial discount period (would/would not)
    _____ generate a sample representing the population of typical customers.
    Answer: Would not
14. It is advisable to _____ the acquisition, transformation and analysis tasks.
    Answer: Separate
15. One reason reproducibility matters is to protect and support your _____ self.
    Answer: Future
16. Another reason reproducibility matters is to guard against _____ and _____.
    Answer: Error, Fraud
17. One important component is describing the exact _____ of your raw input data.
    Answer: Source
18. You should view a reproducible analysis as a _____ that you should be able
    to produce again and again.
    Answer: Product
19. A _____ is a representation of the data structure comprising all of the
    attributes of the data and their types.
    Answer: Data Schema
20. This representation of the data structure identifies the _____ to which
    each observation pertains.
    Answer: Unit of record
21. This representation of the data also makes clear what are the _____ that
    identify an observation.
    Answer: Key Variables
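
Items 19 to 21 can be made concrete with a small sketch. The Python below is only an illustration, not something taken from the course: it writes down a hypothetical schema for the store-day panel of item 6 and checks a few records against it. The field names (store_id, day, sales, staff, traffic), the helper check_records, and the sample values are all assumptions made up for this example.

```python
from collections import Counter
from datetime import date

# Hypothetical schema for the store-day panel; every name here is illustrative.
SCHEMA = {
    "unit_of_record": "store-day",
    "key_variables": ("store_id", "day"),
    "attributes": {
        "store_id": int,
        "day": date,
        "sales": float,
        "staff": int,
        "traffic": int,
    },
}


def check_records(records, schema):
    """Return a list of problems found when comparing records to the schema."""
    problems = []
    attributes = schema["attributes"]
    for i, rec in enumerate(records):
        # Each record must carry exactly the fields the schema names.
        if set(rec) != set(attributes):
            problems.append(f"record {i}: fields do not match schema")
            continue
        # Each field must hold a measurement of the declared type.
        for name, expected in attributes.items():
            if not isinstance(rec[name], expected):
                problems.append(f"record {i}: {name} is not {expected.__name__}")
    # The key variables together must identify each observation uniquely.
    keys = Counter(
        tuple(rec.get(k) for k in schema["key_variables"]) for rec in records
    )
    for key, count in keys.items():
        if count > 1:
            problems.append(f"duplicate key {key}: {count} records")
    return problems


# Made-up sample rows: one row per store per day.
records = [
    {"store_id": 1, "day": date(2024, 1, 1), "sales": 1200.0, "staff": 5, "traffic": 310},
    {"store_id": 1, "day": date(2024, 1, 2), "sales": 980.5, "staff": 4, "traffic": 275},
    {"store_id": 2, "day": date(2024, 1, 1), "sales": 1510.0, "staff": 6, "traffic": 402},
]
problems = check_records(records, SCHEMA)
```

In this sketch the pair (store_id, day) plays the role of the key variables, so a second row for the same store on the same day would be reported as a duplicate key. The rows are also tidy in the sense of item 5: one variable per field, one observation per record, one value per cell.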