QUESTIONS WITH ACCURATE SOLUTIONS
1. Fill in the blank: A preference in favor of or against a person, group of
people, or thing is called _____. It is an error in data analytics that can
systematically skew results in a certain direction.
data
anonymization
data collection
data bias
data interoperability
2. Why is it important to identify trends with the available data when faced
with insufficient data?
It guarantees accurate results without any
bias. It eliminates the need for further data
collection.
It allows for insights to be drawn despite limitations, guiding
decision-making.
It ensures that all data is used, regardless of quality.
3. What classification of data does the height of a skyscraper fall under?
Ordinal
Nominal
Categorical
Continuous
4. If you have a large dataset in a spreadsheet and want to analyze it
without losing sight of the column headers, what action should you take?
,Change the font size of the header row
, Sort the data alphabetically
Delete the header row
Freeze the header row
5. Describe how an interrupted download can affect data integrity in the
context of data analysis.
An interrupted download has no effect on data integrity as
the data can be re-downloaded.
An interrupted download can lead to an incomplete dataset,
compromising the accuracy and reliability of the analysis.
An interrupted download only affects the speed of data
processing.
An interrupted download enhances data integrity by allowing for
selective data retrieval.
6. Describe the main objectives of normalization in database management.
Normalization is used to merge different datasets into one.
Normalization focuses on increasing data redundancy and
complexity.
Normalization is a technique for cleansing data of errors.
Normalization aims to eliminate data redundancy, enhance data
integrity, and simplify the database structure.
7. An analyst used a column of a table to uniquely identify each
record within a table. Which tool did they use?
Normalization
Field
Primary key
Foreign key
, 8. If a data analyst finds discrepancies in a data set after the cleaning
process, what step should they take to ensure data integrity?
Re-clean the data without further analysis.
Immediately discard the data set as unreliable.
Increase the sample size to gather more data.
Conduct verification to confirm the accuracy and reliability
of the data set.
9. Describe how the COUNTIF function can be utilized in data analysis
for filtering data.
The COUNTIF function helps in merging datasets from different
sources.
The COUNTIF function is used to sum all values in a dataset
regardless of conditions.
The COUNTIF function is primarily used for data cleansing
operations.
The COUNTIF function allows analysts to filter data by counting
occurrences that meet specific criteria, such as values below a
certain threshold.
10. Which one of the following statements about margin of error is correct?
A 70% "yes" response with a margin of error of 5% means that
between 65% and 75% of the actual population thinks that the
answer is yes.
A 70% "yes" response with a margin of error of 5% means that 5%
of the actual population thinks that the answer is yes.
11. A junior data analyst at a dental care provider uses a tool to explore the
data in its patient database. They learn about the data types it contains,