ASSESSMENT | OA V1 AND V2 | 2026 UPDATE | ACTUAL QUESTIONS AND ANSWERS | WITH COMPLETE SOLUTION
WGU D467 – EXPLORING DATA OBJECTIVE ASSESSMENT (OA) V1
Question 1
Which of the following best describes the primary goal of exploratory data
analysis (EDA)?
A. To confirm a hypothesis using inferential statistics
B. To summarize and understand key characteristics of a dataset
C. To eliminate outliers before modeling
D. To automate decision-making processes
Correct Answer: B
Rationale:
Exploratory Data Analysis focuses on summarizing and understanding the
main features of a dataset before formal modeling or hypothesis testing. It
helps analysts identify patterns, trends, and anomalies through summary
statistics and visualizations. EDA does not attempt to confirm hypotheses
but rather to generate insights and questions. This foundational
understanding supports better decision-making later in the analysis
process.
Question 2
Which type of data is best described as categories with no inherent order?
A. Ordinal
B. Interval
C. Nominal
D. Ratio
Correct Answer: C
Rationale:
Nominal data consists of categories that have no natural ranking or order,
such as colors or types of products. Unlike ordinal data, nominal data
cannot be meaningfully sorted. Mathematical operations like addition or
averaging are not applicable. This distinction is important when selecting
appropriate statistical methods.
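As a minimal sketch of what this means in practice, the Python snippet below counts category frequencies and finds the mode, which are the summaries that remain meaningful for nominal data. The color values are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical nominal data: product colors with no inherent order.
colors = ["red", "blue", "green", "blue", "red", "blue"]

# Counting how often each category occurs is meaningful for nominal data.
counts = Counter(colors)

# The mode (most frequent category) is a valid summary; a mean is not,
# because categories cannot be added or averaged.
mode_color, mode_count = counts.most_common(1)[0]

print(counts)
print(mode_color, mode_count)
```

Sorting these labels alphabetically is possible, but the resulting order carries no analytical meaning, which is what separates nominal from ordinal data.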
Question 3
Which measure of central tendency is most affected by extreme outliers?
A. Median
B. Mode
C. Mean
D. Range
Correct Answer: C
Rationale:
The mean is calculated by summing all values and dividing by the total
number of observations, making it highly sensitive to extreme values.
Outliers can significantly distort the mean compared to other measures.
The median and mode are more resistant to outliers, and the range is a
measure of spread rather than central tendency. Understanding this
helps analysts choose appropriate summary statistics.
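The effect can be checked with a short Python sketch using the standard `statistics` module. The values below are hypothetical; the point is only to compare how far the mean and the median move when one extreme value is added.

```python
import statistics

# Hypothetical values; 500 is added later as an extreme outlier.
values = [40, 42, 45, 47, 50]
with_outlier = values + [500]

mean_before = statistics.mean(values)            # 224 / 5 = 44.8
median_before = statistics.median(values)        # middle value: 45

mean_after = statistics.mean(with_outlier)       # 724 / 6, about 120.67
median_after = statistics.median(with_outlier)   # (45 + 47) / 2 = 46

print(mean_before, mean_after)
print(median_before, median_after)
```

One added value moves the mean from 44.8 to roughly 120.67, while the median shifts only from 45 to 46.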
Question 4
Which visualization is most appropriate for showing the distribution of a
single numerical variable?
A. Scatter plot
B. Histogram
C. Line chart
D. Pie chart
Correct Answer: B
Rationale:
A histogram displays how numerical data is distributed across intervals or
bins. It allows analysts to see the shape, spread, and center of the data.
Scatter plots show the relationship between two variables, line charts
show change over time, and pie charts show proportions of a whole.
Choosing the right visualization improves interpretability.
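To illustrate the binning idea behind a histogram, here is a small Python sketch that groups values into equal-width bins and prints a text histogram. The scores, the bin width of 10, and the starting edge of 50 are all assumptions made for this example.

```python
# Hypothetical exam scores for a single numerical variable.
scores = [52, 58, 61, 64, 67, 68, 71, 73, 74, 75, 78, 82, 85, 91, 97]

bin_width = 10   # assumed width of each interval
start = 50       # assumed left edge of the first bin

# Count how many values fall into each bin, keyed by the bin's left edge.
bin_counts = {}
for s in scores:
    left = start + ((s - start) // bin_width) * bin_width
    bin_counts[left] = bin_counts.get(left, 0) + 1

# Print one row per bin; the bar length is the number of values in that bin.
for left in sorted(bin_counts):
    print(f"{left}-{left + bin_width - 1}: {'#' * bin_counts[left]}")
```

The longest bar falls in the 70-79 bin, which shows where the center of this sample lies. In practice a plotting library would draw the bars, but the counting step is the same.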
Question 5
What does the range of a dataset represent?
A. The average value
B. The most frequent value
C. The difference between the highest and lowest values
D. The middle value when sorted
Correct Answer: C
Rationale:
The range measures variability by subtracting the smallest value from the
largest value in a dataset. It provides a simple indication of data spread.
However, it is sensitive to extreme values and does not describe
distribution shape. Analysts often pair it with other measures of variability.
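A brief Python sketch of the calculation, using hypothetical values, also shows how a single extreme value changes the range:

```python
# Hypothetical dataset.
data = [12, 15, 18, 21, 24]

# Range = largest value minus smallest value.
data_range = max(data) - min(data)                       # 24 - 12 = 12

# Adding one extreme value inflates the range dramatically.
with_extreme = data + [90]
extreme_range = max(with_extreme) - min(with_extreme)    # 90 - 12 = 78

print(data_range, extreme_range)
```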
Question 6
Which measure best represents variability around the mean?
A. Mean
B. Median
C. Standard deviation
D. Mode
Correct Answer: C
Rationale:
Standard deviation quantifies how much data points typically differ from
the mean. A larger standard deviation indicates greater variability in the
data. It is widely used in data analysis to understand consistency and risk.
Because it is expressed in the same units as the data, it is especially
useful for comparing the spread of datasets measured on the same scale.
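As a rough illustration, the Python sketch below compares two hypothetical datasets that share the same mean but differ in spread. It uses `statistics.pstdev` (population formula, dividing by n) and `statistics.stdev` (sample formula, dividing by n - 1). The question does not say which version is intended, so both are shown.

```python
import statistics

# Two hypothetical datasets with the same mean (50) but different spread.
consistent = [48, 49, 50, 51, 52]
variable = [30, 40, 50, 60, 70]

# Population standard deviation: square root of the mean squared deviation.
pop_sd_consistent = statistics.pstdev(consistent)   # sqrt(10 / 5), about 1.41
pop_sd_variable = statistics.pstdev(variable)       # sqrt(1000 / 5), about 14.14

# Sample standard deviation divides by n - 1 instead of n.
sample_sd_consistent = statistics.stdev(consistent) # sqrt(10 / 4), about 1.58
sample_sd_variable = statistics.stdev(variable)     # sqrt(1000 / 4), about 15.81

print(pop_sd_consistent, pop_sd_variable)
print(sample_sd_consistent, sample_sd_variable)
```

Both datasets have the same mean, so the mean alone cannot distinguish them. The standard deviation is ten times larger for the second dataset.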
Question 7
A boxplot is particularly useful for identifying which of the following?
A. Correlation strength
B. Time trends
C. Outliers and data spread
D. Exact data values
Correct Answer: C
Rationale:
Boxplots visually display the median, quartiles, and potential outliers in a
dataset. They help analysts quickly identify skewness and variability.
While they do not show every individual value (typically only outliers are
plotted as separate points), they provide a strong summary of distribution.
This makes them ideal for comparing multiple groups.
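The quantities a boxplot draws can be computed directly. The Python sketch below uses hypothetical data and flags outliers with the common 1.5 × IQR rule. That rule is an assumption here, since the rationale does not state how outliers are defined, and quartile values can differ slightly between methods (this sketch uses the `inclusive` method of `statistics.quantiles`).

```python
import statistics

# Hypothetical dataset with one unusually large value.
data = [10, 12, 13, 14, 15, 16, 17, 18, 40]

# Quartiles: Q1, median (Q2), and Q3.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1   # interquartile range: the height of the box

# Assumed convention: points beyond 1.5 * IQR from the box are outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(min(data), q1, q2, q3, max(data))
print(outliers)
```

Here the quartiles are 13, 15, and 17, the fences fall at 7 and 23, and only the value 40 is flagged as an outlier.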
Question 8