Analytics Boot Camp Certification Exam
Guide
Question 1. **Which Excel function returns the value at the intersection of a given row and
column in a range?**
A) VLOOKUP
B) HLOOKUP
C) INDEX
D) MATCH
Answer: C
Explanation: INDEX retrieves the value at a specified row and column index within a range,
unlike VLOOKUP/HLOOKUP which search for a value.
Question 2. **In a Pivot Table, which field area should you place a categorical variable to see
the breakdown of each category?**
A) Values
B) Columns
C) Rows
D) Filters
Answer: C
Explanation: Placing a categorical field in the Rows area creates a row for each distinct category,
allowing you to view its breakdown.
Question 3. **What does the VBA statement `For i = 1 To 10 Step 2` do?**
A) Loops from 1 to 10 incrementing by 1
B) Loops from 1 to 10 incrementing by 2
C) Loops from 10 to 1 decrementing by 2
D) Loops indefinitely
, [GTDAB] Georgia Tech Data Science and
Analytics Boot Camp Certification Exam
Guide
Answer: B
Explanation: The `Step 2` clause makes the loop increment the counter by 2 each iteration,
producing values 1,3,5,7,9.
Question 4. **Which measure of central tendency is most appropriate for a highly skewed
distribution?**
A) Mean
B) Median
C) Mode
D) Standard deviation
Answer: B
Explanation: The median is resistant to extreme values and thus better represents the center of
a skewed distribution than the mean.
Question 5. **When performing a two‑sample t‑test assuming equal variances, which statistic is
compared to the critical value?**
A) Z‑score
B) t‑statistic
C) F‑statistic
D) chi‑square
Answer: B
Explanation: The two‑sample t‑test uses the t‑statistic, calculated from the difference of means
and pooled variance, to test significance.
Question 6. **In linear regression, the coefficient of determination (R²) represents:**
, [GTDAB] Georgia Tech Data Science and
Analytics Boot Camp Certification Exam
Guide
A) The slope of the regression line
B) The proportion of variance in the dependent variable explained by the model
C) The correlation between independent variables
D) The p‑value of the model
Answer: B
Explanation: R² quantifies how much of the total variability in the outcome is accounted for by
the predictors.
Question 7. **Which Python keyword is used to define a function?**
A) def
B) function
C) lambda
D) func
Answer: A
Explanation: The `def` keyword introduces a new function definition in Python.
Question 8. **What is the output type of `np.array([1, 2, 3]) * np.array([4, 5, 6])`?**
A) List
B) Tuple
C) NumPy array with element‑wise multiplication result
D) Scalar product
Answer: C
Explanation: NumPy arrays support element‑wise operations; the result is an array `[4,10,18]`.
, [GTDAB] Georgia Tech Data Science and
Analytics Boot Camp Certification Exam
Guide
Question 9. **Which Pandas method removes rows with any missing values?**
A) dropna()
B) fillna()
C) isnull()
D) replace()
Answer: A
Explanation: `dropna()` drops rows (or columns) that contain NaN values.
Question 10. **In Pandas, how would you select rows where column `age` is greater than 30?**
A) df[‘age’] > 30
B) df.loc[df[‘age’] > 30]
C) df.where(df[‘age’] > 30)
D) df.filter(age > 30)
Answer: B
Explanation: `df.loc[condition]` returns rows satisfying the Boolean condition.
Question 11. **Which Matplotlib function creates a scatter plot?**
A) plot()
B) bar()
C) scatter()
D) hist()
Answer: C
Explanation: `scatter()` draws a scatter plot of x‑y points.