California Research Data Specialist EXAM ACTUAL
Exam ALL 200 QUESTIONS AND CORRECT
ANSWERS LATEST UPDATE THIS YEAR
California Research Data Specialist Exam Coverage
• Data Administration & Governance: Data cleaning, ETL processes, data dictionaries, and
California Information Practices Act (IPA) compliance.
• Quantitative & Qualitative Analysis: Descriptive/inferential statistics, regression
models, and longitudinal studies.
• Research Design: Sampling techniques, hypothesis testing, validity/reliability, and
experimental vs. observational designs.
• Data Visualization & Reporting: ADA Section 508 compliance for state reports,
dashboarding, and interpreting complex data for policy makers.
• Software Proficiency: General principles of SQL, SAS, R, Python, and Excel for state-level
data manipulation.
California Research Data Specialist Exam: Batch 1 (Questions 1-100)
1
, Page 2 of 109
1. Which of the following best describes the primary responsibility of a Research Data
Specialist in a state agency setting?
• Answer: B. Managing and analyzing research data to inform public policy.
• Rationale: While labs and grants exist, the "Specialist" role in California state service is
primarily focused on the lifecycle of data—from collection to analysis—to support
evidence-based decision-making.
2. What is the primary purpose of a data dictionary when managing large, multi-year
longitudinal datasets for California state health programs?
• Answer: A. To provide standardized definitions, formats, and coding structures for all
variables.
• Rationale: A data dictionary ensures that different researchers across various
departments interpret the data identically, maintaining longitudinal integrity and
preventing "data silos."
3. When preparing a public-facing report for a California state department, which federal and
state standard must be met to ensure the document is accessible to individuals with
disabilities?
• A. The California Public Records Act (CPRA) formatting guidelines.
• B. Section 508 of the Rehabilitation Act and California Government Code 11135.
2
, Page 3 of 109
• C. The APA style manual for academic publication and peer review.
• D. The internal branding guidelines of the specific California agency.
• Answer: B. Section 508 and CA Government Code 11135.
• Rationale: State agencies are legally mandated to provide accessible electronic
information. This includes "Alt-text" for images and proper tagging for screen readers in
PDF or HTML reports.
4. A Research Data Specialist is tasked with merging two large datasets from different
departments that do not share a common unique identifier. Which technique is most
appropriate for this task?
• A. Simple random sampling to see if any rows match by coincidence.
• B. Deterministic matching based on a single variable like "Last Name."
• C. Probabilistic data linkage using a combination of variables like birthdate, gender, and
zip code.
• D. Manual data entry of one dataset into the other to ensure 100% accuracy.
• Answer: C. Probabilistic data linkage.
• Rationale: When unique keys (like SSN) are missing, probabilistic matching calculates
the likelihood that two records belong to the same entity based on a weighted score of
multiple identifiers.
3
, Page 4 of 109
5. Under the California Information Practices Act, what is the primary requirement for a
Research Data Specialist when handling "Personal Information" (PI)?
• A. All personal information must be posted on the agency's website for transparency.
• B. Access to PI must be limited to employees who require the data to perform their
official duties.
• C. PI should be encrypted only if the data is being sent to a federal agency.
• D. PI must be deleted immediately after the statistical analysis is completed.
• Answer: B. Access to PI must be limited to "need-to-know" staff.
• Rationale: The IPA protects the privacy of individuals by restricting data access and
requiring agencies to maintain strict security logs and justification for data usage.
6. If a Research Data Specialist observes a "p-value" of 0.02 in a study regarding the efficacy
of a new state employment program, how should they interpret this result relative to a
standard alpha level of 0.05?
• A. The results are not statistically significant and the null hypothesis should be accepted.
• B. There is a 2% chance that the null hypothesis is true, so the program is a failure.
• C. The results are statistically significant, providing enough evidence to reject the null
hypothesis.
• D. The sample size was too small to make any valid statistical conclusions.
4