Databricks Data Engineering Associate – Certification Exam Newest 2025/2026 With Complete Questions And Correct Answers | Already Graded A+ | Brand New Version
A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or version their project using Databricks Repos. Which of the following is an advantage of using Databricks Repos over the Databricks Notebooks versioning?
A. Databricks Repos automatically saves development progress
B. Databricks Repos supports the use of multiple branches
C. Databricks Repos allows users to revert to previous versions of a notebook
D. Databricks Repos provides the ability to comment on specific changes
E. Databricks Repos is wholly housed within the Databricks Lakehouse Platform
Answer: B. Databricks Repos supports the use of multiple branches
A data engineer has left the organization. The data team needs to transfer ownership of the data engineer's Delta tables to a new data engineer. The new data engineer is the lead engineer on the data team. Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?
A. Databricks account representative
B. This transfer is not possible
C. Workspace administrator
D. New lead data engineer
E. Original data engineer
Answer: C. Workspace administrator
8/28/25, 9:56 PM Databricks Certified Data Engineer Associate
A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL. Which of the following commands could the data engineering team use to access sales in PySpark?
A. SELECT * FROM sales
B. There is no way to share data between PySpark and SQL.
C. spark.sql("sales")
D. spark.delta.table("sales")
E. spark.table("sales")
Answer: E. spark.table("sales")
Which of the following commands will return the location of database customer360?
A. DESCRIBE LOCATION customer360;
B. DROP DATABASE customer360;
C. DESCRIBE DATABASE customer360;
D. ALTER DATABASE customer360 SET DBPROPERTIES ('location' = '/user');
E. USE DATABASE customer360;
Answer: C. DESCRIBE DATABASE customer360;
D. COMMENT "Contains PII"
Which of the following benefits is provided by the array functions from Spark SQL?
A. An ability to work with data in a variety of types at once
B. An ability to work with data within certain partitions and windows
C. An ability to work with time-related data in specified intervals
D. An ability to work with complex, nested data ingested from JSON files
E. An ability to work with an array of tables for procedural automation
Answer: D. An ability to work with complex, nested data ingested from JSON files
Which of the following commands can be used to write data into a Delta table while avoiding the writing of duplicate records?
A. DROP
B. IGNORE
C. MERGE
D. APPEND
E. INSERT
Answer: C. MERGE
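MERGE avoids duplicates because it compares incoming rows with the rows already in the target on a key and inserts only those that have no match. The sketch below is a plain-Python model of that insert-only behaviour, not the Delta Lake API. The function name `merge_insert_only`, the `id` key column, and the sample rows are all hypothetical.

```python
from typing import Dict, List


def merge_insert_only(target: List[Dict], source: List[Dict], key: str) -> List[Dict]:
    """Model of a merge that inserts only source rows with no matching key in the target."""
    # Keys are taken from the target as it stood before the merge.
    existing_keys = {row[key] for row in target}
    merged = list(target)
    for row in source:
        # Insert only rows whose key is not already present in the target.
        if row[key] not in existing_keys:
            merged.append(row)
    return merged


# Hypothetical sample data: id 2 already exists in the target, id 3 is new.
target_rows = [{"id": 1, "amount": 10}, {"id": 2, "amount": 20}]
source_rows = [{"id": 2, "amount": 20}, {"id": 3, "amount": 30}]

result = merge_insert_only(target_rows, source_rows, key="id")
print([row["id"] for row in result])  # [1, 2, 3]
```

A plain append of the same source rows would have written id 2 a second time, which is the behaviour the question asks to avoid.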
A. CREATE FUNCTION combine_nyc(city STRING)
RETURNS STRING
RETURN CASE
WHEN city = "brooklyn" THEN "new york"
ELSE city
END;
A data analyst has a series of queries in a SQL program. The data analyst wants this program to run every day. They only want the final query in the program to run on Sundays. They ask for help from the data engineering team to complete this task. Which of the following approaches could be used by the data engineering team to complete this task?
A. They could submit a feature request with Databricks to add this functionality.
B. They could wrap the queries using PySpark and use Python's control flow system to determine when to run the final query.
C. They could only run the entire program on Sundays.
D. They could automatically restrict access to the source table in the final query so that it is only accessible on Sundays.
E. They could redesign the data model to separate the data used in the final query into a new table.
Answer: B. They could wrap the queries using PySpark and use Python's control flow system to determine when to run the final query.
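One way the "wrap the queries and use Python's control flow" approach might look is sketched below. The query strings, the `run_program` function, and the `run_query` parameter are hypothetical. The runner is passed in so the sketch executes without a cluster; on Databricks, `spark.sql` could be supplied in its place.

```python
import datetime
from typing import Callable, List

# Hypothetical placeholder queries; the analyst's real SQL is not given in the question.
DAILY_QUERIES = [
    "SELECT 1 AS daily_step_one",
    "SELECT 2 AS daily_step_two",
]
FINAL_QUERY = "SELECT 3 AS sunday_only_step"


def run_program(run_query: Callable[[str], object], today: datetime.date) -> List[str]:
    """Run every daily query, and the final query only when `today` is a Sunday."""
    executed = []
    for query in DAILY_QUERIES:
        run_query(query)
        executed.append(query)
    # date.weekday() returns 0 for Monday through 6 for Sunday.
    if today.weekday() == 6:
        run_query(FINAL_QUERY)
        executed.append(FINAL_QUERY)
    return executed


# Stand-in runner that just records the SQL it receives.
log: List[str] = []
sunday = datetime.date(2025, 8, 31)
monday = datetime.date(2025, 9, 1)
print(len(run_program(log.append, sunday)))  # 3
print(len(run_program(log.append, monday)))  # 2
```

The job itself is still scheduled daily; only the branch inside the program decides whether the last query runs.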
C. The previous day's file
has already been copied
into the table.
A. org.apache.spark.sql.jdbc
B. CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
UNION
SELECT * FROM april_transactions;
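The statement above relies on UNION, which removes duplicate rows from the combined result, whereas UNION ALL keeps them. The sketch below shows the difference using Python's built-in `sqlite3` module as a stand-in for Spark SQL, so it runs anywhere. The `id` and `amount` columns and the sample rows are hypothetical, since the source does not give the table schemas.

```python
import sqlite3

# In-memory SQLite database used only as a stand-in for a Spark SQL session.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE march_transactions (id INTEGER, amount REAL);
    CREATE TABLE april_transactions (id INTEGER, amount REAL);
    INSERT INTO march_transactions VALUES (1, 10.0), (2, 20.0);
    INSERT INTO april_transactions VALUES (2, 20.0), (3, 30.0);
    CREATE TABLE all_transactions AS
        SELECT * FROM march_transactions
        UNION
        SELECT * FROM april_transactions;
    """
)

# UNION drops the row (2, 20.0) that appears in both monthly tables.
union_count = conn.execute("SELECT COUNT(*) FROM all_transactions").fetchone()[0]

# UNION ALL keeps every row from both inputs, including the repeated one.
union_all_count = conn.execute(
    "SELECT COUNT(*) FROM (SELECT * FROM march_transactions "
    "UNION ALL SELECT * FROM april_transactions)"
).fetchone()[0]

print(union_count, union_all_count)  # 3 4
```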
A data engineer only wants to execute the final block of a Python program if the Python variable day_of_week is equal to 1 and the Python variable review_period is True. Which of the following control flow statements should the data engineer use to begin this conditionally executed code block?
A. if day_of_week = 1 and review_period:
B. if day_of_week = 1 and review_period = "True":
C. if day_of_week == 1 and review_period == "True":
D. if day_of_week == 1 and review_period:
E. if day_of_week = 1 & review_period: = "True":
Answer: D. if day_of_week == 1 and review_period:
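The options differ on two points: `==` compares values while a single `=` is assignment (a syntax error inside an `if` condition), and the boolean `True` is not equal to the string "True". A minimal sketch of the condition follows; the wrapper function `should_run_final_block` is hypothetical and exists only to make the check easy to call.

```python
def should_run_final_block(day_of_week: int, review_period: bool) -> bool:
    """Return True only when day_of_week equals 1 and review_period is truthy."""
    # `review_period` is already a boolean, so it is tested directly rather
    # than compared against the string "True".
    if day_of_week == 1 and review_period:
        return True
    return False


print(should_run_final_block(1, True))   # True
print(should_run_final_block(1, False))  # False
print(should_run_final_block(2, True))   # False

# Comparing a boolean with the string "True" is always False, which is why
# the options written with == "True" would never run the block.
print(True == "True")  # False
```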