Associate ACTUAL QUESTIONS AND
CORRECT ANSWERS
Which of the layers of the medallion architecture is most commonly used by data analysts? -
CORRECT ANSWERS✅✅Gold
Data analysts are primarily focused on deriving insights and making data-driven decisions.
The Gold layer, with its highly processed and refined data, is most suitable for their needs. It
provides them with reliable, business-ready data that can be used for in-depth analysis,
reporting, and visualization.
A data analyst has recently joined a new team that uses Databricks SQL, but the analyst has
never used Databricks before. The analyst wants to know where in Databricks SQL they can
write and execute SQL queries. On which of the following pages can the analyst write and
execute SQL queries? - CORRECT ANSWERS✅✅SQL Editor page
How should Databricks SQL should be used in relation to other business intelligence (BI)
tools like Tableau, Power BI, and looker? - CORRECT ANSWERS✅✅As a complementary
tool for quick in-platform BI work
Which of the following approaches can be used to connect Databricks to Fivetran for data
ingestion? - CORRECT ANSWERS✅✅Use Partner Connect's automated workflow to
establish a SQL warehouse (formerly known as a SQL endpoint) for Fivetran to interact with
Data professionals with varying titles use the Databricks SQL service as the primary
touchpoint with the Databricks Lakehouse Platform. However, some users will use other
services like Databricks Machine Learning or Databricks Data Science and Engineering.
Which of the following roles uses Databricks SQL as a secondary service while primarily
using one of the other services? - CORRECT ANSWERS✅✅Data engineer
A data analyst has set up a SQL query to run every four hours on a SQL endpoint, but the
SQL endpoint is taking too long to start up with each run.Which of the following changes can
the data analyst make to reduce the start-up time for the endpoint while managing costs? -
CORRECT ANSWERS✅✅Use a Serverless SQL endpoint
, Serverless SQL endpoints automatically scale resources based on the workload, ensuring
efficient utilization without the need for manual intervention.
A data engineering team has created a Structured Streaming pipeline that processes data in
micro-batches and populates gold-level tables. The microbatches are triggered every
minute.A data analyst has created a dashboard based on this gold-level data. The project
stakeholders want to see the results in the dashboard updated within one minute or less of
new data becoming available within the gold-level tables. Which of the following cautions
should the data analyst share prior to setting up the dashboard to complete this task? -
CORRECT ANSWERS✅✅The required compute resources could be costly
What approach can be used to ingest data directly from cloud-based object storage? -
CORRECT ANSWERS✅✅Create an external table while specifying the object storage path
to LOCATION
After running DESCRIBE EXTENDED accounts.customers;, the following was returned:
Name accounts.customer
Location dbfs:/stakeholders.customers
Provider Delta
Owner root
Type EXTERNAL
Now, a data analyst runs the following command:DROP accounts.customers;
Which of the following describes the result of running this command? - CORRECT
ANSWERS✅✅The accounts.customers table is removed from the metastore, but the
underlying data files are untouched.
Which of the following should data analysts consider when working with personally
identifiable information (PII) data?
A. Organization-specific best practices for PII data
B. Legal requirements for the area in which the data was collected
C. None of these considerations
D. Legal requirements for the area in which the analysis is being performed
E. All of these considerations - CORRECT ANSWERS✅✅E) All of these considerations