✔✔Question 10
Which of the following SQL keywords can be used to append new rows to an existing
Delta table?
A. UPDATE
B. COPY
C. INSERT INTO
D. DELETE
E. UNION - ✔✔C. INSERT INTO
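Note: the append behavior of INSERT INTO can be sketched with a small runnable example. This is only an illustration: it assumes SQLite (Python's built-in sqlite3 module) as a stand-in for a Delta table, and the table name events and its columns are hypothetical. On Databricks the statement has the same basic shape, but the engine and storage differ.

```python
import sqlite3

# Assumption: SQLite stands in for a Delta table; "events" is a hypothetical table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER, name TEXT)")
conn.execute("INSERT INTO events VALUES (1, 'first')")

# INSERT INTO appends new rows; the existing row is left untouched.
conn.execute("INSERT INTO events VALUES (2, 'second'), (3, 'third')")

rows = conn.execute("SELECT id, name FROM events ORDER BY id").fetchall()
print(rows)
conn.close()
```

By contrast, UPDATE and DELETE modify or remove existing rows, and UNION combines query results without writing to the table.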
✔✔Question 11
A data engineering team needs to query a Delta table to extract rows that all meet the
same condition. However, the team has noticed that the query is running slowly. The
team has already tuned the size of the data files. Upon investigating, the team has
concluded that the rows meeting the condition are sparsely located throughout each of
the data files.
Based on the scenario, which of the following optimization techniques could speed up
the query?
A. Data skipping
B. Z-Ordering
C. Bin-packing
D. Write as a Parquet file
E. Tuning the file size - ✔✔B. Z-Ordering
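Note: a toy model can show why Z-Ordering helps in this scenario. The sketch below is not Delta Lake's implementation. It assumes a 16 x 16 grid of hypothetical (x, y) rows, files of 16 rows each, and a reader that skips any file whose min/max range for the filtered column cannot contain the value. Rows are laid out once in plain write order and once sorted by an interleaved-bit (Morton) key, which is the idea behind Z-ordering.

```python
def interleave_bits(x: int, y: int, bits: int = 4) -> int:
    """Morton (Z-order) key: interleave the low `bits` bits of x and y."""
    code = 0
    for i in range(bits):
        code |= ((x >> i) & 1) << (2 * i)
        code |= ((y >> i) & 1) << (2 * i + 1)
    return code


def split_into_files(rows, rows_per_file):
    """Cut an ordered list of rows into fixed-size 'files'."""
    return [rows[i:i + rows_per_file] for i in range(0, len(rows), rows_per_file)]


def files_scanned(files, column, value):
    """Count files whose min/max range for `column` could contain `value`."""
    count = 0
    for f in files:
        values = [row[column] for row in f]
        if min(values) <= value <= max(values):
            count += 1
    return count


# Hypothetical data: every (x, y) pair on a 16 x 16 grid.
rows = [(x, y) for x in range(16) for y in range(16)]

# Layout 1: rows written in plain order (sorted by x, then y).
plain_files = split_into_files(rows, 16)

# Layout 2: rows sorted by the Z-order key before being cut into files.
z_rows = sorted(rows, key=lambda r: interleave_bits(r[0], r[1]))
z_files = split_into_files(z_rows, 16)

# Filter on y == 5 (column index 1).
plain_scanned = files_scanned(plain_files, column=1, value=5)
z_scanned = files_scanned(z_files, column=1, value=5)
print(plain_scanned, z_scanned)
```

In the plain layout every file spans the full range of y, so no file can be skipped. In the Z-ordered layout matching rows are colocated, so most files are skipped using the same min/max statistics. The trade-off is that a filter on x alone touches more files than it would in a layout sorted purely by x.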
✔✔Question 12
A data engineer needs to create a database called customer360 at the location
/customer/customer360. The data engineer is unsure if one of their colleagues has
already created the database.
Which of the following commands should the data engineer run to complete this task?
A. CREATE DATABASE customer360 LOCATION '/customer/customer360';
B. CREATE DATABASE IF NOT EXISTS customer360;
C. CREATE DATABASE IF NOT EXISTS customer360 LOCATION
'/customer/customer360';
D. CREATE DATABASE IF NOT EXISTS customer360 DELTA LOCATION
'/customer/customer360';
E. CREATE DATABASE customer360 DELTA LOCATION '/customer/customer360'; -
✔✔C. CREATE DATABASE IF NOT EXISTS customer360 LOCATION
'/customer/customer360';
✔✔Question 13
A junior data engineer needs to create a Spark SQL table my_table for which Spark
manages both the data and the metadata. The metadata and data should also be stored
in the Databricks Filesystem (DBFS).
Which of the following commands should a senior data engineer share with the junior
data engineer to complete this task?
A. CREATE TABLE my_table (id STRING, value STRING) USING
org.apache.spark.sql.parquet OPTIONS (PATH "storage-path");
B. CREATE MANAGED TABLE my_table (id STRING, value STRING) USING
org.apache.spark.sql.parquet OPTIONS (PATH "storage-path");
C. CREATE MANAGED TABLE my_table (id STRING, value STRING);
D. CREATE TABLE my_table (id STRING, value STRING) USING DBFS;
E. CREATE TABLE my_table (id STRING, value STRING); - ✔✔E. CREATE TABLE
my_table (id STRING, value STRING);
✔✔Question 14
A data engineer wants to create a relational object by pulling data from two tables. The
relational object must be used by other data engineers in other sessions. In order to
save on storage costs, the data engineer wants to avoid copying and storing physical
data.
Which of the following relational objects should the data engineer create?
A. View
B. Temporary view
C. Delta Table
D. Database
E. Spark SQL Table - ✔✔A. View
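Note: the difference between a view and a temporary view can be sketched with a runnable analogy. The example assumes SQLite (Python's built-in sqlite3 module) in place of Databricks, with each connection playing the role of a session. The table and view names are hypothetical. Scoping rules in Spark SQL are similar in spirit, but this is not a Databricks example.

```python
import os
import sqlite3
import tempfile

# Assumption: a file-backed SQLite database stands in for the metastore.
db_path = os.path.join(tempfile.mkdtemp(), "demo.db")

# Session 1 creates two tables, a view joining them, and a temporary view.
s1 = sqlite3.connect(db_path)
s1.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
s1.execute("CREATE TABLE orders (customer_id INTEGER, total REAL)")
s1.execute("INSERT INTO customers VALUES (1, 'Ada')")
s1.execute("INSERT INTO orders VALUES (1, 9.5)")
s1.execute(
    "CREATE VIEW customer_orders AS "
    "SELECT c.name, o.total FROM customers c "
    "JOIN orders o ON o.customer_id = c.id"
)
s1.execute("CREATE TEMP VIEW customer_orders_tmp AS SELECT * FROM customer_orders")
s1.commit()

# Session 2 can query the view, which stores only the query, not a data copy.
s2 = sqlite3.connect(db_path)
view_rows = s2.execute("SELECT name, total FROM customer_orders").fetchall()

# The temporary view belongs to session 1 and is not visible here.
try:
    s2.execute("SELECT * FROM customer_orders_tmp")
    temp_visible = True
except sqlite3.OperationalError:
    temp_visible = False

print(view_rows, temp_visible)
s1.close()
s2.close()
```

The view satisfies both requirements in the question: it is visible to other sessions and it copies no physical data. A temporary view also avoids copying data but disappears with the session that created it.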
✔✔Question 15
A data engineering team has created a series of tables using Parquet data stored in an
external system. The team is noticing that after appending new rows to the data in the
external system, their queries within Databricks are not returning the new rows. They
identify the caching of the previous data as the cause of this issue.
Which of the following approaches will ensure that the data returned by queries is
always up-to-date?
A. The tables should be converted to the Delta format
B. The tables should be stored in a cloud-based external system
C. The tables should be refreshed in the writing cluster before the next query is run
D. The tables should be altered to include metadata to not cache
E. The tables should be updated before the next query is run - ✔✔A. The tables should
be converted to the Delta format
✔✔Question 16
A table customerLocations exists with the following schema:
id STRING, date STRING, city STRING, country STRING
A senior data engineer wants to create a new table from this table using the following
command:
CREATE TABLE customersPerCountry AS SELECT country,
COUNT(*) AS customers FROM customerLocations