✔✔What are the key features and advantages of using Photon? - ✔✔- Support for SQL and
equivalent DataFrame operations with Delta and Parquet tables.
- Accelerated queries that process data faster and include aggregations and joins.
- Faster performance when data is accessed repeatedly from the disk cache.
- Robust scan performance on tables with many columns and many small files.
- Faster Delta and Parquet writing using UPDATE, DELETE, MERGE INTO, INSERT,
and CREATE TABLE AS SELECT, including wide tables that contain thousands of
columns.
- Replaces sort-merge joins with hash-joins.
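To make the last point concrete, here is a minimal sketch of the hash-join technique in plain Python. It is a toy illustration of the build/probe idea only, not Photon's actual implementation; the function name `hash_join` and the sample rows are hypothetical.

```python
from collections import defaultdict


def hash_join(left, right, key):
    """Inner-join two lists of dicts on `key` using a build/probe hash join."""
    # Build phase: index the right side (ideally the smaller input) by join key.
    buckets = defaultdict(list)
    for row in right:
        buckets[row[key]].append(row)

    # Probe phase: look up each left row's key in the hash table.
    # Unlike a sort-merge join, neither input has to be sorted first.
    joined = []
    for row in left:
        for match in buckets.get(row[key], []):
            joined.append({**row, **match})
    return joined


# Hypothetical sample data for illustration.
orders = [
    {"customer_id": 1, "amount": 30},
    {"customer_id": 2, "amount": 15},
    {"customer_id": 1, "amount": 5},
]
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 3, "name": "Lin"},
]

result = hash_join(orders, customers, "customer_id")
```

Only the two orders for customer 1 have a match, so `result` holds two joined rows. A sort-merge join would instead sort both inputs on the key and then walk them in step.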
✔✔What is the benefit to a business if they use Photon? - ✔✔Although Photon is more
expensive, it delivers better performance.
Overall, the TCO is worth it for the business: cluster maintenance and optimization
exercises used to take time and require expensive, specialized talent, whereas Photon
just works.
✔✔What is a consequence of using Unity Catalog to manage, organize and segregate
data objects? - ✔✔Complete data object referencing requires three levels
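As a minimal sketch of what "three levels" means, a complete reference has the form catalog.schema.table. The helper below splits such a name into its parts. The names `TableName`, `parse_table_name` and `main.sales.orders` are hypothetical, and the sketch assumes simple identifiers (it does not handle backtick-quoted names that contain dots).

```python
from typing import NamedTuple


class TableName(NamedTuple):
    catalog: str
    schema: str
    table: str


def parse_table_name(full_name: str) -> TableName:
    """Split a fully qualified name into catalog, schema and table."""
    parts = full_name.split(".")
    # A complete reference needs exactly three non-empty levels.
    if len(parts) != 3 or not all(parts):
        raise ValueError(
            f"expected catalog.schema.table, got {full_name!r}"
        )
    return TableName(*parts)


# Hypothetical example name.
name = parse_table_name("main.sales.orders")
```

Here `name.catalog` is `"main"`, `name.schema` is `"sales"` and `name.table` is `"orders"`. A two-level name such as `"sales.orders"` raises `ValueError` because the catalog level is missing.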
✔✔In which of the following ways do serverless compute resources differ from classic
compute resources within the Databricks Lakehouse Platform? - ✔✔- They exist within
the Databricks cloud account
- They are always running and reserved for a single, specific customer when needed
✔✔Where do non-serverless compute resources exist? - ✔✔Inside the customer's
AWS/Azure/GCP environment
✔✔Which of the Databricks Lakehouse Platform services or capabilities provides a data
warehousing experience to its users? - ✔✔Databricks SQL
✔✔Explain Databricks to a five-year-old - ✔✔It makes little bits of big computers use data
in lots of ways and in lots of languages.
✔✔Explain Databricks to a 15-year-old - ✔✔It's a way of running code in five or so languages on
Spark distributed computing. The code can be anything from ETL to data science and
machine learning, depending on what you write.
It also acts as a platform for managing all of the above: sharing, collaboration, and
cluster (virtual computer) management.
(Users report that it is pretty intuitive: they can just write SQL and interact with their
data lake. It can be expensive if not managed well.)
✔✔If a data architect is evaluating data warehousing solutions for their organization to
use, what are some benefits of using the Databricks Lakehouse Platform for
warehousing that you'd reference? - ✔✔- Engineering capabilities supporting
warehouse source data
- Best available price/performance
- A rich ecosystem of business intelligence (BI) integrations
- Local development software to integrate with other capabilities
✔✔True or False: Databricks Workflows supports workloads across multiple cloud
service providers and tools. - ✔✔True. Databricks Workflows supports workloads
across multiple cloud service providers and tools.
✔✔What is Databricks Workflows? - ✔✔A managed orchestration service, fully integrated
with the Databricks Data Intelligence Platform.
✔✔What does Databricks Workflows let you do? - ✔✔It lets you easily define, manage
and monitor multitask workflows for ETL, analytics and machine learning pipelines.
With a wide range of supported task types, deep observability capabilities and high
reliability, your data teams are empowered to better automate and orchestrate any
pipeline and become more productive.
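To illustrate what a multitask workflow looks like, here is a minimal sketch that describes a job as a plain Python dict and works out a valid run order from the task dependencies. The field names (`tasks`, `task_key`, `depends_on`, `notebook_task`) are modeled on the Databricks Jobs API and should be checked against the official documentation. The job name, task names and notebook paths are hypothetical, and `run_order` is an illustrative helper, not part of any Databricks library. It needs Python 3.9 or later for `graphlib`.

```python
from graphlib import TopologicalSorter

# Hypothetical job definition: ingest -> transform -> (train, report).
job = {
    "name": "nightly_pipeline",
    "tasks": [
        {
            "task_key": "ingest",
            "notebook_task": {"notebook_path": "/Jobs/ingest"},
        },
        {
            "task_key": "transform",
            "depends_on": [{"task_key": "ingest"}],
            "notebook_task": {"notebook_path": "/Jobs/transform"},
        },
        {
            "task_key": "train",
            "depends_on": [{"task_key": "transform"}],
            "notebook_task": {"notebook_path": "/Jobs/train"},
        },
        {
            "task_key": "report",
            "depends_on": [{"task_key": "transform"}],
            "notebook_task": {"notebook_path": "/Jobs/report"},
        },
    ],
}


def run_order(job_spec):
    """Return the task keys in an order that respects every depends_on edge."""
    # Map each task to the set of tasks that must finish before it starts.
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task.get("depends_on", [])}
        for task in job_spec["tasks"]
    }
    return list(TopologicalSorter(graph).static_order())


order = run_order(job)
```

In `order`, `ingest` comes first and `transform` second. `train` and `report` both depend only on `transform`, so they may appear in either order, and an orchestrator could run them in parallel.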
✔✔Why did Databricks develop Databricks Workflows? - ✔✔Many organizations use a
variety of open-source and proprietary tools for data orchestration, but these tools often
have their own limitations.
✔✔What are the benefits of Databricks Workflows (big picture)? - ✔✔- Unified with the
Databricks Data Intelligence Platform
- Reliability at scale
- Deep monitoring and observability
- Batch and streaming