AWS Certified Data Engineer Associate DEA-
C01 Exam Questions With Correct Answers
What feature in AWS Glue must be used if you have a lot of partitions
| | | | | | | | | | | | | | |
for a table and need to implement server-side partition pruning?
| | | | | | | | |
Catalog Partition Predicates
| |
Helps you create point-to-point integrations between event producers
| | | | | | | |
and consumers with optional transform, filter and enrich steps.
| | | | | | | |
Amazon EventBridge Pipes | |
It refers to a temporary cluster that is created on-demand to perform
| | | | | | | | | | | |
specific tasks or jobs and is terminated once the tasks are completed.
| | | | | | | | | | |
Transient EMR cluster | |
Helps you monitor and track database activities, including queries
| | | | | | | | |
executed, connections established, and schema changes, providing
| | | | | | |
visibility and enhancing security and compliance.
| | | | |
,Amazon Redshift audit logging | | |
It gathers container-related metrics at the cluster, pod, and container
| | | | | | | | | |
levels. Also it collect performance and application logs.
| | | | | | |
Amazon CloudWatch Container Insights| | |
It's a feature of Amazon OpenSearch Service that stores large amounts
| | | | | | | | | | |
of read-only data cost-effectively through standard data nodes that use
| | | | | | | | |
|"hot" storage for faster performance.
| | | |
UltraWarm data nodes | |
An Amazon Redshift feature that helps you query and analyze data
| | | | | | | | | | |
directly from files stored in Amazon S3, extending Redshift's querying
| | | | | | | | | |
capabilities to exabytes of data without loading it into the data
| | | | | | | | | | |
warehouse.
Amazon Redshift Spectrum | |
An AWS Service tool for visual data preparation that allows users to
| | | | | | | | | | | |
clean and normalize data without writing code.
| | | | | |
, AWS Glue DataBrew
| |
Users can easily modify their data structure with schema evolution,
| | | | | | | | | |
allowing them to add, rename, or remove columns from a data table
| | | | | | | | | | | |
without affecting the underlying data.
| | | |
Apache Iceberg table format | | |
A performance optimization technique used in databases and data
| | | | | | | | |
processing frameworks where filtering is applied as close to the data
| | | | | | | | | | |
source as possible. | |
Pushdown Predicates |
An integrated development environment (IDE) for creating, running, and
| | | | | | | |
monitoring ETL (Extract, Transform, Load) jobs in AWS Glue.
| | | | | | | | |
AWS Glue Studio
| |
Helps organizations manage the complexity of data schemas in modern
| | | | | | | | | |
data architectures, ensuring data consistency, quality, and reliability
| | | | | | | |
across various data sources and applications.
| | | | |
C01 Exam Questions With Correct Answers
What feature in AWS Glue must be used if you have a lot of partitions
| | | | | | | | | | | | | | |
for a table and need to implement server-side partition pruning?
| | | | | | | | |
Catalog Partition Predicates
| |
Helps you create point-to-point integrations between event producers
| | | | | | | |
and consumers with optional transform, filter and enrich steps.
| | | | | | | |
Amazon EventBridge Pipes | |
It refers to a temporary cluster that is created on-demand to perform
| | | | | | | | | | | |
specific tasks or jobs and is terminated once the tasks are completed.
| | | | | | | | | | |
Transient EMR cluster | |
Helps you monitor and track database activities, including queries
| | | | | | | | |
executed, connections established, and schema changes, providing
| | | | | | |
visibility and enhancing security and compliance.
| | | | |
,Amazon Redshift audit logging | | |
It gathers container-related metrics at the cluster, pod, and container
| | | | | | | | | |
levels. Also it collect performance and application logs.
| | | | | | |
Amazon CloudWatch Container Insights| | |
It's a feature of Amazon OpenSearch Service that stores large amounts
| | | | | | | | | | |
of read-only data cost-effectively through standard data nodes that use
| | | | | | | | |
|"hot" storage for faster performance.
| | | |
UltraWarm data nodes | |
An Amazon Redshift feature that helps you query and analyze data
| | | | | | | | | | |
directly from files stored in Amazon S3, extending Redshift's querying
| | | | | | | | | |
capabilities to exabytes of data without loading it into the data
| | | | | | | | | | |
warehouse.
Amazon Redshift Spectrum | |
An AWS Service tool for visual data preparation that allows users to
| | | | | | | | | | | |
clean and normalize data without writing code.
| | | | | |
, AWS Glue DataBrew
| |
Users can easily modify their data structure with schema evolution,
| | | | | | | | | |
allowing them to add, rename, or remove columns from a data table
| | | | | | | | | | | |
without affecting the underlying data.
| | | |
Apache Iceberg table format | | |
A performance optimization technique used in databases and data
| | | | | | | | |
processing frameworks where filtering is applied as close to the data
| | | | | | | | | | |
source as possible. | |
Pushdown Predicates |
An integrated development environment (IDE) for creating, running, and
| | | | | | | |
monitoring ETL (Extract, Transform, Load) jobs in AWS Glue.
| | | | | | | | |
AWS Glue Studio
| |
Helps organizations manage the complexity of data schemas in modern
| | | | | | | | | |
data architectures, ensuring data consistency, quality, and reliability
| | | | | | | |
across various data sources and applications.
| | | | |