AWS Certified Data Engineer – Associate
Exam Questions With Correct Answers
A company is running an Amazon Redshift cluster. A data engineer must
| | | | | | | | | | | |
design a solution that would give the company the ability to provide
| | | | | | | | | | | |
analysis on a separate test environment in Amazon Redshift. The
| | | | | | | | | |
solution would use the data from the main Redshift cluster. The second
| | | | | | | | | | | |
cluster is expected to be used for only 2 hours every 2 weeks as part of
| | | | | | | | | | | | | | | |
the new testing process. - CORRECT ANSWER✔✔-Create a data share
| | | | | | | | | |
from the main Redshift test cluster. Use Amazon Redshift Serverless for
| | | | | | | | | | |
the test environment
| |
A company is running a cloud-based software application in an Amazon
| | | | | | | | | | |
EC2 instance backed by an Amazon RDS for Microsoft SQL Server
| | | | | | | | | | |
database. The application collects, processes, and stores confidential
| | | | | | | |
information and records in the database. The company wants to
| | | | | | | | | |
eliminate the risk of credential exposure. - CORRECT ANSWER✔✔-You
| | | | | | | | |
can use Secrets Manager to store credentials and to configure
| | | | | | | | | |
automatic rotation. a good rotation is 30 days
| | | | | | |
An Amazon Kinesis application is trying to read data from a Kinesis data
| | | | | | | | | | | | |
stream. However, the read data call is rejected. The following error
| | | | | | | | | | |
message is displayed: ProvisionedThroughputExceededException. -
| | | | |
CORRECT ANSWER✔✔-The ProvisionedThroughputExceededException
| | |
, error is caused by the capacity quotas of the data stream exceeding its
| | | | | | | | | | | | |
provisioned amount. A sustained rise of the stream's output data rate | | | | | | | | | | |
can cause this issue. To resolve the issue, you can increase the number
| | | | | | | | | | | | |
of shards within your stream to provide enough capacity for the read
| | | | | | | | | | | |
data calls to consistently succeed.
| | | |
An ecommerce company runs several applications on AWS. The
| | | | | | | | |
company wants to design a centralized streaming log ingestion solution.
| | | | | | | | | |
The solution needs to be able to convert the log files to Apache Parquet
| | | | | | | | | | | | | |
format. Then, the solution must store the log files in Amazon S3. The
| | | | | | | | | | | | |
number of log files being created varies throughout the day. A data
| | | | | | | | | | | |
engineer must configure a solution that ensures the log files are
| | | | | | | | | | |
delivered in near real time. | | | |
Which solution will meet these requirements with the LEAST
| | | | | | | | |
operational overhead - CORRECT ANSWER✔✔-You can use Kinesis Data | | | | | | | | |
Firehose to deliver log files to Amazon S3 with the least operational
| | | | | | | | | | | |
overhead. You can use a data-transformation Lambda function with
| | | | | | | | |
Kinesis Data Firehose. This solution can convert log files to the correct
| | | | | | | | | | | |
format before the log files are delivered to Amazon S3.
| | | | | | | | |
An ecommerce company is running an application on AWS. The
| | | | | | | | | |
application sources recent data from tables in Amazon Redshift. Data
| | | | | | | | | |
that is older than 1 year is accessible in Amazon S3. Recently, a new
| | | | | | | | | | | | | |
report has been written in SQL. The report needs to compare a few
| | | | | | | | | | | | |
columns from the current year sales table with the same columns from
| | | | | | | | | | | |
tables with sales data from previous years. The report runs slowly, with
| | | | | | | | | | | |
poor performance and long wait times to get results.
| | | | | | | |
Exam Questions With Correct Answers
A company is running an Amazon Redshift cluster. A data engineer must
| | | | | | | | | | | |
design a solution that would give the company the ability to provide
| | | | | | | | | | | |
analysis on a separate test environment in Amazon Redshift. The
| | | | | | | | | |
solution would use the data from the main Redshift cluster. The second
| | | | | | | | | | | |
cluster is expected to be used for only 2 hours every 2 weeks as part of
| | | | | | | | | | | | | | | |
the new testing process. - CORRECT ANSWER✔✔-Create a data share
| | | | | | | | | |
from the main Redshift test cluster. Use Amazon Redshift Serverless for
| | | | | | | | | | |
the test environment
| |
A company is running a cloud-based software application in an Amazon
| | | | | | | | | | |
EC2 instance backed by an Amazon RDS for Microsoft SQL Server
| | | | | | | | | | |
database. The application collects, processes, and stores confidential
| | | | | | | |
information and records in the database. The company wants to
| | | | | | | | | |
eliminate the risk of credential exposure. - CORRECT ANSWER✔✔-You
| | | | | | | | |
can use Secrets Manager to store credentials and to configure
| | | | | | | | | |
automatic rotation. a good rotation is 30 days
| | | | | | |
An Amazon Kinesis application is trying to read data from a Kinesis data
| | | | | | | | | | | | |
stream. However, the read data call is rejected. The following error
| | | | | | | | | | |
message is displayed: ProvisionedThroughputExceededException. -
| | | | |
CORRECT ANSWER✔✔-The ProvisionedThroughputExceededException
| | |
, error is caused by the capacity quotas of the data stream exceeding its
| | | | | | | | | | | | |
provisioned amount. A sustained rise of the stream's output data rate | | | | | | | | | | |
can cause this issue. To resolve the issue, you can increase the number
| | | | | | | | | | | | |
of shards within your stream to provide enough capacity for the read
| | | | | | | | | | | |
data calls to consistently succeed.
| | | |
An ecommerce company runs several applications on AWS. The
| | | | | | | | |
company wants to design a centralized streaming log ingestion solution.
| | | | | | | | | |
The solution needs to be able to convert the log files to Apache Parquet
| | | | | | | | | | | | | |
format. Then, the solution must store the log files in Amazon S3. The
| | | | | | | | | | | | |
number of log files being created varies throughout the day. A data
| | | | | | | | | | | |
engineer must configure a solution that ensures the log files are
| | | | | | | | | | |
delivered in near real time. | | | |
Which solution will meet these requirements with the LEAST
| | | | | | | | |
operational overhead - CORRECT ANSWER✔✔-You can use Kinesis Data | | | | | | | | |
Firehose to deliver log files to Amazon S3 with the least operational
| | | | | | | | | | | |
overhead. You can use a data-transformation Lambda function with
| | | | | | | | |
Kinesis Data Firehose. This solution can convert log files to the correct
| | | | | | | | | | | |
format before the log files are delivered to Amazon S3.
| | | | | | | | |
An ecommerce company is running an application on AWS. The
| | | | | | | | | |
application sources recent data from tables in Amazon Redshift. Data
| | | | | | | | | |
that is older than 1 year is accessible in Amazon S3. Recently, a new
| | | | | | | | | | | | | |
report has been written in SQL. The report needs to compare a few
| | | | | | | | | | | | |
columns from the current year sales table with the same columns from
| | | | | | | | | | | |
tables with sales data from previous years. The report runs slowly, with
| | | | | | | | | | | |
poor performance and long wait times to get results.
| | | | | | | |