The AWS Certified Data Engineer – Associate (DEA-C01) exam validates your skills in data engineering on AWS. It is suitable for those who want to demonstrate their ability to design and implement data solutions.
A company needs to analyze real-time streaming data. Which service should they use?
AAWS Data Pipeline
BAmazon Kinesis
CAmazon Redshift
DAWS Glue
Explanation
Amazon Kinesis is designed for real-time stream processing, unlike the other services.
Q102
You are configuring a multi-Availability Zone deployment for Amazon RDS. What happens if one Availability Zone goes down?
ADatabase is completely offline
BConnection to primary database fails
CAutomatic failover to standby occurs
DOnly read replicas are available
Explanation
Automatic failover to the standby ensures high availability, while the other options introduce unnecessary downtimes.
Q103
Which service helps in real-time processing of streaming data?
AAmazon Kinesis
BAmazon S3
CAmazon RDS
DAWS Lambda
Explanation
Amazon Kinesis is specifically designed for real-time data streams, unlike the other services which serve different purposes.
Q104
A company needs a scalable architecture for its sporadic workloads. Which AWS service should it consider deploying?
AAmazon EC2
BAWS Lambda
CAmazon S3
DAmazon RDS
Explanation
AWS Lambda automatically scales with workloads, while the other options require manual scaling.
Q105
You are configuring an AWS Glue job. What happens if you set the job bookmark to 'job bookmarks enable'?
AAlways process all files again
BSkip previously processed files
CErrors in job execution
DJob cannot be executed
Explanation
Enabling job bookmarks allows Glue to keep track of processed files, skipping them in future runs.
Q106
Which AWS service is primarily used for data transformation in ETL processes?
AAWS Glue
BAmazon S3
CAmazon RDS
DAmazon Kinesis
Explanation
AWS Glue is specifically designed for data transformation, while the others serve different data management roles.
Q107
A company needs to store large volumes of unstructured data. What is the best choice?
AAmazon RDS
BAmazon S3
CAmazon DynamoDB
DAWS Lambda
Explanation
Amazon S3 is ideal for unstructured data storage, whereas the others are better for structured data or compute processes.
Q108
What happens when you enable versioning on an S3 bucket?
AOld versions are permanently deleted
BSaved data remains immutable
CUnique URLs are created for each version
DAll objects are encrypted automatically
Explanation
Enabling versioning results in unique URLs for each version, while the other options incorrectly describe S3's behavior.
Q109
Which AWS service provides a serverless event-driven compute engine?
AAWS Lambda
BAmazon EC2
CAmazon RDS
DAWS Fargate
Explanation
AWS Lambda executes code without provisioning servers; EC2 requires server management, RDS is for relational databases, and Fargate is for containers.
Q110
A company needs to store large volumes of unstructured data for analytics. Which AWS service should they use?
AAmazon S3
BAWS DynamoDB
CAmazon RDS
DAWS Batch
Explanation
Amazon S3 is optimized for storing large amounts of unstructured data; DynamoDB is for NoSQL, RDS is for structured data, and Batch manages job execution.