AWS Certified Solutions Architect - Associate / Question #1158 of 1019

Question #1158

A data processing team receives daily uploads of large datasets in an Amazon S3 bucket. The team manually transfers these files to a processing S3 bucket each day for analysis with Amazon QuickSight. As more teams contribute larger files, the manual process becomes inefficient.

The team wants to automate transferring files to the processing bucket upon upload. They also need to trigger AWS Lambda functions for data validation and feed the data into Amazon SageMaker Pipelines for model training.

What should a solutions architect recommend to meet these requirements with the LEAST operational overhead?

A

Create a Lambda function to copy the files to the processing S3 bucket. Create an S3 event notification for the processing S3 bucket. Configure Lambda and SageMaker Pipelines as destinations of the event notification. Configure s3:ObjectCreated:Put as the event type.

B

Create a Lambda function to copy the files to the processing S3 bucket. Configure the processing S3 bucket to send event notifications to Amazon EventBridge (Amazon CloudWatch Events). Configure an ObjectCreated rule in EventBridge (CloudWatch Events). Configure Lambda and SageMaker Pipelines as targets for the rule.

C

Configure S3 replication between the S3 buckets. Create an S3 event notification for the processing S3 bucket. Configure Lambda and SageMaker Pipelines as destinations of the event notification. Configure s3:ObjectCreated:Put as the event type.

D

Configure S3 replication between the S3 buckets. Configure the processing S3 bucket to send event notifications to Amazon EventBridge (Amazon CloudWatch Events). Configure an ObjectCreated rule in EventBridge (CloudWatch Events). Configure Lambda and SageMaker Pipelines as targets for the rule.

Explanation

Option D is correct because:
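1. S3 Replication copies new objects from the upload bucket to the processing bucket automatically, with no custom code to write or maintain. It requires versioning to be enabled on both buckets.
2. With EventBridge notifications enabled on the processing bucket, a single EventBridge (CloudWatch Events) rule that matches Object Created events can invoke both Lambda (for validation) and SageMaker Pipelines (for training), because a rule can route to multiple targets and SageMaker Pipelines is a supported EventBridge target.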
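
The rule described in point 2 can be sketched roughly as follows. This is a minimal illustration, not a definitive setup: the bucket name, ARNs, rule name, and target IDs are all hypothetical placeholders, and the `apply_rule` helper assumes boto3 and AWS credentials are available.

```python
import json

# Hypothetical names and ARNs, for illustration only.
PROCESSING_BUCKET = "example-processing-bucket"
VALIDATION_LAMBDA_ARN = "arn:aws:lambda:us-east-1:111122223333:function:validate-upload"
PIPELINE_ARN = "arn:aws:sagemaker:us-east-1:111122223333:pipeline/example-training"
EVENTS_ROLE_ARN = "arn:aws:iam::111122223333:role/example-eventbridge-start-pipeline"


def build_event_pattern(bucket_name):
    """Event pattern matching S3 'Object Created' events for one bucket."""
    return {
        "source": ["aws.s3"],
        "detail-type": ["Object Created"],
        "detail": {"bucket": {"name": [bucket_name]}},
    }


def build_targets():
    """Two targets on the same rule: the validation Lambda and the training pipeline."""
    return [
        {"Id": "validate-upload", "Arn": VALIDATION_LAMBDA_ARN},
        {
            "Id": "start-training-pipeline",
            "Arn": PIPELINE_ARN,
            # EventBridge assumes this role to start the pipeline execution.
            "RoleArn": EVENTS_ROLE_ARN,
        },
    ]


def apply_rule(rule_name="processing-object-created"):
    """Create the rule and attach both targets (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builders above work without it

    events = boto3.client("events")
    events.put_rule(
        Name=rule_name,
        EventPattern=json.dumps(build_event_pattern(PROCESSING_BUCKET)),
        State="ENABLED",
    )
    events.put_targets(Rule=rule_name, Targets=build_targets())


if __name__ == "__main__":
    print(json.dumps(build_event_pattern(PROCESSING_BUCKET), indent=2))
```

In practice the Lambda function would also need a resource-based permission allowing EventBridge to invoke it, and the processing bucket must have EventBridge notifications turned on before any events reach the rule.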

Other options are incorrect because:
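- A & B: Require a custom Lambda function to copy files, which adds code to write, deploy, and maintain. A also relies on S3 event notifications, which cannot target SageMaker Pipelines.
- C: Relies on S3 event notifications, which can deliver only to Amazon SNS, Amazon SQS, Lambda, or EventBridge. SageMaker Pipelines is not a supported destination, so it cannot be configured as the option describes.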
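
To make the destination limits of S3 event notifications concrete, here is a small sketch of the notification configuration shape. The helper names are hypothetical; the top-level keys are the ones accepted by the S3 `put_bucket_notification_configuration` call, and none of them targets SageMaker Pipelines directly. The `enable_eventbridge` helper assumes boto3 and AWS credentials.

```python
# Top-level keys of an S3 bucket notification configuration, one per
# supported destination type. There is no SageMaker Pipelines entry.
S3_NOTIFICATION_KEYS = {
    "TopicConfigurations",           # Amazon SNS
    "QueueConfigurations",           # Amazon SQS
    "LambdaFunctionConfigurations",  # AWS Lambda
    "EventBridgeConfiguration",      # Amazon EventBridge
}


def eventbridge_notification_config():
    """Configuration that forwards the bucket's events to EventBridge."""
    # An empty EventBridgeConfiguration is what switches delivery on.
    return {"EventBridgeConfiguration": {}}


def enable_eventbridge(bucket_name):
    """Apply the configuration (needs boto3 and AWS credentials).

    Note: this call replaces the bucket's existing notification
    configuration rather than merging with it.
    """
    import boto3  # imported lazily so the helpers above work without it

    s3 = boto3.client("s3")
    s3.put_bucket_notification_configuration(
        Bucket=bucket_name,
        NotificationConfiguration=eventbridge_notification_config(),
    )
```

Once events flow to EventBridge, routing to Lambda and SageMaker Pipelines is handled by EventBridge rules rather than by the bucket's notification configuration.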

Key Points:
- Use S3 Replication for automated cross-bucket copying.
- Use EventBridge for flexible event routing to multiple targets.
- Avoid a custom Lambda function for file transfers when a managed feature (S3 Replication) suffices.
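
As a rough sketch of the first key point, a replication rule from the upload bucket to the processing bucket might look like the following. The rule ID, bucket names, and role ARN are hypothetical, and the sketch assumes versioning is already enabled on both buckets and that the IAM role grants S3 permission to replicate. The `apply_replication` helper assumes boto3 and AWS credentials.

```python
def build_replication_config(role_arn, destination_bucket_arn):
    """Replication configuration with one rule covering every new object."""
    return {
        "Role": role_arn,  # IAM role S3 assumes to replicate objects
        "Rules": [
            {
                "ID": "replicate-uploads-to-processing",  # hypothetical rule name
                "Priority": 1,
                "Status": "Enabled",
                "Filter": {"Prefix": ""},  # empty prefix matches all keys
                "DeleteMarkerReplication": {"Status": "Disabled"},
                "Destination": {"Bucket": destination_bucket_arn},
            }
        ],
    }


def apply_replication(source_bucket, config):
    """Attach the configuration to the source bucket (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the builder above works without it

    s3 = boto3.client("s3")
    s3.put_bucket_replication(
        Bucket=source_bucket,
        ReplicationConfiguration=config,
    )


if __name__ == "__main__":
    cfg = build_replication_config(
        "arn:aws:iam::111122223333:role/example-replication-role",
        "arn:aws:s3:::example-processing-bucket",
    )
    print(cfg["Rules"][0]["Destination"]["Bucket"])
```

Replication is asynchronous and applies to objects uploaded after the rule is in place, so existing files would need a separate one-time copy.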

Answer

The correct answer is: D