AWS Certified Cloud Practitioner / Question #1332 of 719

Question #1332

A company needs to perform a one-time migration of a petabyte-scale dataset comprising millions of files from their on-premises network-attached storage to Amazon S3. The migration must preserve file metadata, including timestamps and permissions. Which AWS service should the company use for this task?

A

AWS Database Migration Service (AWS DMS)

B

AWS DataSync

C

AWS Snowball Edge

D

AWS Storage Gateway

Explanation

AWS DataSync (B) is designed for high-speed, automated data transfers between on-premises storage and AWS services like Amazon S3. It preserves file metadata, including timestamps, permissions, and other attributes, meeting the requirement. DataSync optimizes transfer speeds and handles petabyte-scale datasets efficiently over the network.

Other options:
- A. AWS DMS: Focuses on database migrations, not file-based storage, and does not handle file metadata.
- C. AWS Snowball Edge: Suitable for offline transfers but requires physical logistics and additional steps to preserve metadata.
- D. AWS Storage Gateway: Provides ongoing hybrid storage access, not optimized for one-time bulk migrations.

Key Points:
1. Metadata Preservation: DataSync ensures timestamps, permissions, etc., are retained.
2. Scalability: Handles millions of files and petabyte-scale data efficiently.
3. One-Time Migration: DataSync is purpose-built for such scenarios, unlike Storage Gateway (ongoing) or Snowball (offline).

Answer

The correct answer is: B