Question #1617
A manufacturing company uses on-premises sensors to generate log data in CSV format. The sensors can write data to an NFS file share. The company wants to migrate this data to AWS for analysis using SQL queries, which are run several times a day. The solution must be cost-effective.
Which combination of steps will meet these requirements MOST cost-effectively? (Choose three.)
A. Deploy an AWS Storage Gateway on premises in Amazon S3 File Gateway mode.
B. Deploy an AWS Storage Gateway on premises in Amazon FSx File Gateway mode.
C. Set up an AWS Glue crawler to create a table based on the data in Amazon S3.
D. Set up an Amazon EMR cluster with EMR File System (EMRFS) to query the data in Amazon S3. Provide access to analysts.
E. Set up an Amazon Redshift cluster to query the data in Amazon S3. Provide access to analysts.
F. Set up Amazon Athena to query the data in Amazon S3. Provide access to analysts.
Explanation
The solution uses AWS Storage Gateway in S3 File Gateway mode (A), which exposes an NFS file share on premises and stores the files the sensors write as objects in Amazon S3, a low-cost storage service. An AWS Glue crawler (C) infers the CSV schema and creates a table in the Glue Data Catalog, which Athena uses to locate and interpret the data. Amazon Athena (F) runs serverless SQL directly against the data in S3 and charges per query based on the amount of data scanned, which suits a workload that runs only several times a day.
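The Glue and Athena steps can be sketched with boto3 as follows. This is a minimal illustration, not a reference deployment: the crawler name, IAM role ARN, database, table, and S3 paths in the usage note are hypothetical, and the `glue` and `athena` arguments are assumed to be boto3 clients created with `boto3.client("glue")` and `boto3.client("athena")`.

```python
def crawler_request(name, role_arn, database, s3_path):
    """Build keyword arguments for the Glue CreateCrawler call."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def query_request(sql, database, output_location):
    """Build keyword arguments for the Athena StartQueryExecution call."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def start_crawl(glue, name, role_arn, database, s3_path):
    """Create the crawler and start it; the crawl itself runs asynchronously."""
    request = crawler_request(name, role_arn, database, s3_path)
    glue.create_crawler(**request)
    glue.start_crawler(Name=request["Name"])
    return request["Name"]


def submit_query(athena, sql, database, output_location):
    """Submit a query and return its execution ID (results land in S3)."""
    response = athena.start_query_execution(
        **query_request(sql, database, output_location)
    )
    return response["QueryExecutionId"]
```

A caller might run `start_crawl(glue, "sensor-logs-crawler", role_arn, "sensor_db", "s3://example-sensor-bucket/logs/")`, wait for the crawl to finish, and then call `submit_query(athena, "SELECT COUNT(*) FROM logs", "sensor_db", "s3://example-sensor-bucket/athena-results/")`. The table name `logs` is an assumption; the crawler chooses the actual name from the S3 path.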
Other options are less cost-effective: FSx File Gateway (B) fronts Amazon FSx for Windows File Server, which is accessed over SMB rather than NFS and costs more than S3 storage. EMR (D) and Redshift (E) require provisioned clusters that accrue charges even while idle, which is expensive for a few queries a day. Athena avoids infrastructure costs, aligning with the requirement for cost-effectiveness. Key points: Use S3 for storage, Glue for schema, and Athena for serverless SQL.
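The cost argument can be made concrete with rough arithmetic. The sketch below compares per-query pricing with an always-on cluster; the Athena rate of 5 USD per TB scanned and the cluster hourly rate are assumptions for illustration only, and actual prices vary by Region and change over time.

```python
ATHENA_USD_PER_TB = 5.00  # assumed rate per TB scanned; check current pricing


def athena_monthly_cost(gb_scanned_per_query, queries_per_day, days=30):
    """Estimated Athena cost: pay only for data scanned by each query."""
    tb_per_query = gb_scanned_per_query / 1024
    return tb_per_query * ATHENA_USD_PER_TB * queries_per_day * days


def cluster_monthly_cost(usd_per_hour, hours_per_day=24, days=30):
    """Estimated cost of a cluster that bills for every hour it runs."""
    return usd_per_hour * hours_per_day * days


if __name__ == "__main__":
    # Hypothetical workload: 5 queries a day, each scanning 2 GB of CSV.
    athena = athena_monthly_cost(gb_scanned_per_query=2, queries_per_day=5)
    # Hypothetical single-node cluster at 0.25 USD per hour, always on.
    cluster = cluster_monthly_cost(usd_per_hour=0.25)
    print(f"Athena:  {athena:.2f} USD/month")
    print(f"Cluster: {cluster:.2f} USD/month")
```

Under these assumed numbers the per-query model is far cheaper, because the cluster is billed for idle hours between the few daily queries. The gap narrows as query volume and data scanned grow.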
Answer
The correct answer is: ACF