Associate-Data-Practitioner Exam Dumps | You need to design a data pipeline to process large volumes of raw server log data stored in Cloud Storage.

<< Prev Question Next Question >>

Question 43/47

You need to design a data pipeline to process large volumes of raw server log data stored in Cloud Storage.
The data needs to be cleaned, transformed, and aggregated before being loaded into BigQuery for analysis.
The transformation involves complex data manipulation using Spark scripts that your team developed. You need to implement a solution that leverages your team's existing skillset, processes data at scale, and minimizes cost. What should you do?

A. Use Dataflow with a custom template for the transformation logic.

B. Use Cloud Data Fusion to visually design and manage the pipeline.

C. Use Dataform to define the transformations in SQLX.

D. Use Dataproc to run the transformations on a cluster.

Question 43/47

LEAVE A REPLY

Download PDF File