Valid Databricks-Certified-Data-Engineer-Associate Dumps shared by ExamDiscuss.com for Helping Passing Databricks-Certified-Data-Engineer-Associate Exam! ExamDiscuss.com now offer the newest Databricks-Certified-Data-Engineer-Associate exam dumps, the ExamDiscuss.com Databricks-Certified-Data-Engineer-Associate exam questions have been updated and answers have been corrected get the newest ExamDiscuss.com Databricks-Certified-Data-Engineer-Associate dumps with Test Engine here:
A data analyst has developed a query that runs against Delta table. They want help from the data engineering team to implement a series of tests to ensure the data returned by the query is clean. However, the data engineering team uses Python for its tests rather than SQL. Which of the following operations could the data engineering team use to run the query and operate with the results in PySpark?
Correct Answer: C
The spark.sql operation allows the data engineering team to run a SQL query and return the result as a PySpark DataFrame. This way, the data engineering team can use the same query that the data analyst has developed and operate with the results in PySpark. For example, the data engineering team can use spark.sql("SELECT * FROM sales") to get a DataFrame of all the records from the sales Delta table, and then apply various tests or transformations using PySpark APIs. The other options are either not valid operations (A, D), not suitable for running a SQL query (B, E), or not returning a DataFrame (A). References: Databricks Documentation - Run SQL queries, Databricks Documentation - Spark SQL and DataFrames.