Valid Databricks-Certified-Professional-Data-Engineer Dumps shared by EduDump.com for Helping Passing Databricks-Certified-Professional-Data-Engineer Exam! EduDump.com now offer the newest Databricks-Certified-Professional-Data-Engineer exam dumps, the EduDump.com Databricks-Certified-Professional-Data-Engineer exam questions have been updated and answers have been corrected get the newest EduDump.com Databricks-Certified-Professional-Data-Engineer dumps with Test Engine here:
A query is taking too long to run. After investigating the Spark UI, the data engineer discovered a significant amount of disk spill. The compute instance being used has a core-to-memory ratio of 1:2. What are the two steps the data engineer should take to minimize spillage? (Choose 2 answers)
Correct Answer: A,D
Comprehensive and Detailed Explanation From Exact Extract of Databricks Data Engineer Documents: Databricks recommends addressing disk spilling-which occurs when Spark tasks run out of memory-by increasing memory per core and controlling partition size. Selecting an instance type with a higher memory-to-core ratio (A) provides each task with more available RAM, directly reducing the chance of spilling to disk. Additionally, reducing spark.sql.files.maxPartitionBytes (D) creates smaller partitions, preventing any single task from holding too much data in memory. Increasing partition size (C) or disk capacity (B) does not solve memory bottlenecks, and bandwidth (E) affects network I/O, not spill behavior. Therefore, the correct actions are A and D.