<< Prev Question Next Question >>

Question 22/113

A global retail company sells products across multiple categories (e.g., Electronics, Clothing) and regions (e.g., North, South, East, West). The sales team has provided the data engineer with a PySpark dataframe named sales_df as below and the team wants the data engineer to analyze the sales data to help them make strategic decisions.

Calculate the total sales amount for each product category and store the results in a new dataframe called category_sales.
What will generate the expected result of category_sales?

LEAVE A REPLY

Your email address will not be published. Required fields are marked *

Question List (113q)
Question 1: A data engineer has been given a new record of data: id STRI...
Question 2: A data engineer is maintaining a data pipeline. Upon data in...
Question 3: What are the transformations typically included in building ...
Question 4: A data engineer is decommissioning a sandbox schema in Unity...
Question 5: A data engineer is writing Spark code to group sales data by...
Question 6: A data engineer has realized that they made a mistake when m...
Question 7: A data engineer has configured a Structured Streaming job to...
Question 8: In which of the following scenarios should a data engineer s...
Question 9: A data engineer needs to apply custom logic to identify empl...
Question 10: A Databricks workflow fails at the last stage due to an erro...
Question 11: A data engineer needs to create a table in Databricks using ...
Question 12: Which of the following describes a scenario in which a data ...
Question 13: A data engineer is designing a streaming pipeline and wants ...
Question 14: An engineering manager uses a Databricks SQL query to monito...
Question 15: A dataset has been defined using Delta Live Tables and inclu...
Question 16: A data engineer is attempting to drop a Spark SQL table my_t...
Question 17: Which of the following benefits is provided by the array fun...
Question 18: Which of the following data workloads will utilize a Gold ta...
Question 19: Differentiate between all-purpose clusters and jobs clusters...
Question 20: A data engineer needs to develop integration tests for an ET...
Question 21: A data engineer has a Delta table containing old data files ...
Question 22: A global retail company sells products across multiple categ...
Question 23: Which of the following code blocks will remove the rows wher...
Question 24: A data architect has determined that a table of the followin...
Question 25: A data engineer team has decided to implement a new data pla...
Question 26: A data engineer wants to create a new table containing the n...
Question 27: A data engineer must deliver a trustworthy customer 360 data...
Question 28: Which of the following describes when to use the CREATE STRE...
Question 29: A data engineer needs to apply custom logic to string column...
Question 30: An organization has data stored across multiple external sys...
Question 31: A dataset has been defined using Delta Live Tables and inclu...
Question 32: A Data Engineer is designing Bronze layer in Databricks Meda...
Question 33: A data engineer has configured a Structured Streaming job to...
Question 34: Which of the following commands will return the number of nu...
Question 35: What Databricks feature can be used to check the data source...
Question 36: A data engineering team has noticed that their Databricks SQ...
Question 37: A data engineer is implementing a data ingestion pipeline th...
Question 38: A data engineer is setting up a new Databricks pipeline that...
Question 39: A data engineer wants to run SQL queries directly on data st...
Question 40: A data engineer has a Python notebook in Databricks, but the...
Question 41: A data engineer wants to delete records from a Delta table w...
Question 42: A data engineer wants to schedule multiple tasks in a workfl...
Question 43: Which of the following Git operations must be performed outs...
Question 44: A data engineer needs to create a table in Databricks using ...
Question 45: A data engineer wants to create an external table in Databri...
Question 46: A data engineer notices that a Spark job performing a join b...
Question 47: A data engineer wants to create a data entity from a couple ...
Question 48: Which of the following describes the storage organization of...
Question 49: A data engineer needs to ingest streaming data from Kafka in...
Question 50: A Databricks single-task workflow fails at the last task due...
Question 51: A data engineer has written a function in a Databricks Noteb...
Question 52: In which of the following scenarios should a data engineer u...
Question 53: A data engineer wants to reduce costs and optimize cloud spe...
Question 54: In order for Structured Streaming to reliably track the exac...
Question 55: A data engineer is working on a personal laptop and needs to...
Question 56: A data engineer is working in a Python notebook on Databrick...
Question 57: A data engineering team has two tables. The first table marc...
Question 58: A data engineer is working on a Databricks project that util...
Question 59: A data engineer wants to create a relational object by pulli...
Question 60: A data engineer has inherited a Databricks pipeline from a p...
Question 61: A data engineer is configuring a cluster in Databricks and w...
Question 62: A data engineer has been using a Databricks SQL dashboard to...
Question 63: A data engineer needs to optimize the data layout and query ...
Question 64: A data engineer needs to provide access to a group named man...
Question 65: A data engineer is implementing a data pipeline using Databr...
Question 66: A data engineer needs to conduct Exploratory Analysis on dat...
Question 67: A new data engineering team team. has been assigned to an EL...
Question 68: A data engineer is migrating pipeline tasks to reduce operat...
Question 69: A data engineer only wants to execute the final block of a P...
Question 70: A data engineering team wants to validate a new ingestion pi...
Question 71: Which of the following SQL keywords can be used to convert a...
Question 72: A new data engineering team has been assigned to work on a p...
Question 73: A data analyst has a series of queries in a SQL program. The...
Question 74: A data engineer is configuring Unity Catalog in Databricks a...
Question 75: A data engineer has a single-task Job that runs each morning...
Question 76: A data engineer needs to append new records to an existing D...
Question 77: What is the functionality of AutoLoader in Databricks?...
Question 78: A data engineer needs to conduct Exploratory Analysis on dat...
Question 79: A data engineer needs to use a Delta table as part of a data...
Question 80: What can be used to simplify and unify siloed data architect...
Question 81: A data engineer is processing ingested streaming tables and ...
Question 82: A data engineer needs access to a table new_table, but they ...
Question 83: A data engineer is standardizing repository layouts for mult...
Question 84: Which of the following statements regarding the relationship...
Question 85: A data engineering project involves processing large batches...
Question 86: A data engineer has a Python variable table_name that they w...
Question 87: A data engineer is attempting to write Python and SQL in the...
Question 88: A data organization leader is upset about the data analysis ...
Question 89: A data engineer wants to read only a subset of columns from ...
Question 90: A data engineer wants to store intermediate data in a tempor...
Question 91: Which of the following describes a benefit of creating an ex...
Question 92: A Python file is ready for production and the client wants t...
Question 93: A data engineer is configuring a batch processing job in Dat...
Question 94: A new data engineering team team has been assigned to an ELT...
Question 95: An organization needs to share a dataset stored in its Datab...
Question 96: Which of the following describes a scenario in which a data ...
Question 97: A data engineer notices that a Spark job repeatedly scans a ...
Question 98: A data engineer is working with a very large dataset and wan...
Question 99: A data engineer has been given a new record of data: id STRI...
Question 100: A data engineer has been using a Databricks SQL dashboard to...
Question 101: A data engineer is developing an ETL process based on Spark ...
Question 102: A data engineer is designing an ETL pipeline to process both...
Question 103: A data engineer is reading a very large dataset that contain...
Question 104: A data engineer needs to use a Delta table as part of a data...
Question 105: A data engineer needs to access the view created by the sale...
Question 106: A data engineer has a Job that has a complex run schedule, a...
Question 107: A data engineer needs to ingest from both streaming and batc...
Question 108: A data engineer is onboarding a new bronze ingestion pipelin...
Question 109: A data engineer wants to ensure that duplicate records are r...
Question 110: Which SQL code snippet will correctly demonstrate a Data Def...
Question 111: A data engineer needs to reduce the number of small files ge...
Question 112: A data engineering team is using Kafka to capture event data...
Question 113: A Databricks single-task workflow fails due to an error in a...