<< Prev Question Next Question >>

Question 4/180

Your analytics team wants to build a simple statistical model to determine which customers are most likely to work with your company again, based on a few different metrics. They want to run the model on Apache Spark, using data housed in Google Cloud Storage, and you have recommended using Google Cloud Dataproc to execute this job. Testing has shown that this workload can run in approximately 30 minutes on a 15-node cluster, outputting the results into Google BigQuery. The plan is to run this workload weekly. How should you optimize the cluster for cost?

LEAVE A REPLY

Your email address will not be published. Required fields are marked *

Question List (180q)
Question 1: You are designing the architecture to process your data from...
Question 2: You are running a Dataflow streaming pipeline, with Streamin...
Question 3: You need to move 2 PB of historical data from an on-premises...
Question 4: Your analytics team wants to build a simple statistical mode...
Question 5: Your company produces 20,000 files every hour. Each data fil...
Question 6: You have a query that filters a BigQuery table using a WHERE...
Question 7: Your globally distributed auction application allows users t...
Question 8: If you want to create a machine learning model that predicts...
Question 9: You work for a car manufacturer and have set up a data pipel...
Question 10: You are designing a pipeline that publishes application even...
Question 11: Which of the following job types are supported by Cloud Data...
Question 12: The _________ for Cloud Bigtable makes it possible to use Cl...
Question 13: You have a Standard Tier Memorystore for Redis instance depl...
Question 14: You are creating a data model in BigQuery that will hold ret...
Question 15: Your chemical company needs to manually check documentation ...
Question 16: You are responsible for writing your company's ETL pipelines...
Question 17: Your company's customer and order databases are often under ...
Question 18: You work for a large financial institution that is planning ...
Question 19: A live TV show asks viewers to cast votes using their mobile...
Question 20: What are all of the BigQuery operations that Google charges ...
Question 21: You have data located in BigQuery that is used to generate r...
Question 22: You are working on a linear regression model on BigQuery ML ...
Question 23: Your company uses a proprietary system to send inventory dat...
Question 24: The Dataflow SDKs have been recently transitioned into which...
Question 25: What is the general recommendation when designing your row k...
Question 26: An aerospace company uses a proprietary data format to store...
Question 27: For the best possible performance, what is the recommended z...
Question 28: You stream order data by using a Dataflow pipeline, and writ...
Question 29: You are designing a messaging system by using Pub/Sub to pro...
Question 30: You are integrating one of your internal IT applications and...
Question 31: You have a data pipeline that writes data to Cloud Bigtable ...
Question 32: You work for a mid-sized enterprise that needs to move its o...
Question 33: You are building new real-time data warehouse for your compa...
Question 34: You want to process payment transactions in a point-of-sale ...
Question 35: You have one BigQuery dataset which includes customers' stre...
Question 36: You are administering a BigQuery dataset that uses a custome...
Question 37: You have a data pipeline with a Cloud Dataflow job that aggr...
Question 38: You are designing the architecture of your application to st...
Question 39: When a Cloud Bigtable node fails, ____ is lost....
Question 40: When running a pipeline that has a BigQuery source, on your ...
Question 41: Cloud Bigtable is a recommended option for storing very larg...
Question 42: You are creating a model to predict housing prices. Due to b...
Question 43: You work for a shipping company that uses handheld scanners ...
Question 44: You have a job that you want to cancel. It is a streaming pi...
Question 45: Your company has recently grown rapidly and now ingesting da...
Question 46: Government regulations in the banking industry mandate the p...
Question 47: Your company is migrating its on-premises data warehousing s...
Question 48: Does Dataflow process batch data pipelines or streaming data...
Question 49: You have a petabyte of analytics data and need to design a s...
Question 50: The data analyst team at your company uses BigQuery for ad-h...
Question 51: Your company handles data processing for a number of differe...
Question 52: You are designing a data mesh on Google Cloud with multiple ...
Question 53: You have uploaded 5 years of log data to Cloud Storage A use...
Question 54: You currently have transactional data stored on-premises in ...
Question 55: You create an important report for your large team in Google...
Question 56: You are developing a model to identify the factors that lead...
Question 57: You need to store and analyze social media postings in Googl...
Question 58: You have Cloud Functions written in Node.js that pull messag...
Question 59: Your company built a TensorFlow neural-network model with a ...
Question 60: Google Cloud Bigtable indexes a single value in each row. Th...
Question 61: You are building a model to make clothing recommendations. Y...
Question 62: Which of the following are examples of hyperparameters? (Sel...
Question 63: Your organization is modernizing their IT services and migra...
Question 64: You are building an ELT solution in BigQuery by using Datafo...
Question 65: Which of the following is NOT true about Dataflow pipelines?...
Question 66: How can you get a neural network to learn about relationship...
Question 67: You have terabytes of customer behavioral data streaming fro...
Question 68: Your company is loading comma-separated values (CSV) files i...
Question 69: You need ads data to serve Al models and historical data tor...
Question 70: You work for a large fast food restaurant chain with over 40...
Question 71: You are designing an Apache Beam pipeline to enrich data fro...
Question 72: You have a variety of files in Cloud Storage that your data ...
Question 73: You are collecting loT sensor data from millions of devices ...
Question 74: You work for an advertising company, and you've developed a ...
Question 75: If you're running a performance test that depends upon Cloud...
Question 76: You've migrated a Hadoop job from an on-prem cluster to data...
Question 77: Which Java SDK class can you use to run your Dataflow progra...
Question 78: You want to build a managed Hadoop system as your data lake....
Question 79: Which of the following is NOT a valid use case to select HDD...
Question 80: You store historic data in Cloud Storage. You need to perfor...
Question 81: Your company is migrating their 30-node Apache Hadoop cluste...
Question 82: You have thousands of Apache Spark jobs running in your on-p...
Question 83: Which of the following statements about the Wide &amp; Deep ...
Question 84: Suppose you have a dataset of images that are each labeled a...
Question 85: The marketing team at your organization provides regular upd...
Question 86: You are migrating your on-premises data warehouse to BigQuer...
Question 87: Which of the following is not true about Dataflow pipelines?...
Question 88: You are planning to load some of your existing on-premises d...
Question 89: Which of these is not a supported method of putting data int...
Question 90: Your company is implementing a data warehouse using BigQuery...
Question 91: Which of these are examples of a value in a sparse vector? (...
Question 92: You are creating the CI'CD cycle for the code of the directe...
Question 93: Which TensorFlow function can you use to configure a categor...
Question 94: As your organization expands its usage of GCP, many teams ha...
Question 95: You are building an application to share financial market da...
Question 96: Your company needs to upload their historic data to Cloud St...
Question 97: Cloud Bigtable is Google's ______ Big Data database service....
Question 98: Which action can a Cloud Dataproc Viewer perform?...
Question 99: In order to securely transfer web traffic data from your com...
Question 100: Which software libraries are supported by Cloud Machine Lear...
Question 101: You have an Apache Kafka Cluster on-prem with topics contain...
Question 102: Which Cloud Dataflow / Beam feature should you use to aggreg...
Question 103: You are migrating your on-premises data warehouse to BigQuer...
Question 104: You have a table that contains millions of rows of sales dat...
Question 105: Your company has hired a new data scientist who wants to per...
Question 106: You are implementing security best practices on your data pi...
Question 107: Which Google Cloud Platform service is an alternative to Had...
Question 108: You are planning to migrate your current on-premises Apache ...
Question 109: You are operating a streaming Cloud Dataflow pipeline. Your ...
Question 110: Your software uses a simple JSON format for all messages. Th...
Question 111: You are designing a fault-tolerant architecture to store dat...
Question 112: The YARN ResourceManager and the HDFS NameNode interfaces ar...
Question 113: You are developing a software application using Google's Dat...
Question 114: A data scientist has created a BigQuery ML model and asks yo...
Question 115: MJTelco needs you to create a schema in Google Bigtable that...
Question 116: You have a network of 1000 sensors. The sensors generate tim...
Question 117: You need to choose a database for a new project that has the...
Question 118: When you store data in Cloud Bigtable, what is the recommend...
Question 119: You are building a data pipeline on Google Cloud. You need t...
Question 120: Which SQL keyword can be used to reduce the number of column...
Question 121: You have a BigQuery table that ingests data directly from a ...
Question 122: You are choosing a NoSQL database to handle telemetry data s...
Question 123: You are using Workflows to call an API that returns a 1 KB J...
Question 124: You are working on a sensitive project involving private use...
Question 125: Which of these numbers are adjusted by a neural network as i...
Question 126: You have enabled the free integration between Firebase Analy...
Question 127: You have developed three data processing jobs. One executes ...
Question 128: You want to store your team's shared tables in a single data...
Question 129: You architect a system to analyze seismic data. Your extract...
Question 130: You maintain ETL pipelines. You notice that a streaming pipe...
Question 131: You work for a manufacturing company that sources up to 750 ...
Question 132: You want to automate execution of a multi-step data pipeline...
Question 133: You are designing storage for two relational tables that are...
Question 134: You operate a database that stores stock trades and an appli...
Question 135: An organization maintains a Google BigQuery dataset that con...
Question 136: You need to choose a database to store time series CPU and m...
Question 137: When you design a Google Cloud Bigtable schema it is recomme...
Question 138: You issue a new batch job to Dataflow. The job starts succes...
Question 139: Which of these sources can you not load data into BigQuery f...
Question 140: You are using Google BigQuery as your data warehouse. Your u...
Question 141: You have a BigQuery table that contains customer data, inclu...
Question 142: You work for a global shipping company. You want to train a ...
Question 143: You have a requirement to insert minute-resolution data from...
Question 144: Your company is streaming real-time sensor data from their f...
Question 145: You are building a report-only data warehouse where the data...
Question 146: Your company operates in three domains: airlines, hotels, an...
Question 147: You are a head of BI at a large enterprise company with mult...
Question 148: You want to schedule a number of sequential load and transfo...
Question 149: You are designing a data warehouse in BigQuery to analyze sa...
Question 150: You create a new report for your large team in Google Data S...
Question 151: Which of these is NOT a way to customize the software on Dat...
Question 152: You have historical data covering the last three years in Bi...
Question 153: Which is not a valid reason for poor Cloud Bigtable performa...
Question 154: You are troubleshooting your Dataflow pipeline that processe...
Question 155: You are architecting a data transformation solution for BigQ...
Question 156: Your car factory is pushing machine measurements as messages...
Question 157: Your startup has never implemented a formal security policy....
Question 158: You are designing the database schema for a machine learning...
Question 159: Your team is responsible for developing and maintaining ETLs...
Question 160: You work for a manufacturing plant that batches application ...
Question 161: MJTelco is building a custom interface to share data. They h...
Question 162: You are operating a Cloud Dataflow streaming pipeline. The p...
Question 163: You have spent a few days loading data from comma-separated ...
Question 164: You plan to deploy Cloud SQL using MySQL. You need to ensure...
Question 165: You orchestrate ETL pipelines by using Cloud Composer One of...
Question 166: You are administering a BigQuery on-demand environment. Your...
Question 167: You have created an external table for Apache Hive partition...
Question 168: You are migrating an application that tracks library books a...
Question 169: You're training a model to predict housing prices based on a...
Question 170: You are building a streaming Dataflow pipeline that ingests ...
Question 171: You work for a bank. You have a labelled dataset that contai...
Question 172: You store and analyze your relational data in BigQuery on Go...
Question 173: You are running a streaming pipeline with Dataflow and are u...
Question 174: When creating a new Cloud Dataproc cluster with the projects...
Question 175: You need to modernize your existing on-premises data strateg...
Question 176: You are developing an application on Google Cloud that will ...
Question 177: To give a user read permission for only the first three colu...
Question 178: Your team is building a data lake platform on Google Cloud. ...
Question 179: Suppose you have a table that includes a nested column calle...
Question 180: You operate an IoT pipeline built around Apache Kafka that n...