<< Prev Question Next Question >>

Question 7/35

You need to create a job that does frequency analysis on input data. You will do this by writing a Mapper that uses TextInputFormat and splits each value (a line of text from an input file) into individual characters.
For each one of these characters, you will emit the character as a key and an InputWritable as the value.
As this will produce proportionally more intermediate data than input data, which two resources should you expect to be bottlenecks?

LEAVE A REPLY

Your email address will not be published. Required fields are marked *

Question List (35q)
Question 1: You have written a Mapper which invokes the following five c...
Question 2: For each intermediate key, each reducer task can emit:...
Question 3: The Hadoop framework provides a mechanism for coping with ma...
Question 4: You write MapReduce job to process 100 files in HDFS. Your M...
Question 5: Which process describes the lifecycle of a Mapper?...
Question 6: You have the following key-value pairs as output from your M...
Question 7: You need to create a job that does frequency analysis on inp...
Question 8: What types of algorithms are difficult to express in MapRedu...
Question 9: In a MapReduce job with 500 map tasks, how many map task att...
Question 10: In a MapReduce job, the reducer receives all values associat...
Question 11: In a MapReduce job, you want each of your input files proces...
Question 12: Indentify which best defines a SequenceFile?...
Question 13: MapReduce v2 (MRv2/YARN) splits which major functions of the...
Question 14: Workflows expressed in Oozie can contain:...
Question 15: When can a reduce class also serve as a combiner without aff...
Question 16: For each input key-value pair, mappers can emit:...
Question 17: Table metadata in Hive is:
Question 18: A client application creates an HDFS file named foo.txt with...
Question 19: A combiner reduces:
Question 20: You have user profile records in your OLPT database, that yo...
Question 21: Which best describes how TextInputFormat processes input fil...
Question 22: What is the disadvantage of using multiple reducers with the...
Question 23: Given a directory of files with the following structure: lin...
Question 24: In a large MapReduce job with m mappers and n reducers, how ...
Question 25: On a cluster running MapReduce v1 (MRv1), a TaskTracker hear...
Question 26: In the reducer, the MapReduce API provides you with an itera...
Question 27: What data does a Reducer reduce method process?...
Question 28: Identify the MapReduce v2 (MRv2 / YARN) daemon responsible f...
Question 29: All keys used for intermediate output from mappers must:...
Question 30: To process input key-value pairs, your mapper needs to lead ...
Question 31: You need to perform statistical analysis in your MapReduce j...
Question 32: Analyze each scenario below and indentify which best describ...
Question 33: Determine which best describes when the reduce method is fir...
Question 34: You need to move a file titled "weblogs" into HDFS. When you...
Question 35: Which best describes what the map method accepts and emits?...