Your customer is worried that the redundancy of HDFS will not meet its needs. The customer needs to store certain files with higher levels of redundancy than other files.
Which architectural feature of HDFS should you choose?
Correct Answer: D
Explanation/Reference:
You can change the replication factor on a per-file basis using the Hadoop FS shell.
To set replication of an individual file to 4:
./bin/hadoop dfs -setrep -w 4 /path/to/file
Incorrect Answers:
A: Cloudera Impala is Cloudera's open source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop.
C: Apache Scala is a high-level programming language which is a combination of object-oriented programming and functional programming. It is highly scalable which is why it is called Scala.
References: https://sites.google.com/site/hadoopandhive/home/how-to-change-replication-factor-of- existing-files-in-hdfs