Recall & Review
beginner
What is the main difference between Apache Spark and Hadoop MapReduce in terms of data processing?
Apache Spark processes data in-memory, making it faster, while Hadoop MapReduce reads and writes data to disk between each step, which is slower.
intermediate
How does Spark handle iterative algorithms compared to Hadoop MapReduce?
Spark keeps data in memory across iterations, speeding up iterative algorithms, whereas Hadoop MapReduce reloads data from disk each time, slowing down the process.
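The caching behavior described above can be sketched in PySpark. This is a minimal illustration, assuming a local PySpark installation; the helper names (`update`, `run_iterations`) and the toy update rule are invented for the example, not part of any framework API.

```python
def update(estimate, x, lr=0.1):
    """One toy iteration step: nudge a running estimate toward a value."""
    return estimate + lr * (x - estimate)

def run_iterations(n_iter=10):
    # Requires `pip install pyspark`; imported here so the file loads without it.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[*]").appName("cache-demo").getOrCreate()
    data = spark.sparkContext.parallelize(range(1000))
    # cache() keeps the RDD's partitions in memory, so every later pass reuses
    # them directly. A chain of MapReduce jobs would instead write results to
    # disk after each job and read them back at the start of the next.
    data.cache()

    estimate = 0.0
    for _ in range(n_iter):
        mean = data.mean()          # served from the in-memory cache after pass 1
        estimate = update(estimate, mean)

    spark.stop()
    return estimate

# With Spark installed, calling run_iterations() runs all passes against the
# cached dataset; the pure-Python step works standalone:
# update(0.0, 10.0) -> 1.0
```

The `cache()` call is the whole point of the sketch: without it, Spark would recompute the RDD lineage on each pass, which is closer to the MapReduce behavior the card contrasts against.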
beginner
Which framework supports real-time stream processing: Spark or Hadoop MapReduce?
Apache Spark supports real-time stream processing through Spark Streaming, while Hadoop MapReduce is designed mainly for batch processing.
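The stream-processing difference can be sketched with Spark's Structured Streaming word count. This is an illustrative sketch assuming a local PySpark installation and a text source on `localhost:9999` (e.g. `nc -lk 9999`); `split_words` and `run_stream` are example names, not Spark APIs.

```python
def split_words(line):
    """Pure helper mirroring the split the streaming query performs."""
    return line.strip().split()

def run_stream():
    # Requires `pip install pyspark`; imported here so the file loads without it.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import explode, split

    spark = SparkSession.builder.master("local[*]").appName("stream-demo").getOrCreate()

    # Treat an unbounded socket stream of lines as a continuously growing table.
    lines = (spark.readStream.format("socket")
             .option("host", "localhost")
             .option("port", 9999)
             .load())

    # Running word count, updated as new lines arrive. MapReduce has no
    # equivalent: each MapReduce job processes a fixed batch and terminates.
    words = lines.select(explode(split(lines.value, " ")).alias("word"))
    counts = words.groupBy("word").count()

    query = counts.writeStream.outputMode("complete").format("console").start()
    query.awaitTermination()
```

With Spark installed, `run_stream()` keeps running and reprints updated counts as lines arrive on the socket, which is what "real-time stream processing" means in the card above.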
intermediate
What programming languages can you use with Apache Spark that are not natively supported by Hadoop MapReduce?
Apache Spark supports Scala, Python, Java, and R, while Hadoop MapReduce primarily supports Java.
advanced
Why might a company choose Hadoop MapReduce over Spark despite Spark's speed advantages?
Hadoop MapReduce can be more cost-effective for very large batch jobs on disk-based storage, has a mature ecosystem, and requires far less memory than Spark, which matters on clusters with limited RAM.
Which of the following best describes Apache Spark's data processing?
Apache Spark processes data in memory, which speeds up computation compared to disk-based methods.
Hadoop MapReduce is best suited for which type of processing?
Hadoop MapReduce is designed primarily for batch processing of large datasets.
Which language is NOT natively supported by Hadoop MapReduce but is supported by Spark?
Spark natively supports Scala, Python, and R, while Hadoop MapReduce mainly supports Java.
What feature allows Spark to speed up iterative algorithms?
Spark stores data in memory across iterations, reducing disk reads and speeding up iterative tasks.
Which framework would be better for a company with limited memory resources?
Hadoop MapReduce uses disk storage and requires less memory, making it better for limited memory environments.
Explain the main performance differences between Apache Spark and Hadoop MapReduce.
Think about how each framework handles data during processing.
Describe scenarios where Hadoop MapReduce might be preferred over Apache Spark.
Consider resource availability and job types.