To run a Spark job on Azure HDInsight, which of the following is a necessary step before job submission?

Hard · Application · Q8 of 15
Hadoop - Modern Data Architecture with Hadoop
A. Install Spark manually on each HDInsight node
B. Upload the Spark application JAR file to Azure Blob Storage
C. Configure Hadoop YARN to disable Spark support
D. Use Azure Data Factory to create a virtual machine
Step-by-Step Solution
  1. Step 1: Understand HDInsight Spark job requirements

    HDInsight Spark clusters are provisioned with Spark already installed, so manual installation on each node (Option A) is unnecessary.
  2. Step 2: Identify job submission prerequisites

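    The cluster must be able to read the application JAR at submission time, so the JAR is typically uploaded to the Azure Blob Storage account attached to the cluster before the job is submitted.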
  3. Final Answer:

    Upload the Spark application JAR file to Azure Blob Storage -> Option B
  4. Quick Check:

    Job files are uploaded to Blob Storage before submission, which matches Option B.
Quick Trick: Upload the Spark JAR to Blob Storage before submitting the job.
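
The upload-then-submit order can be sketched as follows. This is a minimal illustration, not an official procedure: it assumes the JAR has already been uploaded to a Blob Storage container attached to the cluster, and that the job is submitted through the cluster's Livy batch endpoint. The storage account, container, cluster, and class names are hypothetical, and the code only builds the request; it does not send anything.

```python
import json


def wasbs_uri(container: str, account: str, blob_path: str) -> str:
    """Build the wasbs:// URI that points at a blob in an Azure Storage container."""
    return f"wasbs://{container}@{account}.blob.core.windows.net/{blob_path.lstrip('/')}"


def livy_batch_request(cluster: str, jar_uri: str, main_class: str, args=None):
    """Return (url, headers, body) for a Livy batch submission. Nothing is sent."""
    url = f"https://{cluster}.azurehdinsight.net/livy/batches"
    headers = {"Content-Type": "application/json", "X-Requested-By": "admin"}
    body = json.dumps({"file": jar_uri, "className": main_class, "args": list(args or [])})
    return url, headers, body


# Hypothetical names, for illustration only.
jar = wasbs_uri("jobs", "examplestorage", "/jars/wordcount.jar")
url, headers, body = livy_batch_request(
    "examplecluster", jar, "com.example.WordCount", ["input.txt"]
)

print(jar)   # location of the JAR that was uploaded beforehand
print(url)   # endpoint the request would be POSTed to
print(body)  # JSON payload referencing the uploaded JAR
```

The point of the sketch is that the submission payload only references the JAR by its storage URI, which is why the file has to exist in Blob Storage before the job is submitted.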
Common Mistakes:
  • Trying to install Spark manually on HDInsight
  • Disabling Spark support in YARN
  • Assuming Azure Data Factory is a tool for creating virtual machines (it orchestrates data pipelines)
