[Solved] Given the following PySpark code snippet:df.write.format('orc').save('/data/output')What will be the format of the saved file? | Hadoop

Hadoop - Performance Tuning

Given the following PySpark code snippet:

df.write.format('orc').save('/data/output')

What will be the format of the saved file?

AAvro

BORC

CCSV

DParquet

Step-by-Step Solution

Solution:

Step 1: Analyze the write format method
The code uses df.write.format('orc'), which specifies the ORC format for saving data.
Step 2: Confirm the output format
The save method writes the data in ORC format to the specified path.
Final Answer:
ORC -> Option B
Quick Check:
Write format 'orc' = ORC file [OK]

Quick Trick: format('orc') saves data in ORC format [OK]

Common Mistakes:

Master "Performance Tuning" in Hadoop

9 interactive learning modes - each teaches the same concept differently

More Hadoop Quizzes

Given the following PySpark code snippet: