Bird
0
0

In a Kappa architecture using Kafka and Spark, what will happen if the streaming job is restarted without checkpointing?

medium📝 Predict Output Q5 of 15
Hadoop - Modern Data Architecture with Hadoop

In a Kappa architecture using Kafka and Spark, what will happen if the streaming job is restarted without checkpointing?

AThe job will resume from the last processed offset
BThe job will skip all data and process only new messages
CThe job will fail to start due to missing checkpoint
DThe job will start processing from the earliest offset
Step-by-Step Solution
Solution:
  1. Step 1: Understand checkpoint role in streaming

    Checkpoint stores offsets to resume processing from last point.
  2. Step 2: Effect of missing checkpoint on restart

    Without checkpoint, Spark starts from default "latest" offset, processing only new messages.
  3. Final Answer:

    The job will skip all data and process only new messages -> Option B
  4. Quick Check:

    No checkpoint means start from latest offset [OK]
Quick Trick: No checkpoint = process only new messages [OK]
Common Mistakes:
  • Thinking it resumes from last offset
  • Believing job fails without checkpoint
  • Assuming it starts from earliest offset

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More Hadoop Quizzes