0
0
Hadoopdata~5 mins

Kappa architecture (streaming only) in Hadoop - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is the main idea behind Kappa architecture?
Kappa architecture processes data only as a stream, avoiding separate batch layers. It treats all data as a continuous flow for real-time processing.
Click to reveal answer
intermediate
How does Kappa architecture handle data reprocessing?
It reprocesses data by replaying the entire stream from the beginning, using the same streaming code, instead of having a separate batch process.
Click to reveal answer
beginner
Which component is central in Kappa architecture for storing data streams?
A durable log system like Apache Kafka stores the data stream, allowing replay and fault tolerance.
Click to reveal answer
intermediate
Why might Kappa architecture be simpler than Lambda architecture?
Because it uses only one processing path (streaming), it avoids maintaining separate batch and streaming code, reducing complexity.
Click to reveal answer
beginner
Name a common streaming processing framework used in Kappa architecture.
Apache Flink, Apache Spark Streaming, or Apache Kafka Streams are popular frameworks for processing streams in Kappa architecture.
Click to reveal answer
What type of data processing does Kappa architecture use?
AOnly streaming processing
BOnly batch processing
CBoth batch and streaming processing
DNeither batch nor streaming processing
How does Kappa architecture reprocess data?
ABy running a batch job on stored data
BBy replaying the data stream from the start
CBy ignoring old data
DBy manual data correction
Which system is commonly used to store streams in Kappa architecture?
ARelational database
BDistributed file system
CDurable log like Kafka
DIn-memory cache
What is a key benefit of Kappa architecture over Lambda architecture?
ASimpler code maintenance
BSupports batch processing
CRequires more storage
DUses multiple processing paths
Which framework is NOT typically used in Kappa architecture?
AApache Flink
BApache Spark Streaming
CApache Kafka Streams
DHadoop MapReduce batch jobs
Explain how Kappa architecture processes and reprocesses data using streaming.
Think about how data flows and how old data is handled.
You got /4 concepts.
    Compare Kappa architecture with Lambda architecture focusing on complexity and processing layers.
    Focus on the number of processing paths and maintenance.
    You got /4 concepts.