0
0
GCPcloud~5 mins

Data pipeline patterns in GCP - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is a data pipeline in cloud computing?
A data pipeline is a set of steps that move and process data from one place to another, often automatically and continuously, like a water pipe moving water from a source to a destination.
Click to reveal answer
beginner
What is the 'batch processing' pattern in data pipelines?
Batch processing collects data over time and processes it all at once, like baking a big batch of cookies instead of one at a time.
Click to reveal answer
beginner
Explain the 'streaming processing' pattern in data pipelines.
Streaming processing handles data continuously as it arrives, like a conveyor belt where items are processed one by one without waiting.
Click to reveal answer
intermediate
What is the role of Google Cloud Pub/Sub in data pipelines?
Google Cloud Pub/Sub acts like a message bus that moves data messages between services quickly and reliably, supporting streaming data pipelines.
Click to reveal answer
intermediate
Why use Dataflow in a data pipeline?
Dataflow runs data processing tasks that can handle both batch and streaming data, automatically scaling and managing resources so you don't have to worry about the details.
Click to reveal answer
Which data pipeline pattern processes data in real-time as it arrives?
AStreaming processing
BBatch processing
CManual processing
DArchival processing
What GCP service is best for messaging between components in a streaming pipeline?
ACloud Storage
BBigQuery
CCloud Pub/Sub
DCloud SQL
Which service can handle both batch and streaming data processing in GCP?
ADataflow
BCloud Functions
CCloud Run
DCloud Spanner
Batch processing is best described as:
AProcessing data one piece at a time immediately
BIgnoring data until manually triggered
CProcessing data only during business hours
DCollecting data and processing it all at once
Which pattern would you choose for a pipeline that needs to analyze sensor data instantly?
ABatch processing
BStreaming processing
CArchival processing
DManual processing
Describe the main differences between batch and streaming data pipeline patterns.
Think about how and when data is processed.
You got /4 concepts.
    Explain how Google Cloud services like Pub/Sub and Dataflow work together in a streaming data pipeline.
    Consider messaging and processing roles.
    You got /4 concepts.