0
0
Apache Airflowdevops~5 mins

GCP operators (BigQuery, GCS, Dataflow) in Apache Airflow - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is the purpose of the BigQueryOperator in Airflow?
The BigQueryOperator runs SQL queries on Google BigQuery. It helps automate data analysis and loading tasks in workflows.
Click to reveal answer
beginner
How does the GoogleCloudStorageCreateBucketOperator help in Airflow workflows?
It creates a new bucket in Google Cloud Storage (GCS). This is useful for organizing and storing data files automatically.
Click to reveal answer
intermediate
What is the role of DataflowTemplateOperator in Airflow?
It launches a Dataflow job from a pre-built template. This operator helps run data processing pipelines without writing code each time.
Click to reveal answer
intermediate
Which Airflow operator would you use to load data from GCS to BigQuery?
The GCSToBigQueryOperator loads data stored in Google Cloud Storage into a BigQuery table.
Click to reveal answer
beginner
Why is it important to set the correct GCP connection in Airflow when using GCP operators?
The GCP connection provides authentication and project details. Without it, Airflow cannot access GCP services securely.
Click to reveal answer
Which Airflow operator runs SQL queries on BigQuery?
AGCSToBigQueryOperator
BGoogleCloudStorageCreateBucketOperator
CDataflowTemplateOperator
DBigQueryOperator
What does the GoogleCloudStorageCreateBucketOperator do?
ACreates a BigQuery dataset
BRuns a Dataflow job
CCreates a GCS bucket
DLoads data from GCS to BigQuery
Which operator launches a Dataflow job from a template?
ADataflowTemplateOperator
BBigQueryOperator
CGCSToBigQueryOperator
DGoogleCloudStorageCreateBucketOperator
To move data from GCS to BigQuery, which operator is best?
AGCSToBigQueryOperator
BGoogleCloudStorageCreateBucketOperator
CBigQueryOperator
DDataflowTemplateOperator
Why must you configure a GCP connection in Airflow for GCP operators?
ATo schedule tasks
BTo authenticate and access GCP services
CTo write SQL queries
DTo create DAGs
Explain how Airflow uses GCP operators to automate data workflows involving BigQuery, GCS, and Dataflow.
Think about each operator's role and how they connect in a data pipeline.
You got /5 concepts.
    Describe the steps to set up an Airflow DAG that loads data from GCS into BigQuery using GCP operators.
    Focus on the sequence of operators and configuration needed.
    You got /5 concepts.