
Data Fusion for ETL in GCP - Cheat Sheet & Quick Revision

Recall & Review
beginner
What is Google Cloud Data Fusion?
Google Cloud Data Fusion is a fully managed, cloud-native data integration service, built on the open source CDAP project, that lets you build and manage ETL (Extract, Transform, Load) pipelines visually, with little or no code.
beginner
What does ETL stand for in Data Fusion?
ETL stands for Extract, Transform, Load. It means taking data from sources, changing it to the needed format, and loading it into a destination for analysis or storage.
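The three steps can be sketched in plain Python. This is only an illustration of the ETL idea, not Data Fusion's API: the function names, the sample CSV, and the `sales` table are all hypothetical, and an in-memory SQLite database stands in for the destination.

```python
import csv
import io
import sqlite3

# Hypothetical sample source data (assumption, not from Data Fusion).
RAW_CSV = """name,amount
alice, 10.5
BOB,3
"""

def extract(text):
    """Extract: read rows from a CSV source into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: tidy the names and convert amounts to floats."""
    return [(r["name"].strip().title(), float(r["amount"])) for r in rows]

def load(records, conn):
    """Load: write the cleaned records into a destination table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")  # stand-in destination
load(transform(extract(RAW_CSV)), conn)
result = conn.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()
print(result)
```

In Data Fusion each of these functions would correspond to a node you place on the canvas (a source, a transformation, a sink) rather than code you write.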
intermediate
How does Data Fusion simplify building ETL pipelines?
Data Fusion provides a drag-and-drop interface with pre-built connectors and transformations, so you can design pipelines visually instead of coding them by hand, which makes data preparation easier and faster.
beginner
What is a pipeline in Data Fusion?
A pipeline is a sequence of steps that extracts data from sources, transforms it, and loads it into targets. In Data Fusion, pipelines are visual workflows that automate data processing.
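The "sequence of steps" idea can be pictured as an ordered list of stages, each feeding its output to the next. The sketch below is a conceptual model only; `run_pipeline` and the three stage functions are hypothetical names and do not reflect how Data Fusion executes pipelines internally.

```python
from functools import reduce

def run_pipeline(data, stages):
    """Apply each stage in order, passing one stage's output to the next."""
    return reduce(lambda acc, stage: stage(acc), stages, data)

# Hypothetical stages, chosen only for illustration.
def drop_blank(rows):
    """Remove empty or whitespace-only values."""
    return [r for r in rows if r.strip()]

def to_upper(rows):
    """Normalise values to upper case."""
    return [r.upper() for r in rows]

def dedupe(rows):
    """Remove duplicates while keeping first-seen order."""
    return list(dict.fromkeys(rows))

stages = [drop_blank, to_upper, dedupe]
output = run_pipeline(["a", "", "b", "A"], stages)
print(output)
```

Reordering the `stages` list changes the result, which is why the order of nodes in a visual pipeline matters too.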
beginner
Name two common data sources you can connect to with Data Fusion.
You can connect to sources like Google Cloud Storage and BigQuery, among many others, to extract data for your ETL pipelines.
What is the main purpose of Google Cloud Data Fusion?
A. To build and manage ETL pipelines visually
B. To store large amounts of data
C. To run virtual machines
D. To create machine learning models
In ETL, what does the 'Transform' step do?
A. Extracts data from sources
B. Changes data into the needed format
C. Loads data into the destination
D. Deletes old data
Which interface does Data Fusion use to build pipelines?
A. Spreadsheet
B. Command line only
C. Text editor
D. Drag-and-drop visual interface
Which of these is NOT a typical Data Fusion pipeline step?
A. Compile
B. Transform
C. Load
D. Extract
Which Google Cloud service is commonly used as a data source in Data Fusion?
A. App Engine
B. Compute Engine
C. BigQuery
D. Cloud Functions
Explain how Google Cloud Data Fusion helps in building ETL pipelines.
Think about how it makes ETL easier for people who don't want to write code.
Describe the three main steps of ETL and their roles in data processing.
ETL is like preparing ingredients, cooking, and serving a meal.