Bird
0
0

Given this dbt model config:

medium📝 Predict Output Q4 of 15
dbt - Performance Optimization
Given this dbt model config:
config(
  materialized='table',
  partition_by={'field': 'created_at', 'data_type': 'date'},
  clustering=['user_id']
)

What is the effect of this configuration?
AData is clustered by 'created_at' and partitioned by 'user_id'
BData is partitioned by 'created_at' date and clustered by 'user_id' for faster queries
CData is only partitioned by 'user_id' ignoring 'created_at'
DData is neither partitioned nor clustered
Step-by-Step Solution
Solution:
  1. Step 1: Analyze partition_by parameter

    Partitioning is done on 'created_at' column with data type 'date'.
  2. Step 2: Analyze clustering parameter

    Clustering is applied on 'user_id' column to sort data within partitions.
  3. Final Answer:

    Data is partitioned by 'created_at' date and clustered by 'user_id' for faster queries -> Option B
  4. Quick Check:

    Partition by 'created_at' + cluster by 'user_id' = Data is partitioned by 'created_at' date and clustered by 'user_id' for faster queries [OK]
Quick Trick: Partition by date column, cluster by user or ID columns [OK]
Common Mistakes:
MISTAKES
  • Swapping partition and clustering columns
  • Ignoring data_type in partitioning
  • Assuming clustering applies to partition column

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More dbt Quizzes