Bird
0
0

Given this dbt model config:

medium📝 Predict Output Q13 of 15
dbt - Performance Optimization
Given this dbt model config:
config(
  materialized='table',
  partition_by={'field': 'order_date', 'data_type': 'date'},
  cluster_by=['customer_id']
)

What is the effect of this configuration?
AData is partitioned by order_date and clustered by customer_id
BData is partitioned by customer_id and clustered by order_date
CData is clustered by order_date only
DData is partitioned by order_date only, no clustering
Step-by-Step Solution
Solution:
  1. Step 1: Identify partitioning field

    The config uses partition_by={'field': 'order_date', 'data_type': 'date'}, so partitioning is by order_date.
  2. Step 2: Identify clustering field

    The config sets cluster_by=['customer_id'], so clustering is by customer_id.
  3. Final Answer:

    Data is partitioned by order_date and clustered by customer_id -> Option A
  4. Quick Check:

    Partition = order_date, Clustering = customer_id [OK]
Quick Trick: Partition_by sets partition, cluster_by sets cluster columns [OK]
Common Mistakes:
MISTAKES
  • Swapping partition and clustering columns
  • Ignoring clustering setting
  • Assuming clustering replaces partitioning

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More dbt Quizzes