Bird
0
0

In Snowflake, what is the effect of adding cluster_by=['customer_id'] in a dbt model config for a large table?

medium📝 Predict Output Q5 of 15
dbt - Performance Optimization
In Snowflake, what is the effect of adding cluster_by=['customer_id'] in a dbt model config for a large table?
ACreates an index on customer_id for faster lookups
BAutomatically partitions the table by customer_id
CDuplicates data for each customer_id to speed up queries
DImproves query performance on filters involving customer_id by organizing data storage
Step-by-Step Solution
Solution:
  1. Step 1: Understand clustering in Snowflake

    Clustering reorganizes data storage to group rows with similar customer_id values.
  2. Step 2: Clarify what clustering does not do

    It does not partition the table, create indexes, or duplicate data.
  3. Final Answer:

    Improves query performance on filters involving customer_id by organizing data storage -> Option D
  4. Quick Check:

    Snowflake clustering = better query performance by data organization [OK]
Quick Trick: Clustering groups data, speeding filter queries [OK]
Common Mistakes:
MISTAKES
  • Confusing clustering with partitioning
  • Thinking clustering creates indexes
  • Assuming data duplication occurs

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More dbt Quizzes