dbtdata~10 mins

dbt-utils package tests - Step-by-Step Execution

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Concept Flow - dbt-utils package tests

Write test config in schema.yml

↓

Run dbt test command

↓

dbt-utils runs test SQL

↓

Test queries data in warehouse

↓

Results: Pass or Fail

↓

Review test output in CLI or logs

This flow shows how dbt-utils tests are configured, run, and produce pass/fail results.

Execution Sample

dbt

models:
  - name: my_model
    tests:
      - dbt_utils.unique_combination_of_columns:
          combination_of_columns:
            - id
            - email

This YAML config runs a dbt-utils test to check unique combinations of 'id' and 'email' columns in 'my_model'.

Execution Table

Step	Action	Test SQL Generated	Test Query Result	Test Outcome
1	dbt reads schema.yml and finds test config	N/A	N/A	N/A
2	dbt compiles test SQL for unique_combination_of_columns	SELECT id, email, COUNT() FROM my_model GROUP BY id, email HAVING COUNT() > 1	N/A	N/A
3	dbt runs test SQL against data warehouse	N/A	0 rows returned	Pass
4	dbt reports test result in CLI	N/A	N/A	Test Passed

💡 Test stops after running SQL and getting zero rows, meaning no duplicates found.

Variable Tracker

Variable	Start	After Step 2	After Step 3	Final
test_sql	None	SELECT id, email, COUNT() FROM my_model GROUP BY id, email HAVING COUNT() > 1	Executed	Executed
query_result	None	None	0 rows	0 rows
test_outcome	None	None	Pass	Pass

Key Moments - 3 Insights

Why does the test SQL use GROUP BY and HAVING COUNT(*) > 1?

What does zero rows returned from the test query mean?

How does dbt know which model to test?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution_table, what SQL does dbt generate for the unique_combination_of_columns test?

ASELECT * FROM my_model WHERE id IS NULL

BSELECT id, email, COUNT(*) FROM my_model GROUP BY id, email HAVING COUNT(*) > 1

CSELECT COUNT(*) FROM my_model

DSELECT DISTINCT id, email FROM my_model

Concept Snapshot

dbt-utils tests are configured in schema.yml under models.
Tests generate SQL queries to check data conditions.
Run 'dbt test' to execute tests.
Test passes if query returns zero rows (no issues).
Test fails if query returns rows (issues found).

Full Transcript

This visual execution shows how dbt-utils package tests work step-by-step. First, you write test configurations in schema.yml specifying the model and test type, such as unique_combination_of_columns. When you run 'dbt test', dbt reads this config and generates SQL to check the data. For the unique combination test, it groups data by specified columns and finds duplicates by counting rows greater than one. The SQL runs in the data warehouse. If no duplicates are found (zero rows returned), the test passes. The results are shown in the CLI. Variables like test_sql, query_result, and test_outcome change as the test runs. Common confusions include why grouping is used, what zero rows mean, and how dbt knows which model to test. The quizzes check understanding of SQL generated, when outcome is determined, and what results mean. The snapshot summarizes the key points for quick reference.