dbtdata~10 mins

Source freshness checks in dbt - Step-by-Step Execution

Choose your learning style10 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Concept Flow - Source freshness checks

Define source in dbt project

↓

Configure freshness criteria

↓

Run dbt source freshness command

↓

dbt queries source metadata

↓

Compare source timestamps to criteria

↓

Report freshness status

↓

Take action if stale or warn

↓

END

This flow shows how dbt checks source data freshness by defining sources, setting freshness rules, running checks, and reporting results.

Execution Sample

dbt

sources:
  - name: my_source
    tables:
      - name: users
        freshness:
          warn_after: {count: 12, period: hour}
          error_after: {count: 24, period: hour}

This YAML config defines a source table with freshness thresholds for warnings and errors.

Execution Table

Step	Action	Source Timestamp	Current Time	Age (hours)	Check Result	Message
1	Start freshness check	-	2024-06-01 12:00	-	Pending	Starting check for source 'users'
2	Query source metadata	2024-06-01 01:30	2024-06-01 12:00	10.5	Within threshold	Source data is fresh
3	Compare to warn_after (12h)	10.5 < 12	N/A	True	No warning	No freshness warning needed
4	Compare to error_after (24h)	10.5 < 24	N/A	True	No error	No freshness error needed
5	Finish check	-	-	-	Success	Source freshness check passed

💡 Source timestamp age 10.5 hours is less than warn_after 12 hours, so freshness is good.

Variable Tracker

Variable	Start	After Step 2	After Step 3	After Step 4	Final
source_timestamp	-	2024-06-01 01:30	2024-06-01 01:30	2024-06-01 01:30	2024-06-01 01:30
current_time	2024-06-01 12:00	2024-06-01 12:00	2024-06-01 12:00	2024-06-01 12:00	2024-06-01 12:00
age_hours	-	10.5	10.5	10.5	10.5
check_result	Pending	Within threshold	No warning	No error	Success

Key Moments - 2 Insights

Why does dbt compare the source timestamp age to both warn_after and error_after?

What happens if the source timestamp is missing or null?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table, what is the age in hours of the source data at step 2?

A12

B24

C10.5

D1.5

Concept Snapshot

Source freshness checks in dbt:
- Define sources and tables in YAML
- Set freshness thresholds: warn_after and error_after
- Run 'dbt source freshness' to check timestamps
- Compare source data age to thresholds
- Report status: fresh, warning, or error
- Helps monitor data timeliness automatically

Full Transcript

Source freshness checks in dbt help ensure your data is up-to-date. You define your source tables and set freshness rules like warn_after and error_after in your dbt project YAML. When you run the freshness check, dbt queries the source metadata to get the latest timestamp. It calculates how old the data is by comparing the source timestamp to the current time. Then it compares this age to your thresholds. If the data is younger than warn_after, it passes with no warnings. If it is older than warn_after but younger than error_after, dbt issues a warning. If it is older than error_after, it triggers an error. This process helps catch stale data early and keeps your analytics reliable.

Practice

(1/5)

1. What is the main purpose of source freshness checks in dbt?

easy

A. To track how recent the data in your source tables is

B. To create new tables from raw data

C. To optimize SQL query performance

D. To schedule dbt runs automatically

Source freshness checks in dbt - Step-by-Step Execution

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of freshness checks

Step 2: Compare options to the purpose

Final Answer:

Quick Check:

Solution

Step 1: Recall correct YAML syntax for freshness

Step 2: Match options to syntax

Final Answer:

Quick Check:

Solution

Step 1: Calculate data age from last loaded timestamp

Step 2: Compare data age to thresholds

Final Answer:

Quick Check:

Solution

Step 1: Check period values in freshness YAML

Step 2: Identify error cause

Final Answer:

Quick Check:

Solution

Step 1: Identify correct period and count values

Step 2: Check warn_after and error_after order

Step 3: Validate options

Final Answer:

Quick Check: