Recall & Review

beginner

What is a source freshness check in dbt?

A source freshness check in dbt is a way to monitor how up-to-date your source data is by checking the age of the newest data record against defined thresholds.

Click to reveal answer

beginner

Which configuration key in dbt defines the freshness thresholds for a source?

The key freshness defines thresholds like warn_after and error_after to set limits on acceptable data age.

Click to reveal answer

intermediate

What happens if the source data is older than the error_after threshold in a freshness check?

dbt will mark the freshness check as failed and raise an error, signaling that the source data is too old and may need attention.

Click to reveal answer

intermediate

How do you define a freshness check for a source table in the sources.yml file?

You add a freshness block under the source with warn_after and error_after times, and specify the column to check for freshness.

Click to reveal answer

beginner

Why are source freshness checks important in data pipelines?

They help ensure that data is updated on time, so downstream analysis and reports use fresh and reliable data, preventing decisions based on stale information.

Click to reveal answer

What does the warn_after threshold in a freshness check do?

AAutomatically refreshes the source data

BStops the dbt run immediately

CDeletes old data from the source

DTriggers a warning if data is older than this time

Where do you define source freshness checks in dbt?

AIn the <code>profiles.yml</code> file

BIn the <code>sources.yml</code> file

CIn the model SQL files

DIn the <code>dbt_project.yml</code> file

What column type is typically used for freshness checks?

AInteger column

BText column

CTimestamp or date column

DBoolean column

If a freshness check fails with an error, what should you do?

AInvestigate why the source data is stale and fix the data pipeline

BIgnore the error and continue

CDelete the source table

DChange the <code>warn_after</code> threshold to a higher value

Which dbt command runs source freshness checks?

A<code>dbt source freshness</code>

B<code>dbt run</code>

C<code>dbt test</code>

D<code>dbt compile</code>

Explain how to set up a source freshness check in dbt and why it is useful.

Describe what happens when source data exceeds the error_after threshold in a freshness check.

Practice

(1/5)

1. What is the main purpose of source freshness checks in dbt?

easy

A. To track how recent the data in your source tables is

B. To create new tables from raw data

C. To optimize SQL query performance

D. To schedule dbt runs automatically

Source freshness checks in dbt - Cheat Sheet & Quick Revision

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of freshness checks

Step 2: Compare options to the purpose

Final Answer:

Quick Check:

Solution

Step 1: Recall correct YAML syntax for freshness

Step 2: Match options to syntax

Final Answer:

Quick Check:

Solution

Step 1: Calculate data age from last loaded timestamp

Step 2: Compare data age to thresholds

Final Answer:

Quick Check:

Solution

Step 1: Check period values in freshness YAML

Step 2: Identify error cause

Final Answer:

Quick Check:

Solution

Step 1: Identify correct period and count values

Step 2: Check warn_after and error_after order

Step 3: Validate options

Final Answer:

Quick Check: