dbtdata~10 mins

Configuring sources in YAML in dbt - Visual Walkthrough

Choose your learning style10 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Concept Flow - Configuring sources in YAML

Start YAML file

↓

Define 'sources' key

↓

Add source name

↓

Add tables under source

↓

Specify table details (name, description)

↓

Save YAML

↓

dbt reads source config

↓

Sources available for models

This flow shows how to write a YAML file to define data sources and tables for dbt to use in models.

Execution Sample

dbt

sources:
  - name: raw_data
    tables:
      - name: users
        description: 'User data from app'

Defines a source named 'raw_data' with a table 'users' and a description.

Execution Table

Step	YAML Line	Action	State Change	Result
1	sources:	Start defining sources	Create 'sources' key	Empty list for sources
2	- name: raw_data	Add source name	Append source dict with name 'raw_data'	sources = [{'name': 'raw_data'}]
3	tables:	Add tables key	Add empty 'tables' list to source	sources[0]['tables'] = []
4	- name: users	Add table name	Append table dict with name 'users'	sources[0]['tables'] = [{'name': 'users'}]
5	description: 'User data from app'	Add description	Add description to table dict	sources[0]['tables'][0]['description'] = 'User data from app'
6	End of YAML	Finish parsing	YAML fully parsed	Source config ready for dbt

💡 Reached end of YAML file, source configuration complete.

Variable Tracker

Variable	Start	After Step 2	After Step 3	After Step 4	After Step 5	Final
sources	undefined	[{'name': 'raw_data'}]	[{'name': 'raw_data', 'tables': []}]	[{'name': 'raw_data', 'tables': [{'name': 'users'}]}]	[{'name': 'raw_data', 'tables': [{'name': 'users', 'description': 'User data from app'}]}]	[{'name': 'raw_data', 'tables': [{'name': 'users', 'description': 'User data from app'}]}]

Key Moments - 2 Insights

Why do we indent 'tables' under the source name?

What happens if we forget to add a description for a table?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table, what is the state of 'sources' after step 4?

A[{'name': 'raw_data', 'tables': []}]

B[{'name': 'raw_data', 'tables': [{'name': 'users'}]}]

C[{'name': 'raw_data'}]

Dundefined

Concept Snapshot

Configuring sources in YAML for dbt:
- Use 'sources:' at top level
- Define each source with '- name:'
- Under each source, add 'tables:' list
- Each table has '- name:' and optional 'description:'
- Indentation shows hierarchy
- dbt reads this to know where data comes from

Full Transcript

This visual execution shows how to configure sources in YAML for dbt. We start by creating a 'sources' key, then add a source name. Under that source, we add a 'tables' list. Each table has a name and can have a description. Indentation is important to show the structure. The execution table traces each step of parsing the YAML lines and how the internal data structure changes. The variable tracker shows how the 'sources' variable builds up step by step. Key moments clarify why indentation matters and the role of descriptions. The quiz tests understanding of the state after steps and the effect of missing keys. This helps beginners see exactly how dbt reads source configs from YAML.

Practice

(1/5)

1. What is the main purpose of configuring sources in a dbt YAML file?

easy

A. To write SQL queries for data transformation

B. To tell dbt where to find raw data tables

C. To create dashboards for data visualization

D. To schedule dbt runs automatically

5. You want to add a test to ensure the 'email' column in the 'users' table source is never null. Which YAML snippet correctly adds this test?

hard

A. sources: - name: app_data tables: - name: users columns: - name: email tests: - not_null

B. sources: - name: app_data tables: - name: users tests: - column: email test: not_null

C. sources: - name: app_data tables: - users: columns: - email: tests: - not_null

D. sources: - name: app_data tables: - name: users columns: - email test: not_null

Configuring sources in YAML in dbt - Visual Walkthrough

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of source configuration

Step 2: Differentiate from other dbt tasks

Final Answer:

Quick Check:

Solution

Step 1: Recall correct YAML source structure

Step 2: Compare options to syntax

Final Answer:

Quick Check:

Solution

Step 1: Locate the 'loaded_at_field' key in YAML

Step 2: Identify the value assigned

Final Answer:

Quick Check:

Solution

Step 1: Understand dbt freshness period syntax

Step 2: Check the YAML periods

Step 3: Rule out other options

Final Answer:

Quick Check:

Solution

Step 1: Recall correct test syntax in source YAML

Step 2: Check each option's structure

Step 3: Identify errors in other options

Final Answer:

Quick Check: