dbtdata~10 mins

Multi-source fan-in patterns in dbt - Step-by-Step Execution

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Concept Flow - Multi-source fan-in patterns

Source Table A

↓

Transform A

↓

Fan-in Join

↓

Source Table B

↓

Transform B

↓

Final Output Table

Data flows from multiple source tables through transformations, then joins together in a fan-in pattern to create a combined output.

Execution Sample

dbt

with a as (
  select id, value from source_a
), b as (
  select id, value from source_b
)
select a.id, a.value, b.value as b_value
from a
join b on a.id = b.id

This SQL code combines data from two sources by joining on a common id.

Execution Table

Step	Action	Evaluation	Result
1	Read source_a	Table source_a loaded	Rows: 3 (id=1,2,3)
2	Read source_b	Table source_b loaded	Rows: 3 (id=2,3,4)
3	Transform a	Select id, value	a: [(1,10), (2,20), (3,30)]
4	Transform b	Select id, value	b: [(2,200), (3,300), (4,400)]
5	Join a and b on id	Match ids in both	Joined rows: [(2,20,200), (3,30,300)]
6	Output final table	Combined data	Final rows: 2 with columns (id, value, b_value)

💡 Join only includes ids present in both sources, so id=1 and id=4 are excluded.

Variable Tracker

Variable	Start	After Step 3	After Step 4	After Step 5	Final
a	empty	[(1,10), (2,20), (3,30)]	[(1,10), (2,20), (3,30)]	[(2,20), (3,30)]	[(2,20), (3,30)]
b	empty	empty	[(2,200), (3,300), (4,400)]	[(2,200), (3,300)]	[(2,200), (3,300)]
joined	empty	empty	empty	[(2,20,200), (3,30,300)]	[(2,20,200), (3,30,300)]

Key Moments - 2 Insights

Why are some ids missing from the final output?

What happens if one source has more rows than the other?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution_table at step 5, how many rows are in the joined result?

Concept Snapshot

Multi-source fan-in pattern:
- Extract data from multiple sources
- Transform each source separately
- Join transformed data on common keys
- Result is combined dataset with matching records only
- Useful for merging related data from different tables

Full Transcript

This visual execution shows how multi-source fan-in patterns work in dbt. We start by reading two source tables, source_a and source_b. Each is transformed by selecting relevant columns. Then, these transformed tables are joined on a common id column. The join keeps only rows where ids match in both tables, excluding unmatched rows. Variables 'a' and 'b' hold transformed data, and 'joined' holds the combined result. This pattern helps combine data from multiple sources into one clean output table.