dbtdata~15 mins

Organizing models in directories in dbt - Deep Dive

Choose your learning style10 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Overview - Organizing models in directories

What is it?

Organizing models in directories means arranging your dbt models into folders inside your project. Each folder can hold related SQL files that build parts of your data pipeline. This helps keep your project tidy and easier to understand. Instead of one big folder with many files, you group models by theme or function.

Why it matters

Without organizing models in directories, your dbt project can become messy and hard to navigate as it grows. This slows down development and increases mistakes. Good organization saves time, helps teams collaborate, and makes it easier to find and update models. It also helps dbt understand dependencies and run models efficiently.

Where it fits

Before this, you should know basic dbt model creation and how dbt runs SQL files. After learning this, you can explore advanced dbt features like model configurations, macros, and testing. Organizing models is a foundational skill that supports scaling your dbt projects.

Mental Model

Core Idea

Organizing models in directories is like sorting your tools into labeled drawers so you can quickly find and use the right one when building your data pipeline.

Think of it like...

Imagine a kitchen drawer where all your cooking tools are mixed together. Finding a whisk or a spatula takes time. But if you have separate drawers for utensils, knives, and gadgets, cooking becomes faster and less frustrating. Similarly, organizing dbt models into folders groups related work together for easy access.

dbt_project/
├── models/
│   ├── staging/
│   │   ├── customers.sql
│   │   └── orders.sql
│   ├── marts/
│   │   ├── sales/
│   │   │   └── sales_summary.sql
│   │   └── finance/
│   │       └── revenue.sql
│   └── README.md

Build-Up - 7 Steps

FoundationWhat is a dbt model file?

Concept: Introduce the basic unit of work in dbt: the model SQL file.

A dbt model is a single SQL file that defines a transformation query. When you run dbt, it turns these SQL files into tables or views in your database. Each model file lives inside the 'models' folder by default.

Result

You understand that each SQL file is a building block of your data pipeline.

Knowing that models are individual SQL files helps you see why organizing them matters as projects grow.

FoundationDefault model folder structure

IntermediateCreating subdirectories for model grouping

IntermediateUsing path-based model selection

IntermediateConfiguring models by directory

AdvancedHandling dependencies across directories

ExpertAdvanced directory patterns for large projects

Under the Hood

dbt scans the 'models' directory and all its subdirectories recursively to find SQL files. Each file is parsed and compiled into a SQL query. dbt builds a directed acyclic graph (DAG) of model dependencies using the {{ ref() }} function. This graph determines the order of execution. Folder structure does not affect dependency resolution but helps humans navigate the project.

Why designed this way?

dbt was designed to be flexible and simple. Automatically discovering models in subfolders avoids extra configuration, lowering the barrier to organizing projects. The separation of physical file structure and logical dependencies allows teams to organize code for readability without affecting execution logic.

dbt_project/
├── models/
│   ├── staging/
│   │   ├── customers.sql
│   │   └── orders.sql
│   ├── marts/
│   │   ├── sales/
│   │   │   └── sales_summary.sql
│   │   └── finance/
│   │       └── revenue.sql

Dependency Graph:
customers.sql ──┐
                ├─> orders.sql ──> sales_summary.sql
revenue.sql ────┘

Myth Busters - 4 Common Misconceptions

Quick: Does placing models in different folders change their execution order automatically? Commit yes or no.

Common Belief:If I put models in different folders, dbt will run them in folder order.

Tap to reveal reality

Quick: Can I only configure models individually, not by folder? Commit yes or no.

Common Belief:I must set configs like materialization inside each model file; folders don't help.

Tap to reveal reality

Quick: Does nesting folders infinitely always improve project clarity? Commit yes or no.

Common Belief:More folders and subfolders always make the project easier to understand.

Tap to reveal reality

Quick: Does dbt require extra config to find models in subfolders? Commit yes or no.

Common Belief:I need to tell dbt where each subfolder is in the config file.

Tap to reveal reality

Expert Zone

Folder names can be used as namespaces in model selectors, enabling precise targeting in commands.

Using folder-level configs can override model-level configs, so order and specificity matter.

Combining directory organization with dbt packages allows modular, reusable components across projects.

When NOT to use

Avoid deeply nested directories when your project is small or when team members are new to dbt. Instead, keep a flat structure for simplicity. For very large projects, consider splitting into multiple dbt packages or repositories to manage complexity.

Production Patterns

Teams often organize models by data source (staging), business domain (marts), and function (analytics). Folder-level configs enforce standards like materialization types. Selectors based on folders speed up CI/CD pipelines by running only changed parts.

Connections

Software Project Structure

Organizing code files into folders is a shared pattern between dbt models and software projects.

Understanding folder organization in software development helps grasp why dbt projects benefit from similar structure for maintainability.

Dependency Graphs

dbt uses dependency graphs to order model execution, similar to task scheduling in project management.

Knowing how dependency graphs work in other fields clarifies why folder order does not control execution order in dbt.

Library Classification in Libraries

Just like books are organized by topic and genre in a library, dbt models are organized by function and domain.

This cross-domain connection shows how organizing information for easy retrieval is a universal challenge.

Common Pitfalls

#1Assuming folder order controls model run order

Wrong approach:dbt run --select models/staging models/marts

Correct approach:dbt run --select staging+

Root cause:Misunderstanding that dbt runs models by dependency, not folder listing order.

#2Duplicating configs in every model file instead of using folder configs

Wrong approach:In every model SQL file: {{ config(materialized='table') }}

Correct approach:In dbt_project.yml: models: my_project: marts: +materialized: table

Root cause:Not knowing folder-level config options in dbt_project.yml.

#3Creating too many nested folders making navigation hard

Wrong approach:models/domain1/subdomainA/typeX/categoryY/model.sql

Correct approach:models/domain1/typeX/model.sql

Root cause:Believing deeper nesting always improves clarity without considering usability.

Key Takeaways

Organizing dbt models in directories groups related SQL files, making projects easier to navigate and maintain.

dbt automatically discovers models in subfolders, so you can organize freely without extra configuration.

Folder structure does not affect model execution order; dependencies defined by references control that.

Using folder-level configurations in dbt_project.yml reduces repetition and enforces consistency.

Balancing folder depth is key: too flat is messy, too deep is confusing; modularization can help scale.

Practice

(1/5)

1. Why is it helpful to organize dbt models into directories?

easy

A. It keeps the project clean and easier to manage.

B. It makes dbt run faster.

C. It prevents errors in SQL syntax.

D. It automatically creates dashboards.

Organizing models in directories in dbt - Deep Dive

Start learning this pattern below

Practice

Solution

Step 1: Understand project organization benefits

Step 2: Relate to dbt model management

Final Answer:

Quick Check:

Solution

Step 1: Understand dbt model referencing

Step 2: Check each option

Final Answer:

Quick Check:

Solution

Step 1: Identify model location

Step 2: Use correct reference syntax

Final Answer:

Quick Check:

Solution

Step 1: Understand model referencing in subfolders

Step 2: Identify cause of error

Final Answer:

Quick Check:

Solution

Step 1: Understand multi-level directory organization

Step 2: Check referencing simplicity

Final Answer:

Quick Check: