
Why documentation makes data discoverable in dbt - Visual Breakdown

Concept Flow - Why documentation makes data discoverable
Start: Data exists in dbt models
Add documentation to models
Generate docs site with dbt docs
Users search and explore docs
Users find data definitions and lineage
Data becomes discoverable and trusted
This flow shows how adding documentation to dbt models leads to generating a docs site, which users explore to find and trust data.
Execution Sample
dbt
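models/sales.sql:
select * from raw.sales

models/schema.yml:
version: 2
models:
  - name: sales
    description: "Sales data by region"

# Run: dbt docs generate
# Run: dbt docs serve
This code defines a model, documents it with a description in a YAML properties file, then generates and serves the documentation site. dbt reads descriptions from YAML properties files (or doc blocks they reference); a SQL comment in the model is not picked up as a description.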
Execution Table
| Step | Action | Input | Output | Effect |
|------|--------|-------|--------|--------|
| 1 | Create dbt model and describe it | SQL file plus a YAML description entry | Model files saved | Documentation added to model metadata |
| 2 | Run 'dbt docs generate' | Model files with docs | Documentation site files | Docs site created with model info |
| 3 | Run 'dbt docs serve' | Docs site files | Local web server running | Docs site accessible in browser |
| 4 | User opens docs site | Browser request | Docs homepage loads | User sees searchable data docs |
| 5 | User searches for 'sales' | Search input | Filtered docs page | User finds sales data details |
| 6 | User views data lineage | Open the lineage graph | Lineage graph shown | User understands data source and flow |
| 7 | End | User satisfied | Data discovered | Data is discoverable and trusted |
💡 Process ends when user finds and understands data through documentation site.
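Steps 5 and 6 can be approximated offline with a short Python sketch. It assumes a hand-written, heavily trimmed stand-in for the `nodes` section of dbt's `manifest.json` artifact, using only the keys `name`, `description`, and `depends_on.nodes`. The project name `shop`, the `stg_sales` and `customers` models, and the helpers `search_models` and `upstream` are hypothetical, and this is not how the docs site itself implements search.

```python
# Hand-written stand-in for part of manifest.json; real manifests carry many more keys.
MANIFEST = {
    "nodes": {
        "model.shop.stg_sales": {
            "name": "stg_sales",
            "description": "Cleaned raw sales rows",
            "depends_on": {"nodes": []},
        },
        "model.shop.sales": {
            "name": "sales",
            "description": "Sales data by region",
            "depends_on": {"nodes": ["model.shop.stg_sales"]},
        },
        "model.shop.customers": {
            "name": "customers",
            "description": "",
            "depends_on": {"nodes": []},
        },
    }
}


def search_models(manifest, keyword):
    """Return sorted names of models whose name or description mentions keyword."""
    keyword = keyword.lower()
    hits = []
    for node in manifest["nodes"].values():
        text = f"{node['name']} {node.get('description', '')}".lower()
        if keyword in text:
            hits.append(node["name"])
    return sorted(hits)


def upstream(manifest, unique_id):
    """Return the IDs of all ancestors, found by following depends_on edges."""
    seen = []
    stack = list(manifest["nodes"][unique_id]["depends_on"]["nodes"])
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.append(current)
        parent = manifest["nodes"].get(current)
        if parent:
            stack.extend(parent["depends_on"]["nodes"])
    return seen


print(search_models(MANIFEST, "sales"))
print(upstream(MANIFEST, "model.shop.sales"))
```

A model with an empty description, like `customers` here, can only be found by its name, which is the practical cost of leaving models undocumented.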
Variable Tracker
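| Variable | Start | After Step 1 | After Step 2 | After Step 3 | After Step 4 | After Step 5 | After Step 6 | Final |
|----------|-------|--------------|--------------|--------------|--------------|--------------|--------------|-------|
| Documentation | None | Added to model | Included in docs site files | Docs site running | Docs site loaded | Search results filtered | Lineage graph displayed | Data discoverable |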
Key Moments - 3 Insights
Why do we need to run 'dbt docs generate' after adding documentation?
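Because 'dbt docs generate' reads the project and writes the files the docs site is built from (manifest.json and catalog.json in the target directory), including the new documentation. Without this step, the docs site keeps showing the previously generated content. See execution_table step 2.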
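One way to confirm that documentation reached the generated files is to look a model up in `manifest.json`, which dbt writes to the `target/` directory by default. The Python sketch below is an illustration under assumptions: the helper `model_description` and the project name `shop` are hypothetical, and the demo writes a tiny hand-made file to a temporary directory instead of reading a real project.

```python
import json
import tempfile
from pathlib import Path


def model_description(manifest_path, model_name):
    """Look up a model's description in a manifest.json file; None if absent or empty."""
    manifest = json.loads(Path(manifest_path).read_text())
    for node in manifest.get("nodes", {}).values():
        if node.get("resource_type") == "model" and node.get("name") == model_name:
            return node.get("description") or None
    return None


# Demo against a hand-written file; in a real project the path would
# typically be target/manifest.json after running 'dbt docs generate'.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "manifest.json"
    path.write_text(json.dumps({
        "nodes": {
            "model.shop.sales": {
                "resource_type": "model",
                "name": "sales",
                "description": "Sales data by region",
            }
        }
    }))
    found = model_description(path, "sales")
    missing = model_description(path, "orders")

print(found)
print(missing)
```

If the lookup returns nothing for a model that was just documented, the usual cause is that the artifact predates the change and needs to be regenerated.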
How does the documentation help users find data?
The documentation site provides searchable descriptions and a lineage graph, so users can search by keyword and see where data comes from. This is shown in execution_table steps 4 through 6.
What happens if we don't add documentation to models?
The docs site still lists each model with its SQL and lineage, but the description fields are empty, so users have little to help them understand or trust the data. Documentation is key to making data discoverable, as shown by the difference between the 'None' starting state and the states after step 1.
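The effect of missing documentation can be made measurable. The following Python sketch computes a simple documentation coverage figure over a hand-written set of manifest-style nodes. The node IDs, the project name `shop`, and the helper `undocumented_models` are hypothetical; the only assumption about dbt is that each node carries `resource_type`, `name`, and a `description` that may be empty.

```python
def undocumented_models(nodes):
    """Return sorted names of models whose description is empty or missing."""
    return sorted(
        node["name"]
        for node in nodes.values()
        if node.get("resource_type") == "model"
        and not (node.get("description") or "").strip()
    )


# Hand-written sample: one documented model, two undocumented ones, one non-model node.
NODES = {
    "model.shop.sales": {
        "resource_type": "model",
        "name": "sales",
        "description": "Sales data by region",
    },
    "model.shop.customers": {
        "resource_type": "model",
        "name": "customers",
        "description": "",
    },
    "model.shop.orders": {
        "resource_type": "model",
        "name": "orders",
    },
    "test.shop.not_null_sales_id": {
        "resource_type": "test",
        "name": "not_null_sales_id",
        "description": "",
    },
}

missing = undocumented_models(NODES)
total = sum(1 for node in NODES.values() if node.get("resource_type") == "model")
coverage = (total - len(missing)) / total

print(f"Undocumented models: {missing}")
print(f"Coverage: {coverage:.0%}")
```

A check like this could run in CI to flag models added without a description, though the threshold to enforce is a team decision.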
Visual Quiz - 3 Questions
Test your understanding
Look at the execution_table, what is the output after running 'dbt docs generate'?
A. Documentation site files
B. Model files saved
C. Local web server running
D. Docs homepage loads
💡 Hint
Check execution_table step 2 output column.
At which step does the user first interact with the documentation site in a browser?
A. Step 3
B. Step 4
C. Step 5
D. Step 6
💡 Hint
Look for 'User opens docs site' in execution_table.
If documentation is missing from the models, how does it affect the variable 'Documentation' in variable_tracker?
A. It shows 'Docs site running'
B. It becomes 'Added to model' anyway
C. It stays 'None' throughout
D. It shows 'Search results filtered'
💡 Hint
Refer to variable_tracker 'Documentation' row start and after step 1.
Concept Snapshot
Why documentation makes data discoverable:
- Add descriptions for dbt models in YAML properties files
- Run 'dbt docs generate' to build docs site
- Run 'dbt docs serve' to view site locally
- Users search and explore docs to find data
- Documentation shows data definitions and lineage
- This makes data easier to find and trust
Full Transcript
This visual execution shows how adding documentation to dbt models leads to generating a documentation site that users can explore. First, a description is added for a model in a YAML properties file. Then, running 'dbt docs generate' creates the documentation site files. Running 'dbt docs serve' starts a local web server to view the docs. Users open the site in a browser, search for data like 'sales', and view data lineage. This process makes data discoverable and trusted. Key moments include understanding why the docs must be regenerated after adding documentation, how docs help users find data, and the impact of missing documentation. The quizzes test understanding of these steps and variable changes. The snapshot summarizes the key steps to make data discoverable with documentation in dbt.