0
0
dbtdata~10 mins

Why governance ensures data trust in dbt - Visual Breakdown

Choose your learning style9 modes available
Concept Flow - Why governance ensures data trust
Define Data Governance Rules
Apply Rules to Data Pipelines
Monitor Data Quality & Compliance
Identify & Fix Issues
Ensure Data is Reliable & Trusted
Data governance sets rules, applies them to data, monitors quality, fixes issues, and ensures data trust.
Execution Sample
dbt
version: 2
models:
  - name: sales
    columns:
      - name: id
        tests:
          - unique
          - not_null
      - name: status
        tests:
          - accepted_values:
              values: ["completed", "pending", "cancelled"]
This dbt model config applies tests to ensure data quality and trust.
Execution Table
StepActionRule AppliedResultData Trust Impact
1Define tests in schema.ymlunique, not_null, accepted_valuesRules ready to applyFoundation for trust
2Run dbt tests on sales modelunique on idPass: all ids uniqueEnsures no duplicates
3Run dbt tests on sales modelnot_null on idPass: no null idsEnsures completeness
4Run dbt tests on sales modelaccepted_values on statusFail: found 'in_progress'Detects invalid data
5Fix data issue: update 'in_progress' to 'pending'Data correctionPass on re-testRestores trust
6Monitor ongoing testsContinuous checksAll tests passingMaintains trust over time
💡 All tests pass after fixing invalid data, ensuring trusted data
Variable Tracker
VariableStartAfter Step 2After Step 4After Step 5Final
Data Quality StatusUndefinedUnique test passedAccepted values test failedData correctedAll tests passed
Key Moments - 3 Insights
Why is it important to define tests before running them?
Defining tests sets clear rules to check data quality, as shown in Step 1 of the execution_table.
What happens when a test fails during data validation?
A failure, like in Step 4, signals bad data that must be fixed to restore trust.
How does fixing data issues affect data trust?
Fixing issues (Step 5) makes tests pass again, which rebuilds and maintains trust.
Visual Quiz - 3 Questions
Test your understanding
Look at the execution_table, what test failed at Step 4?
Aunique on id
Bnot_null on id
Caccepted_values on status
Dno test failed
💡 Hint
Check the 'Rule Applied' and 'Result' columns at Step 4 in execution_table
According to variable_tracker, what is the data quality status after Step 5?
AUnique test passed
BData corrected
CAccepted values test failed
DAll tests passed
💡 Hint
Look at the 'After Step 5' column for 'Data Quality Status' in variable_tracker
If the data issue was not fixed at Step 5, what would happen to data trust?
AData trust would be lost due to failing tests
BData trust would remain high
CData trust would improve automatically
DData trust is unrelated to tests
💡 Hint
Refer to Step 4 failure and Step 5 fix impact in execution_table
Concept Snapshot
Data governance means setting rules to check data quality.
Use tests like unique, not_null, and accepted_values.
Run tests regularly to find bad data.
Fix issues quickly to keep data trusted.
Trusted data helps teams make good decisions.
Full Transcript
Data governance ensures data trust by defining clear rules to check data quality. In dbt, these rules are tests like unique, not_null, and accepted_values. The process starts by defining tests, then running them on data models. If tests pass, data is trusted. If tests fail, it shows data problems that must be fixed. Fixing issues makes tests pass again, restoring trust. Continuous monitoring keeps data reliable over time. This step-by-step approach builds confidence that data is accurate and ready for use.