0
0
dbtdata~20 mins

Why governance ensures data trust in dbt - Challenge Your Understanding

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
Data Governance Mastery
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
2:00remaining
Why is data governance important for trust?

Which of the following best explains why data governance is essential to ensure data trust in an organization?

AIt defines clear rules and responsibilities for data quality and security, so users can rely on the data.
BIt increases the amount of data collected, making datasets larger and more useful.
CIt automates all data processing tasks, removing the need for human oversight.
DIt restricts access to data so only IT staff can view it, preventing misuse.
Attempts:
2 left
💡 Hint

Think about how trust is built when everyone knows who manages data and how it is kept accurate.

data_output
intermediate
2:00remaining
Identify the effect of governance on data quality metrics

Given a dataset with data quality metrics before and after applying governance policies, which metric shows the biggest improvement?

dbt
import pandas as pd

data = {'Metric': ['Completeness', 'Accuracy', 'Timeliness', 'Consistency'],
        'Before': [0.75, 0.80, 0.60, 0.70],
        'After': [0.90, 0.85, 0.85, 0.75]}
df = pd.DataFrame(data)
df['Improvement'] = df['After'] - df['Before']
print(df)
ATimeliness
BAccuracy
CCompleteness
DConsistency
Attempts:
2 left
💡 Hint

Look at the difference between 'After' and 'Before' values for each metric.

🔧 Debug
advanced
2:00remaining
Find the error in this dbt model related to governance enforcement

Which option correctly identifies the error in this dbt SQL model that enforces data governance by filtering only approved records?

dbt
select * from sales_data
where approval_status = 'approved'
  and date > '2023-01-01'
  and region = 'US'
  group by customer_id
AThe WHERE clause should come after GROUP BY, so the order is wrong.
BThe GROUP BY clause is used without aggregation functions, causing a syntax error.
CThe filter on approval_status should use double quotes instead of single quotes.
DThe date comparison should use >= instead of > to include the start date.
Attempts:
2 left
💡 Hint

Check if GROUP BY is used correctly with SELECT columns and aggregation.

🚀 Application
advanced
2:00remaining
How does data governance improve collaboration in dbt projects?

Which option best describes how data governance practices improve collaboration among data teams using dbt?

ABy limiting access to dbt projects so only senior analysts can edit models.
BBy removing the need for version control since governance tracks changes automatically.
CBy automatically generating all dbt models without human input, reducing errors.
DBy enforcing standardized naming conventions and documentation, making models easier to understand and share.
Attempts:
2 left
💡 Hint

Think about how clear rules and documentation help teams work together smoothly.

visualization
expert
3:00remaining
Interpreting a data lineage graph for governance impact

Given a data lineage graph showing sources, transformations, and outputs, which statement best reflects how governance ensures trust in this pipeline?

dbt
import matplotlib.pyplot as plt
import networkx as nx

G = nx.DiGraph()
G.add_edges_from([
    ('Raw Sales', 'Clean Sales'),
    ('Raw Customers', 'Clean Customers'),
    ('Clean Sales', 'Sales Summary'),
    ('Clean Customers', 'Sales Summary'),
    ('Sales Summary', 'Executive Dashboard')
])
pos = nx.spring_layout(G)
nx.draw(G, pos, with_labels=True, node_color='lightblue', node_size=2500, font_size=10, arrowsize=20)
plt.title('Data Lineage Graph')
plt.show()
AGovernance focuses only on the final dashboard, ignoring intermediate transformations.
BGovernance removes the need to document data sources because lineage is automatically inferred.
CGovernance tracks each step from raw data to dashboard, ensuring transparency and easier error tracing.
DGovernance requires manual data entry at each step to verify data correctness.
Attempts:
2 left
💡 Hint

Look at how knowing the path of data helps build trust.