Which of the following best explains why data governance is essential to ensure data trust in an organization?
Think about how trust is built when everyone knows who manages data and how it is kept accurate.
Data governance sets clear policies and roles that ensure data is accurate, secure, and well-managed. This clarity builds trust among users.
Given a dataset with data quality metrics before and after applying governance policies, which metric shows the biggest improvement?
import pandas as pd data = {'Metric': ['Completeness', 'Accuracy', 'Timeliness', 'Consistency'], 'Before': [0.75, 0.80, 0.60, 0.70], 'After': [0.90, 0.85, 0.85, 0.75]} df = pd.DataFrame(data) df['Improvement'] = df['After'] - df['Before'] print(df)
Look at the difference between 'After' and 'Before' values for each metric.
Timeliness improved from 0.60 to 0.85, which is a 0.25 increase, the largest among all metrics.
Which option correctly identifies the error in this dbt SQL model that enforces data governance by filtering only approved records?
select * from sales_data where approval_status = 'approved' and date > '2023-01-01' and region = 'US' group by customer_id
Check if GROUP BY is used correctly with SELECT columns and aggregation.
GROUP BY requires aggregation functions on other columns or only grouping columns in SELECT. Here, SELECT * with GROUP BY causes a syntax error.
Which option best describes how data governance practices improve collaboration among data teams using dbt?
Think about how clear rules and documentation help teams work together smoothly.
Standardized naming and documentation help everyone understand and maintain dbt models, improving teamwork and trust.
Given a data lineage graph showing sources, transformations, and outputs, which statement best reflects how governance ensures trust in this pipeline?
import matplotlib.pyplot as plt import networkx as nx G = nx.DiGraph() G.add_edges_from([ ('Raw Sales', 'Clean Sales'), ('Raw Customers', 'Clean Customers'), ('Clean Sales', 'Sales Summary'), ('Clean Customers', 'Sales Summary'), ('Sales Summary', 'Executive Dashboard') ]) pos = nx.spring_layout(G) nx.draw(G, pos, with_labels=True, node_color='lightblue', node_size=2500, font_size=10, arrowsize=20) plt.title('Data Lineage Graph') plt.show()
Look at how knowing the path of data helps build trust.
Data lineage shows the full path data takes, so governance can ensure each step is correct and transparent, building trust.