What is the primary purpose of data lineage tracking in a Snowflake data warehouse?
Think about why knowing where data comes from and where it goes is important.
Data lineage helps track the origin and movement of data, which is essential for auditing, debugging, and compliance.
Which Snowflake feature is essential to capture detailed data lineage information automatically?
Consider which feature records executed queries and their details.
Snowflake Query History records all queries executed, which can be analyzed to understand data flow and lineage.
Given Snowflake's QUERY_HISTORY view, which SQL query correctly extracts the source tables used in queries for lineage tracking?
SELECT query_id, array_agg(DISTINCT referenced_object_name) AS source_tables FROM snowflake.account_usage.query_history WHERE start_time > CURRENT_TIMESTAMP() - INTERVAL '1 DAY' GROUP BY query_id;Check the exact column names in the QUERY_HISTORY view for referenced tables.
The column referenced_object_name contains the names of objects (tables) referenced by queries. Other column names do not exist in QUERY_HISTORY.
Which Snowflake feature should you use to restrict access to sensitive data lineage metadata to only authorized users?
Think about controlling access to specific rows or data within tables.
Row Access Policies allow fine-grained control over which rows users can see, useful for protecting sensitive lineage metadata.
Snowflake retains query history for 365 days by default. What is the impact on data lineage tracking if this retention period is reduced to 7 days?
Consider how query history retention affects available lineage data.
Reducing query history retention limits the time window for which lineage can be reconstructed, restricting historical lineage analysis.