Overview - Seeds for static reference data
What is it?
Seeds in dbt are simple CSV files that hold static reference data. This data does not change often and is used to enrich or join with other datasets during transformations. Instead of storing this data in a database table manually, dbt loads these CSV files automatically into your data warehouse. This makes managing small, fixed datasets easier and keeps your project organized.
Why it matters
Without seeds, teams often manually create and maintain static reference tables in the database, which can lead to errors and inconsistencies. Seeds automate this process, ensuring that static data is version-controlled, easy to update, and always in sync with your dbt project. This saves time and reduces mistakes when working with important reference data like country codes, product categories, or status lists.
Where it fits
Before learning seeds, you should understand basic dbt project structure and how models work. After seeds, you can explore more advanced dbt features like snapshots and incremental models. Seeds fit early in the data transformation workflow as a foundation for joining static data with dynamic datasets.