In dbt, incremental models process only new or changed data instead of the entire dataset. Why does this approach reduce processing time?
Think about what happens when you only update part of a large dataset.
Incremental models save time because each run's work scales with the volume of new or changed data rather than with the full table size, so the entire dataset is not reprocessed on every run.
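The savings can be sketched with back-of-envelope numbers (the table size, daily growth rate, and 5-day window below are assumptions for illustration, not from the question):

```python
# Hypothetical workload: a 1,000,000-row table that gains
# 10,000 new rows per day, run daily for 5 days.
table_size = 1_000_000
new_rows_per_day = 10_000
days = 5

# A full refresh reprocesses the whole (growing) table every day.
full_refresh_rows = sum(table_size + (day + 1) * new_rows_per_day
                        for day in range(days))
# An incremental run processes only that day's new rows.
incremental_rows = new_rows_per_day * days

print(full_refresh_rows)   # 5150000
print(incremental_rows)    # 50000
```

With these assumed numbers the incremental approach touches roughly 100x fewer rows over the week.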
Given a table with 1000 rows, an incremental model processes 100 new rows. What is the total number of rows after the incremental run?
In an append-style incremental run, new rows are added without deleting or modifying existing ones.
The incremental model appends 100 new rows to the existing 1000 rows, resulting in 1100 rows total.
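The arithmetic can be reproduced with an in-memory SQLite table (the table name and row values here are illustrative, standing in for a real warehouse table):

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("create table target_table (id integer)")

# Existing 1000 rows from previous runs.
con.executemany("insert into target_table values (?)",
                [(i,) for i in range(1000)])

# Incremental run appends 100 new rows without touching the old ones.
con.executemany("insert into target_table values (?)",
                [(i,) for i in range(1000, 1100)])

total = con.execute("select count(*) from target_table").fetchone()[0]
print(total)  # 1100
```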
Consider this simplified incremental model SQL snippet:
select * from source_table where updated_at > (select max(updated_at) from target_table)
What does this query return during an incremental run?
Think about how max(updated_at) in target_table limits new rows.
The query selects only rows from source_table that have a more recent updated_at timestamp than the maximum in target_table, so only new or changed rows are processed.
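The query's behavior can be verified by running the same SQL against in-memory SQLite tables (the integer `updated_at` values are a simplification of real timestamps):

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("create table source_table (id integer, updated_at integer)")
con.execute("create table target_table (id integer, updated_at integer)")

# target_table already holds rows updated through timestamp 3.
con.executemany("insert into target_table values (?, ?)",
                [(1, 1), (2, 2), (3, 3)])
# source_table holds those rows plus two newer ones.
con.executemany("insert into source_table values (?, ?)",
                [(1, 1), (2, 2), (3, 3), (4, 4), (5, 5)])

new_rows = con.execute(
    "select * from source_table "
    "where updated_at > (select max(updated_at) from target_table) "
    "order by updated_at"
).fetchall()
print(new_rows)  # [(4, 4), (5, 5)]
```

Only the rows with `updated_at` past the target's high-water mark come back, which is exactly what the incremental run then processes.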
You have two bar charts showing processing times: one for full refresh runs and one for incremental runs over 5 days. Which chart best represents the time saved by incremental models?
Incremental runs should consistently take less time than full refreshes.
The correct visualization shows full refresh times that are high (and growing as the table grows) while incremental run times stay consistently low, illustrating the time saved.
You manage a large dataset updated daily. Running full refreshes takes hours and costs a lot. Which incremental model strategy will save the most cost and time?
Focus on minimizing data processed daily.
Processing only the new rows each day minimizes the data scanned, which significantly reduces compute time and cost compared to full reloads; skipping transformations entirely is not a viable alternative, since downstream models would go stale.
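A rough cost sketch makes the tradeoff concrete. All numbers here are assumptions for illustration (the scan volumes and a $5-per-TB-scanned price are hypothetical, not from any vendor's pricing):

```python
# Assumed workload: a full refresh scans 0.5 TB per day,
# while an incremental run scans only 0.005 TB of new data.
price_per_tb = 5.0        # hypothetical $/TB scanned
full_tb_per_day = 0.5
incr_tb_per_day = 0.005
days_per_month = 30

monthly_full = full_tb_per_day * days_per_month * price_per_tb
monthly_incr = incr_tb_per_day * days_per_month * price_per_tb

print(monthly_full)  # 75.0
print(monthly_incr)  # 0.75
```

Under these assumptions the incremental strategy cuts the monthly scan cost by two orders of magnitude, mirroring the reduction in rows processed.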