Concept Flow - Bucket pattern for time-series data

Receive time-series data

↓

Group data by time intervals

↓

Create bucket document for each interval

↓

Store multiple data points inside bucket

↓

Query buckets for time range

↓

Extract individual data points from buckets

Data points are grouped into time buckets to store many points in one document, improving write and query efficiency.

Execution Sample

MongoDB

db.sensorData.insertOne({
  bucketStart: ISODate("2024-06-01T00:00:00Z"),
  readings: [
    {time: ISODate("2024-06-01T00:01:00Z"), value: 20},
    {time: ISODate("2024-06-01T00:02:00Z"), value: 21}
  ]
})

Insert a bucket document containing multiple sensor readings within a 1-hour interval.

Execution Table

Step	Action	Data State	Result
1	Receive new data point at 00:01	No buckets yet	Prepare to create bucket for 00:00-01:00
2	Check if bucket for 00:00 exists	No bucket found	Create new bucket document with bucketStart 00:00
3	Insert data point into readings array	Bucket readings empty	Readings array now has 1 data point
4	Receive new data point at 00:02	Bucket for 00:00 exists with 1 reading	Add new data point to readings array
5	Query data for 00:00-01:00	One bucket with 2 readings	Return bucket document with both readings
6	Extract individual readings from bucket	Bucket readings array	Two separate data points available for analysis
7	Receive data point at 01:05	Bucket for 00:00 full, no bucket for 01:00	Create new bucket for 01:00 interval
8	Insert data point into new bucket	New bucket readings empty	Readings array now has 1 data point
9	Query data for 01:00-02:00	One bucket with 1 reading	Return bucket document with that reading
10	End of demonstration	Buckets created and queried	Efficient storage and retrieval achieved

💡 No more data points to process; buckets created and queried successfully.

Variable Tracker

Variable	Start	After Step 3	After Step 4	After Step 7	After Step 8	Final
buckets	{}	{"00:00": [{time: "00:01", value: 20}]}	{"00:00": [{time: "00:01", value: 20}, {time: "00:02", value: 21}]}	{"00:00": [...], "01:00": []}	{"00:00": [...], "01:00": [{time: "01:05", value: "X"}]}	{"00:00": [...], "01:00": [...] }

Key Moments - 3 Insights

Why do we group multiple time-series points into one bucket document?

How do we know which bucket a new data point belongs to?

What happens when a data point falls outside existing buckets?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table, at which step is the first bucket document created?

AStep 3

BStep 1

CStep 2

DStep 4

Concept Snapshot

Bucket pattern groups many time-series points into one document per time interval.
Each bucket document has a start time and an array of readings.
New data points go into the bucket matching their time interval.
Queries retrieve buckets for a time range, then extract points.
This reduces document count and improves performance.

Full Transcript

The bucket pattern for time-series data stores many data points in one document per time interval. When new data arrives, the system checks if a bucket for that time interval exists. If not, it creates one. Then it adds the data point to the bucket's readings array. Queries fetch buckets for a time range and extract individual points. This method improves write and query efficiency by reducing the number of documents handled. The execution table shows steps from receiving data points, creating buckets, inserting readings, to querying buckets. The variable tracker shows how the buckets variable changes as points are added. Key moments clarify why grouping helps, how buckets are chosen, and what happens when data falls outside existing buckets. The visual quiz tests understanding of bucket creation, readings count, and bucket assignment.