Process Flow - Online vs offline feature stores
Data Ingestion
Model Training
Model Serving
Predictions Delivered
Data flows into offline store for training features, then online store holds real-time features for serving models.
1. Ingest raw data 2. Compute batch features -> save to offline store 3. Train model using offline features 4. Compute real-time features -> save to online store 5. Serve model using online features
| Step | Action | Data Location | Purpose | Result |
|---|---|---|---|---|
| 1 | Ingest raw data | Raw data source | Collect data | Data ready for feature computation |
| 2 | Compute batch features | Offline feature store | Prepare training features | Features stored for model training |
| 3 | Train model | Offline feature store | Use batch features | Model trained with historical data |
| 4 | Compute real-time features | Online feature store | Prepare serving features | Features stored for fast access |
| 5 | Serve model | Online feature store | Use real-time features | Predictions delivered quickly |
| 6 | End | - | - | Process complete |
| Variable | Start | After Step 2 | After Step 4 | Final |
|---|---|---|---|---|
| Raw Data | Empty | Available | Available | Available |
| Batch Features | None | Stored in offline store | Stored in offline store | Stored in offline store |
| Real-time Features | None | None | Stored in online store | Stored in online store |
| Model | Untrained | Trained | Trained | Trained |
Offline feature store: stores batch features for training models on historical data. Online feature store: stores real-time features for fast model serving. Training uses offline store; serving uses online store. Both stores keep features consistent but serve different purposes. This separation ensures efficient, low-latency predictions.