Model Pipeline - Text splitters
This pipeline breaks long text into smaller pieces so that each piece fits the model's input and can be processed on its own. The text is split at sentence or paragraph boundaries before processing.
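The splitting step described above can be sketched as follows. This is a minimal illustration, not the pipeline's actual implementation: the function names `split_paragraphs`, `split_sentences`, and `split_text` are hypothetical, paragraphs are assumed to be separated by blank lines, and the sentence rule is a naive regular expression that will mis-split abbreviations such as "e.g.".

```python
import re


def split_paragraphs(text: str) -> list[str]:
    """Split on blank lines and drop empty pieces (assumed paragraph rule)."""
    parts = re.split(r"\n\s*\n", text)
    return [p.strip() for p in parts if p.strip()]


def split_sentences(paragraph: str) -> list[str]:
    """Naive rule: a sentence ends at '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", paragraph.strip())
    return [s for s in parts if s]


def split_text(text: str) -> list[list[str]]:
    """Return one list of sentences per paragraph."""
    return [split_sentences(p) for p in split_paragraphs(text)]


if __name__ == "__main__":
    sample = "First sentence. Second one!\n\nNew paragraph here? Yes."
    for sentences in split_text(sample):
        print(sentences)
```

Splitting by paragraph first keeps sentences from different paragraphs apart, so later stages can decide whether to group text at the paragraph or the sentence level.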
[Figure: loss (vertical axis, 0.1 to 0.5) plotted against epochs (horizontal axis, 1 to 4).]

| Epoch | Loss ↓ | Accuracy ↑ | Observation |
|---|---|---|---|
| 1 | 0.45 | 0.60 | Initial split quality is moderate, some sentences split incorrectly. |
| 2 | 0.30 | 0.75 | Improved splitting rules reduce errors, better sentence boundaries. |
| 3 | 0.20 | 0.85 | Splitting is mostly correct, chunk sizes optimized for model input. |
| 4 | 0.15 | 0.90 | Final tuning reduces overlap and preserves context well. |
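
The table mentions chunk size and overlap. One way these two settings could interact is sketched below, under assumptions that are not from the source: the function name `chunk_sentences`, the character-based size limit `max_chars`, and the sentence-based `overlap` count are all hypothetical choices made for illustration.

```python
def chunk_sentences(sentences: list[str], max_chars: int = 200, overlap: int = 1) -> list[str]:
    """Greedily pack sentences into chunks of roughly max_chars characters.

    The last `overlap` sentences of a finished chunk are repeated at the
    start of the next one so that some context carries over. A single
    sentence longer than max_chars still becomes its own oversized chunk.
    """
    chunks: list[str] = []
    current: list[str] = []
    for sentence in sentences:
        if current and len(" ".join(current + [sentence])) > max_chars:
            chunks.append(" ".join(current))
            carried = current[-overlap:] if overlap > 0 else []
            # Drop the carried context if it would push the new chunk over the limit.
            if len(" ".join(carried + [sentence])) > max_chars:
                carried = []
            current = carried
        current.append(sentence)
    if current:
        chunks.append(" ".join(current))
    return chunks


if __name__ == "__main__":
    demo = ["A b c.", "D e f.", "G h i."]
    print(chunk_sentences(demo, max_chars=13, overlap=1))
    print(chunk_sentences(demo, max_chars=13, overlap=0))
```

A larger overlap preserves more context between neighbouring chunks but repeats more text, so the two settings are usually tuned together.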