Model Pipeline - Text recognition pipeline
This pipeline takes pictures of text and turns them into words you can read on a computer. It first cleans the image, finds the text parts, then reads the letters, and finally gives the text as output.
This pipeline takes pictures of text and turns them into words you can read on a computer. It first cleans the image, finds the text parts, then reads the letters, and finally gives the text as output.
Loss
2.3 |****
1.8 |***
1.4 |**
1.1 |*
0.9 |*
0.8 |*
+---------
Epochs 1-6| Epoch | Loss ↓ | Accuracy ↑ | Observation |
|---|---|---|---|
| 1 | 2.3 | 0.25 | Model starts learning, loss is high, accuracy low |
| 2 | 1.8 | 0.40 | Loss decreases, accuracy improves |
| 3 | 1.4 | 0.55 | Model learns letter shapes better |
| 4 | 1.1 | 0.65 | Better sequence understanding |
| 5 | 0.9 | 0.72 | Model converging, good text recognition |
| 6 | 0.8 | 0.76 | Small improvements, nearing stable performance |