Model Pipeline - Text recognition pipeline
This pipeline takes pictures of text and turns them into words you can read on a computer. It first cleans the image, finds the text parts, then reads the letters, and finally gives the text as output.
Jump into concepts and practice - no test required
This pipeline takes pictures of text and turns them into words you can read on a computer. It first cleans the image, finds the text parts, then reads the letters, and finally gives the text as output.
Loss
2.3 |****
1.8 |***
1.4 |**
1.1 |*
0.9 |*
0.8 |*
+---------
Epochs 1-6| Epoch | Loss ↓ | Accuracy ↑ | Observation |
|---|---|---|---|
| 1 | 2.3 | 0.25 | Model starts learning, loss is high, accuracy low |
| 2 | 1.8 | 0.40 | Loss decreases, accuracy improves |
| 3 | 1.4 | 0.55 | Model learns letter shapes better |
| 4 | 1.1 | 0.65 | Better sequence understanding |
| 5 | 0.9 | 0.72 | Model converging, good text recognition |
| 6 | 0.8 | 0.76 | Small improvements, nearing stable performance |
import pytesseract
from PIL import Image
img = Image.new('RGB', (100, 30), color='white')
text = pytesseract.image_to_string(img)
print(text)