Model Pipeline - Key models overview (GPT, DALL-E, Stable Diffusion)
This pipeline shows how three popular AI models work: GPT for text, DALL-E for images from text, and Stable Diffusion for creating images by gradually improving noise.
This pipeline shows how three popular AI models work: GPT for text, DALL-E for images from text, and Stable Diffusion for creating images by gradually improving noise.
Loss
2.3 |****
1.2 |************
0.7 |******************
0.4 |**********************
1 5 10 20 Epochs| Epoch | Loss ↓ | Accuracy ↑ | Observation |
|---|---|---|---|
| 1 | 2.3 | 0.10 | High loss and low accuracy as model starts learning basic patterns |
| 5 | 1.2 | 0.45 | Loss decreases, accuracy improves as model learns language/image features |
| 10 | 0.7 | 0.70 | Model shows good understanding, generating coherent text or images |
| 20 | 0.4 | 0.85 | Loss low and accuracy high, model converges well on training data |