Introduction
Generating a detailed image from a short text description is a hard problem. People want tools that turn an idea into a picture quickly and faithfully, without having to draw it themselves.
Imagine a statue buried under a thick layer of dust. You brush the dust away bit by bit, guided by a story someone told you about what the statue should look like. Each stroke makes the statue clearer and closer to the story. A diffusion model works in a similar way: it starts from noise and removes it step by step, guided by the text prompt.
┌─────────────────┐
│   Text Prompt   │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│  Latent Space   │
│ Representation  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│    Diffusion    │
│     Process     │
│ (Noise Removal) │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   Final Image   │
└─────────────────┘
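The data flow in the diagram can be sketched as a toy loop in plain Python. This is only an illustration of the four stages, not a real diffusion model. Every name here (encode_prompt, denoise_step, generate) is hypothetical, and the "denoiser" cheats by already knowing the target it is moving toward, whereas a real model uses a trained network to predict the noise to remove.

```python
import hashlib
import random


def encode_prompt(prompt, size=8):
    """Hypothetical stand-in for a text encoder: maps a prompt to a
    deterministic vector of floats in [0, 1)."""
    digest = hashlib.sha256(prompt.encode("utf-8")).digest()
    return [b / 256 for b in digest[:size]]


def denoise_step(latent, target, strength):
    """Move each latent value a fraction of the way toward the target.
    Assumption for illustration only: a real model would instead
    predict the noise with a trained neural network."""
    return [x + strength * (t - x) for x, t in zip(latent, target)]


def generate(prompt, steps=20, strength=0.3, seed=0):
    rng = random.Random(seed)
    # Text prompt -> latent-space representation.
    target = encode_prompt(prompt)
    # Start from pure noise, the dust-covered statue of the analogy.
    latent = [rng.random() for _ in target]
    errors = []
    # Diffusion process: remove a little noise at every step.
    for _ in range(steps):
        latent = denoise_step(latent, target, strength)
        errors.append(max(abs(x - t) for x, t in zip(latent, target)))
    # "Decode" the latent into 8-bit pixel intensities (the final image).
    image = [round(x * 255) for x in latent]
    return image, errors
```

Calling generate("a red fox") returns a list of eight pixel values and a list of per-step errors. In this toy setup the error shrinks by a factor of roughly 1 - strength at every step, which mirrors the idea that each brush stroke brings the statue closer to the story.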