Introduction
Imagine waiting for a long message to arrive all at once. Streaming responses solve this by sending parts of the message as soon as they are ready, so you start seeing the answer without delay.
Imagine watching a movie online that starts playing while the rest is still downloading. You don’t wait for the full movie to download before watching; it streams so you see it bit by bit.
┌───────────────┐ ┌───────────────┐ │ Server │──────▶│ Client │ │ (prepares │ │ (receives and │ │ response) │ │ displays │ │ in chunks) │ │ chunks live) │ └───────────────┘ └───────────────┘ ▲ ▲ │ │ └────────────── Continuous connection ──────────────┘