Introduction
When you have multiple servers handling requests, you want to send new requests to the server with the fewest active connections. This helps balance the load evenly and keeps your app fast and reliable.
When you run a website on several servers and want to avoid overloading any single server.
When some servers are slower and you want to send fewer requests to busy servers.
When you want to improve user experience by reducing wait times during high traffic.
When you want to automatically distribute traffic without manual intervention.
When you want a simple way to balance load without complex health checks.