| Users | Requests per Second (RPS) | Load Balancer Role | Server Count |
|---|---|---|---|
| 100 | 50 | Single server, no load balancer needed | 1 |
| 10,000 | 5,000 | Single load balancer distributes traffic to few servers | 2-4 |
| 1,000,000 | 500,000 | Multiple load balancers with health checks and failover | 100+ |
| 100,000,000 | 50,000,000 | Global load balancers, geo-distribution, CDN integration | Thousands |
Why load balancers distribute traffic in HLD: scalability evidence
At low traffic, one server can handle all requests. As users grow, a single server eventually hits CPU, memory, and connection limits. Without a load balancer, traffic cannot be split across machines, causing slow responses or outright crashes.
Load balancers prevent this by spreading requests evenly, avoiding overload on any one server.
- Horizontal Scaling: Add more servers behind the load balancer to share traffic.
- Health Checks: Load balancers detect unhealthy servers and stop sending traffic to them.
- Session Persistence: Keep user sessions on the same server if needed.
- Global Load Balancing: Use multiple load balancers across regions for geo-distribution.
- Integration with CDN: Offload static content to CDN, reducing load on servers.
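The first two ideas above (horizontal scaling and health checks) can be sketched in a few lines. This is a minimal in-memory round-robin sketch, not a production load balancer; the server names and the `LoadBalancer` class are hypothetical illustrations.

```python
import itertools


class LoadBalancer:
    """Toy round-robin load balancer with manual health marking (illustrative only)."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.healthy = set(self.servers)          # start with all servers healthy
        self._cycle = itertools.cycle(self.servers)

    def mark_unhealthy(self, server):
        # A real health check would probe each server periodically (e.g. HTTP /health).
        self.healthy.discard(server)

    def mark_healthy(self, server):
        self.healthy.add(server)

    def route(self):
        # Walk the round-robin cycle, skipping servers marked unhealthy.
        for _ in range(len(self.servers)):
            server = next(self._cycle)
            if server in self.healthy:
                return server
        raise RuntimeError("no healthy servers available")


lb = LoadBalancer(["s1", "s2", "s3"])
lb.mark_unhealthy("s3")
print(lb.route(), lb.route(), lb.route())  # traffic now alternates between s1 and s2
```

Horizontal scaling in this model is just passing a longer server list to the constructor; the routing logic does not change.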
Assuming 1 server handles ~3000 RPS:
- At 10,000 RPS, need ~4 servers.
- At 500,000 RPS, need ~170 servers.
- Load balancer capacity: a single instance can handle roughly 10,000 concurrent connections.
- Network bandwidth: 1 Gbps = 125 MB/s, so servers and load balancers must have sufficient network capacity.
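The server-count and bandwidth estimates above can be checked with a quick back-of-the-envelope calculation. A sketch, assuming the ~3,000 RPS per-server figure from these notes and a hypothetical 10 KB average response size:

```python
import math

SERVER_CAPACITY_RPS = 3_000  # assumed per-server capacity (from the notes)


def servers_needed(total_rps, per_server_rps=SERVER_CAPACITY_RPS):
    # Round up: a fractional server still requires a whole machine.
    return math.ceil(total_rps / per_server_rps)


def egress_gbps(total_rps, avg_response_kb=10):
    # Rough egress bandwidth: responses dominate; 10 KB average is a hypothetical value.
    bits_per_second = total_rps * avg_response_kb * 1024 * 8
    return bits_per_second / 1e9


print(servers_needed(10_000))    # → 4
print(servers_needed(500_000))   # → 167
print(egress_gbps(500_000))      # ~41 Gbps, far beyond one 1 Gbps link
```

The last number shows why bandwidth, not just CPU, forces traffic to be spread across many servers and load-balancer instances at high scale.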
Suggested answer structure: start by explaining the problem of single-server limits, then introduce load balancers as the solution for distributing traffic, discuss how they improve availability and fault tolerance, and finally cover scaling strategies and monitoring.
Your database handles 1000 QPS. Traffic grows 10x. What do you do first?
Answer: Introduce a load balancer to distribute traffic across multiple application servers or database read replicas, so no single node is overloaded and performance is maintained.