Overview - Least connections

What is it?

Least connections is a load balancing method used by nginx to distribute incoming network traffic. It sends new requests to the server with the fewest active connections at that moment. This helps balance the load evenly across servers, especially when requests take different amounts of time to complete.

Why it matters

Without least connections, traffic might be sent evenly without considering server load, causing some servers to become overloaded while others sit idle. This can slow down applications and cause poor user experience. Least connections helps keep servers busy but not overwhelmed, improving speed and reliability.

Where it fits

Before learning least connections, you should understand basic load balancing concepts and how nginx works as a web server. After this, you can explore advanced load balancing methods, health checks, and scaling strategies in nginx and other tools.

Mental Model

Core Idea

Least connections sends new requests to the server currently handling the fewest active connections to balance load dynamically.

Think of it like...

Imagine a group of cashiers at a supermarket. Instead of everyone joining the shortest line, you always go to the cashier who is currently serving the fewest customers, so no one cashier gets overwhelmed.

┌───────────────┐
│ Incoming Req  │
└──────┬────────┘
       │
       ▼
┌─────────────────────────────┐
│ Check active connections on │
│ all servers                 │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────────────────────┐
│ Select server with least     │
│ active connections          │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────┐
│ Send request│
│ to server   │
└─────────────┘

Build-Up - 7 Steps

1

FoundationWhat is load balancing

Concept: Load balancing spreads network traffic across multiple servers to avoid overload.

When many users visit a website, one server might get too busy and slow down. Load balancing sends each user to different servers to keep things fast and stable.

Result

Traffic is shared among servers, preventing any single server from slowing down.

Understanding load balancing is key to managing traffic and keeping services responsive.

2

FoundationActive connections explained

3

IntermediateLeast connections load balancing method

4

IntermediateConfiguring least connections in nginx

5

IntermediateHow least connections handles uneven request times

6

AdvancedCombining least connections with health checks

7

ExpertLimitations and edge cases of least connections

Under the Hood

nginx tracks the number of active connections for each server in the upstream group. When a new request arrives, it queries these counts and selects the server with the lowest number. This decision happens in real-time for each request, allowing dynamic balancing. The counts increase when a connection starts and decrease when it closes.

Why designed this way?

Least connections was designed to handle uneven request durations better than simple round-robin. Early load balancers sent requests evenly, but slow requests caused some servers to overload. Tracking active connections provides a simple, efficient way to estimate current load without complex metrics.

┌───────────────┐
│ New Request   │
└──────┬────────┘
       │
       ▼
┌─────────────────────────────┐
│ nginx checks active          │
│ connections per server       │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────────────────────┐
│ Select server with least     │
│ active connections          │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────┐
│ Forward    │
│ request to │
│ selected   │
│ server     │
└─────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does least connections always send requests to the server with the fewest total requests ever? Commit yes or no.

Common Belief:Least connections sends requests to the server with the fewest total requests handled historically.

Tap to reveal reality

Quick: Can least connections alone detect if a server is down? Commit yes or no.

Common Belief:Least connections automatically avoids servers that are down or unreachable.

Tap to reveal reality

Quick: Does least connections guarantee perfectly balanced CPU and memory load? Commit yes or no.

Common Belief:Least connections perfectly balances all server resources like CPU and memory.

Tap to reveal reality

Quick: Is least connections always better than round-robin? Commit yes or no.

Common Belief:Least connections is always the best load balancing method.

Tap to reveal reality

Expert Zone

1

Least connections does not account for connection quality or resource intensity; a single connection might be very heavy or light.

2

In high traffic, connection counts can lag due to timing, causing temporary imbalances.

3

Sticky sessions or session persistence can interfere with least connections, requiring careful configuration.

When NOT to use

Avoid least connections when all requests are uniform and short, where round-robin is simpler and effective. Also, if server resource usage varies widely per connection, consider load balancers that use CPU or response time metrics instead.

Production Patterns

In production, least connections is often combined with health checks and weighted servers to handle different capacities. It is used in microservices and API gateways to improve responsiveness under variable load.

Connections

Round-robin load balancing

Alternative load balancing method

Understanding least connections clarifies why round-robin can fail under uneven request durations, highlighting the need for dynamic load awareness.

Health checks in load balancers

Complementary mechanism

Knowing least connections requires health checks to avoid broken servers helps design reliable, fault-tolerant systems.

Queue management in supermarkets

Similar dynamic load distribution

Recognizing how least connections mimics choosing the shortest active queue helps understand dynamic balancing in other systems like customer service.

Common Pitfalls

#1Configuring least connections without health checks

Wrong approach:upstream backend { least_conn; server backend1.example.com; server backend2.example.com; } # No health checks configured

Correct approach:upstream backend { least_conn; server backend1.example.com max_fails=3 fail_timeout=30s; server backend2.example.com max_fails=3 fail_timeout=30s; } # Health checks or fail parameters added

Root cause:Assuming least connections automatically avoids unhealthy servers leads to traffic sent to down servers.

#2Using least connections with sticky sessions without adjustments

Wrong approach:upstream backend { least_conn; server backend1.example.com; server backend2.example.com; } # Sticky sessions enabled elsewhere without coordination

Correct approach:upstream backend { least_conn; server backend1.example.com; server backend2.example.com; } # Sticky sessions configured to respect load balancing or disabled

Root cause:Sticky sessions force requests to the same server, defeating least connections balancing.

#3Misunderstanding least connections as total request count balancing

Wrong approach:Assuming least connections balances based on total requests handled historically, not current active connections.

Correct approach:Recognize least connections balances based on current active connections only.

Root cause:Confusing metrics leads to wrong expectations and troubleshooting errors.

Key Takeaways

Least connections balances load by sending new requests to the server with the fewest active connections right now.

This method adapts dynamically to uneven request times, preventing some servers from becoming overloaded.

nginx enables least connections with the 'least_conn' directive inside the upstream block.

Least connections alone does not check server health; combining it with health checks is essential for reliability.

Understanding its limits helps avoid hidden bottlenecks and choose the right load balancing strategy.