Overview - limit_req_zone and limit_req

What is it?

limit_req_zone and limit_req are two directives in nginx used to control the rate of incoming requests to a server. limit_req_zone defines a shared memory zone to track request rates by a key, such as client IP. limit_req applies the rate limiting rules from that zone to specific locations or servers. Together, they help prevent too many requests from overwhelming the server.

Why it matters

Without rate limiting, a server can be flooded by too many requests, causing slowdowns or crashes. This can happen accidentally or from malicious attacks like denial-of-service. Using limit_req_zone and limit_req helps keep the server stable and responsive by controlling how fast clients can send requests. This protects resources and improves user experience.

Where it fits

Before learning limit_req_zone and limit_req, you should understand basic nginx configuration and how requests are handled. After mastering these directives, you can explore advanced security topics like fail2ban or web application firewalls, and performance tuning with caching and load balancing.

Mental Model

Core Idea

limit_req_zone sets the rules and memory to track request rates, while limit_req enforces those rules on incoming requests to prevent overload.

Think of it like...

It's like a bouncer at a club (limit_req) who checks a guest list (limit_req_zone) to make sure no one enters too fast or too often, keeping the club safe and comfortable.

┌─────────────────────────────┐
│       limit_req_zone         │
│  (defines tracking zone)     │
│  ┌───────────────────────┐  │
│  │ Shared Memory for Keys │  │
│  └───────────────────────┘  │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────────────────────┐
│         limit_req            │
│ (applies rate limiting per  │
│  request using the zone)    │
└─────────────────────────────┘

Build-Up - 7 Steps

1

FoundationUnderstanding Request Flooding Risks

Concept: Why controlling request rates is important for server health.

When many clients send requests too quickly, the server can become overloaded. This causes slow responses or crashes. Rate limiting helps by slowing down or rejecting excessive requests.

Result

You see that uncontrolled requests can harm server performance and availability.

Understanding the problem of request flooding motivates the need for rate limiting controls.

2

FoundationBasic nginx Configuration Structure

3

IntermediateDefining limit_req_zone Directive

4

IntermediateApplying limit_req Directive to Enforce Limits

5

IntermediateChoosing the Right Key for Tracking

6

AdvancedHandling Bursts and Delays in Requests

7

ExpertShared Memory Zone Internals and Performance

Under the Hood

limit_req_zone creates a shared memory area in nginx worker processes to store counters keyed by variables like client IP. Each request increments the counter for its key. limit_req checks these counters on each request, comparing against the configured rate and burst limits. If the request exceeds limits, it is delayed or rejected. The counters use atomic operations to avoid race conditions across workers.

Why designed this way?

nginx is event-driven and multi-process, so shared memory is needed to track request rates globally. Using a shared zone avoids per-worker isolated counters that would be inaccurate. The design balances performance and accuracy by using efficient binary keys and atomic increments. Alternatives like external databases would add latency and complexity.

┌───────────────┐       ┌─────────────────────┐
│ Client Request│──────▶│ nginx Worker Process │
└───────────────┘       └─────────┬───────────┘
                                   │
                                   ▼
                      ┌─────────────────────────┐
                      │ Shared Memory Zone (10m) │
                      │ ┌─────────────────────┐ │
                      │ │ Key: Client IP       │ │
                      │ │ Counter: Request #   │ │
                      │ └─────────────────────┘ │
                      └─────────────────────────┘
                                   │
                                   ▼
                      ┌─────────────────────────┐
                      │ limit_req Checks Limits  │
                      │ Delay or Reject Requests │
                      └─────────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does limit_req_zone alone limit requests or is limit_req also needed? Commit to yes or no.

Common Belief:limit_req_zone by itself limits the request rate.

Tap to reveal reality

Quick: Does burst allow unlimited extra requests during spikes? Commit to yes or no.

Common Belief:burst lets clients send unlimited extra requests temporarily.

Tap to reveal reality

Quick: Is client IP always the best key for rate limiting? Commit to yes or no.

Common Belief:Using client IP as the key always works best for rate limiting.

Tap to reveal reality

Quick: Does increasing zone size always improve performance? Commit to yes or no.

Common Belief:Bigger shared memory zones always improve rate limiting performance.

Tap to reveal reality

Expert Zone

1

The choice of $binary_remote_addr over $remote_addr improves memory efficiency and speed by using a fixed-length binary key.

2

Using burst with nodelay=false smooths traffic by delaying excess requests instead of rejecting, improving user experience under load.

3

Shared memory zones are global across all nginx worker processes, so tuning zone size affects all workers and overall accuracy.

When NOT to use

Do not use limit_req_zone and limit_req for complex user authentication or session-based rate limiting; use external API gateways or application-level controls instead. Also avoid for very high precision per-user limits where external databases or Redis-based rate limiting is better.

Production Patterns

In production, limit_req_zone is often combined with geo-blocking and fail2ban to protect against attacks. Burst and nodelay settings are tuned per application to balance user experience and protection. Zones are sized based on traffic analysis to avoid evictions. Logs and status modules monitor rate limiting effectiveness.

Connections

Token Bucket Algorithm

limit_req uses a variant of the token bucket algorithm to allow bursts and control request rates.

Understanding token bucket helps grasp how burst and rate limits smooth traffic and prevent overload.

Firewall Rate Limiting

Both nginx limit_req and firewall rate limiting control traffic flow but operate at different layers (application vs network).

Knowing firewall rate limiting complements nginx limits for layered defense improves overall security.

Traffic Shaping in Networking

limit_req shapes incoming HTTP traffic similarly to how traffic shaping controls bandwidth in networks.

Recognizing this connection helps understand how smoothing and delaying requests prevent congestion.

Common Pitfalls

#1Defining limit_req_zone but forgetting to apply limit_req in server or location blocks.

Wrong approach:http { limit_req_zone $binary_remote_addr zone=mylimit:10m rate=10r/s; } server { listen 80; location / { # missing limit_req directive } }

Correct approach:http { limit_req_zone $binary_remote_addr zone=mylimit:10m rate=10r/s; } server { listen 80; location / { limit_req zone=mylimit burst=5 nodelay; } }

Root cause:Misunderstanding that limit_req_zone only defines tracking, enforcement requires limit_req.

#2Setting burst too high without nodelay, causing server overload.

Wrong approach:limit_req zone=mylimit burst=100;

Correct approach:limit_req zone=mylimit burst=5 nodelay;

Root cause:Not realizing burst allows extra requests that can overwhelm the server if set too large.

#3Using $remote_addr instead of $binary_remote_addr, causing inefficient memory use.

Wrong approach:limit_req_zone $remote_addr zone=mylimit:10m rate=10r/s;

Correct approach:limit_req_zone $binary_remote_addr zone=mylimit:10m rate=10r/s;

Root cause:Not knowing $binary_remote_addr is a more efficient binary representation for keys.

Key Takeaways

limit_req_zone defines a shared memory area to track request rates by a key like client IP.

limit_req applies the rate limiting rules from the zone to specific server locations, controlling request flow.

Burst and nodelay settings let you balance between allowing traffic spikes and protecting server stability.

Choosing the right key for tracking is critical to effective and fair rate limiting.

Understanding the shared memory internals helps optimize performance and avoid common scaling issues.