| Users | Requests per Second (RPS) | API Gateway Load | Microservices Load | Network Traffic | Notes |
|---|---|---|---|---|---|
| 100 users | ~50 RPS | Single instance handles easily | Microservices handle requests directly | Low | Simple setup, no scaling needed |
| 10,000 users | ~5,000 RPS | API Gateway needs horizontal scaling | Microservices start to see increased load | Moderate | Introduce load balancer, caching at gateway |
| 1,000,000 users | ~500,000 RPS | Multiple API Gateway instances behind LB | Microservices require scaling and partitioning | High | Use caching, rate limiting, and circuit breakers |
| 100,000,000 users | ~50,000,000 RPS | Global distributed API Gateways with CDN | Microservices sharded and geo-distributed | Very High | Advanced routing, edge caching, and autoscaling |
## API Gateway Pattern in Microservices: Scalability & System Analysis
The API Gateway becomes the first bottleneck as it handles all incoming requests. At moderate to high traffic (around 10,000 users or 5,000 RPS), a single gateway instance struggles with CPU and network limits. This causes increased latency and potential request drops.
- Horizontal Scaling: Add multiple API Gateway instances behind a load balancer to distribute traffic evenly.
- Caching: Implement response caching at the gateway to reduce calls to microservices.
- Rate Limiting: Protect backend services by limiting request rates per user or IP.
- Sharding Microservices: Partition microservices by user region or function to reduce load.
- CDN Integration: Use Content Delivery Networks for static content and edge caching to reduce gateway load.
- Circuit Breakers: Prevent cascading failures by stopping calls to failing microservices.
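Rate limiting at the gateway is commonly implemented as a per-client token bucket. Below is a minimal in-process sketch; the class and function names are illustrative, and a real gateway would back the bucket state with a shared store such as Redis rather than a local dict:

```python
import time


class TokenBucket:
    """Per-client token bucket: allows `rate` requests/sec with bursts up to `capacity`."""

    def __init__(self, rate: float, capacity: float):
        self.rate = rate            # tokens refilled per second
        self.capacity = capacity    # maximum burst size
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        # Refill tokens in proportion to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False  # caller should respond with HTTP 429 Too Many Requests


# One bucket per client key (e.g. user ID or IP), held by the gateway.
buckets: dict[str, TokenBucket] = {}

def check(client_id: str) -> bool:
    bucket = buckets.setdefault(client_id, TokenBucket(rate=5, capacity=10))
    return bucket.allow()
```

The burst capacity lets well-behaved clients send short spikes without being throttled, while the refill rate bounds sustained load on the backend services.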
- At 10,000 users (~5,000 RPS):
  - API Gateway CPU: ~50% target utilization per instance (assuming 2,000 RPS capacity per instance)
  - Network bandwidth: ~4 Gbps (5,000 RPS × 100 KB per request/response ≈ 500 MB/s)
  - Storage: minimal at the gateway; microservices storage depends on the data model
- At 1,000,000 users (~500,000 RPS):
  - API Gateways: ~250 instances at full rated capacity (500,000 RPS / 2,000 RPS per instance); provision more for headroom
  - Network bandwidth: ~400 Gbps (500,000 RPS × 100 KB ≈ 50 GB/s)
  - Microservices require database scaling (read replicas, sharding) and caching layers
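The estimates above can be reproduced with a short back-of-envelope calculation. The 2,000 RPS per instance and 100 KB per exchange figures are the same assumptions used in these notes, not measured values:

```python
import math


def capacity_estimate(rps: float, rps_per_instance: float = 2_000,
                      kb_per_exchange: float = 100) -> dict:
    """Back-of-envelope gateway sizing: instance count and network bandwidth."""
    instances = math.ceil(rps / rps_per_instance)   # at full rated capacity
    gbps = rps * kb_per_exchange * 8 / 1_000_000    # KB/s -> kilobits/s -> Gbps
    return {"instances": instances, "bandwidth_gbps": round(gbps, 1)}


print(capacity_estimate(5_000))    # 10,000-user tier
print(capacity_estimate(500_000))  # 1,000,000-user tier
```

Swapping in different per-instance throughput or payload sizes shows how sensitive the instance count and bandwidth budget are to those assumptions.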
Start by explaining the role of the API Gateway as a single entry point. Discuss how it simplifies client interactions but can become a bottleneck. Then, outline scaling steps: horizontal scaling, caching, rate limiting, and microservices partitioning. Always justify why each step is needed based on traffic growth.
Your database handles 1,000 QPS. Traffic grows 10× to 10,000 QPS. What do you do first?
Answer: Add read replicas and implement caching to reduce direct database load before scaling vertically or sharding.
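That answer can be sketched as a cache-aside read path: serve hot reads from a cache and send misses to a read replica. This is a minimal in-memory sketch; the `cache` dict and `read_from_replica` function are hypothetical stand-ins for Redis and a replica connection pool:

```python
import time

# In-memory stand-in for a shared cache such as Redis.
cache: dict[str, tuple[float, str]] = {}
CACHE_TTL_SECONDS = 60


def read_from_replica(key: str) -> str:
    """Hypothetical read against a replica; stands in for a real DB query."""
    return f"row-for-{key}"


def get_user(key: str) -> str:
    """Cache-aside read: serve from cache, fall back to a read replica on miss."""
    entry = cache.get(key)
    if entry is not None:
        expires_at, value = entry
        if time.monotonic() < expires_at:
            return value                    # cache hit: no database load
    value = read_from_replica(key)          # cache miss: one replica read
    cache[key] = (time.monotonic() + CACHE_TTL_SECONDS, value)
    return value
```

With a reasonable hit rate, the cache absorbs most of the 10× read growth before any vertical scaling or sharding is needed, which is why it comes first.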