
REST API between services in Microservices - Scalability & System Analysis

Scalability Analysis - REST API between services
Growth Table: REST API Between Services
Users/Traffic     | API Requests/sec       | Latency              | Service Instances                          | Network Load | Data Volume
100 users         | ~50-100 RPS            | Low (10-50 ms)       | 1-2 per service                            | Low          | Small
10,000 users      | ~5,000-10,000 RPS      | Moderate (50-100 ms) | 3-5 per service                            | Moderate     | Medium
1,000,000 users   | ~500,000-1,000,000 RPS | Higher (100-200 ms)  | 10+ per service, autoscaling               | High         | Large
100,000,000 users | ~50,000,000+ RPS       | High (200 ms+)       | Hundreds of instances, global distribution | Very High    | Very Large
First Bottleneck

At low scale, the first bottleneck is usually the API gateway or load balancer handling incoming REST calls. As traffic grows, the service instances' CPU and memory become the bottleneck, since every request must be parsed, processed, and serialized. At medium scale, network bandwidth between services can limit throughput. At very large scale, the database or other stateful dependencies accessed via REST APIs become the main bottleneck.
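This progression can be sketched as a toy capacity model: give each tier an assumed throughput ceiling and ask which one saturates first as traffic grows. The tier names and the RPS limits below are illustrative assumptions, not benchmarks.

```python
# Toy capacity model: which tier saturates first as traffic grows?
# All capacity numbers are illustrative assumptions, not measured limits.
TIER_CAPACITY_RPS = {
    "api_gateway": 20_000,   # single gateway / load balancer
    "service_cpu": 50_000,   # combined CPU budget of service instances
    "network": 150_000,      # inter-service bandwidth budget
    "database": 80_000,      # stateful backend dependency
}

def first_bottleneck(traffic_rps: int):
    """Return the first tier pushed over capacity, or None if all have headroom."""
    saturated = {t: cap for t, cap in TIER_CAPACITY_RPS.items() if traffic_rps > cap}
    if not saturated:
        return None
    # The tier with the smallest capacity is the first to saturate.
    return min(saturated, key=saturated.get)

print(first_bottleneck(10_000))   # None: everything has headroom
print(first_bottleneck(60_000))   # api_gateway saturates first
```

Scaling a tier (e.g. adding gateway replicas) just raises its entry in the table, after which the next-smallest ceiling becomes the new bottleneck.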

Scaling Solutions
  • Horizontal scaling: Add more service instances behind load balancers to distribute REST API calls.
  • API Gateway optimization: Use caching, rate limiting, and request aggregation to reduce load.
  • Asynchronous communication: Use message queues or event streams to reduce synchronous REST calls.
  • Service partitioning: Split services by domain or function to reduce inter-service calls.
  • Network improvements: Use faster network links, service mesh with optimized routing.
  • Database scaling: Use read replicas, caching layers, and sharding to reduce backend bottlenecks.
  • CDN: For REST APIs serving static or cacheable content, use CDN to offload traffic.
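The asynchronous-communication point above can be sketched with an in-process queue standing in for a real broker such as RabbitMQ or Kafka (the service names and event shape are hypothetical): the producer publishes an event and returns immediately instead of blocking on a synchronous REST call.

```python
# Sketch: decoupling services with a queue instead of a synchronous REST call.
# queue.Queue is a stand-in for a real message broker; names are illustrative.
import queue
import threading

events = queue.Queue()
processed = []

def order_service(order_id: int) -> None:
    # Publish an event and return immediately -- no blocking REST call
    # to the notification service.
    events.put({"type": "order_created", "order_id": order_id})

def notification_worker() -> None:
    # Consumer drains events at its own pace, decoupled from request latency.
    while True:
        event = events.get()
        if event is None:  # sentinel to stop the worker
            break
        processed.append(event["order_id"])

worker = threading.Thread(target=notification_worker)
worker.start()
for oid in range(3):
    order_service(oid)
events.put(None)
worker.join()
print(processed)  # [0, 1, 2]
```

The trade-off named in the list applies here: the producer gains latency and fault isolation, but delivery becomes eventually consistent rather than immediate.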
Back-of-Envelope Cost Analysis

Assuming 10,000 users generating 10,000 RPS:

  • Each instance handles ~3,000 RPS, so 10,000 / 3,000 ≈ 3.3, rounded up to ~4 instances per service.
  • Network bandwidth: 10,000 RPS * 1 KB/request = ~10 MB/s per service.
  • Storage: Logs and metrics grow with traffic; plan for scalable storage.
  • CPU and memory scale linearly with request volume; monitor and autoscale.
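The arithmetic above can be checked with a few lines of Python; the 3,000 RPS per instance and 1 KB per request figures are the same assumptions used in the bullets.

```python
# Back-of-envelope sizing for 10,000 RPS.
# Assumptions (from the bullets above): 3,000 RPS per instance, 1 KB/request.
import math

RPS_TOTAL = 10_000
RPS_PER_INSTANCE = 3_000
REQUEST_KB = 1

instances = math.ceil(RPS_TOTAL / RPS_PER_INSTANCE)
bandwidth_mb_s = RPS_TOTAL * REQUEST_KB / 1_000  # KB/s -> MB/s

print(instances)       # 4 instances per service
print(bandwidth_mb_s)  # 10.0 MB/s per service
```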
Interview Tip

Structure your scalability discussion by:

  1. Defining expected traffic and usage patterns.
  2. Identifying bottlenecks at each scale step.
  3. Proposing targeted solutions for each bottleneck.
  4. Considering trade-offs like cost, complexity, and latency.
  5. Discussing monitoring and autoscaling strategies.
Self Check

Your database handles 1000 QPS. Traffic grows 10x. What do you do first?

Answer: Add read replicas and implement caching to reduce direct database load before scaling vertically or sharding.
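A minimal read-through cache illustrates why caching is the first move: repeated reads of the same key hit the cache instead of the database. Here `db_query`, the TTL, and the in-memory dict are all stand-ins for a real database call and a cache such as Redis.

```python
# Minimal read-through cache sketch. `db_query` is a hypothetical stand-in
# for a real database read; the TTL and dict storage are illustrative.
import time

TTL_SECONDS = 60
cache = {}      # key -> (timestamp, value)
db_calls = 0

def db_query(key: str) -> str:
    global db_calls
    db_calls += 1
    return f"value-for-{key}"

def get(key: str) -> str:
    now = time.monotonic()
    hit = cache.get(key)
    if hit is not None and now - hit[0] < TTL_SECONDS:
        return hit[1]              # cache hit: no database load
    value = db_query(key)          # cache miss: one database read
    cache[key] = (now, value)
    return value

for _ in range(1000):
    get("user:42")
print(db_calls)  # 1 -- the other 999 reads were absorbed by the cache
```

With a hot working set, most of the 10x traffic growth lands on the cache, keeping database QPS near its original level while replicas or sharding are planned.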

Key Result
REST APIs between services scale well with horizontal service instances and load balancing, but the first bottleneck is usually service CPU and network bandwidth. Database and backend dependencies become bottlenecks at large scale, requiring caching, read replicas, and sharding.