Bird
Raised Fist0
Microservicessystem_design~10 mins

gRPC for internal communication in Microservices - Scalability & System Analysis

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Scalability Analysis - gRPC for internal communication
Growth Table: gRPC for Internal Communication
ScaleUsers / ServicesTraffic CharacteristicsInfrastructure ChangesLatency & Throughput
100 users~10 microservicesLow request rate, simple RPC callsSingle cluster, basic load balancingLow latency, high throughput easily handled
10K users~50 microservicesModerate RPC calls, increased concurrencyMultiple instances per service, service discovery neededLatency stable, throughput requires connection pooling
1M users~200 microservicesHigh RPC volume, bursty traffic patternsHorizontal scaling, advanced load balancing, circuit breakersLatency sensitive, throughput near single node limits
100M users500+ microservicesMassive RPC calls, global distributionMulti-region clusters, sharded service registries, CDN for static contentLatency optimized with retries, throughput requires partitioning
First Bottleneck

The first bottleneck is usually the network bandwidth and connection limits on the gRPC servers. Each server can handle around 1000-5000 concurrent connections. As the number of microservices and RPC calls grow, the servers may run out of available connections or CPU resources to handle serialization/deserialization of protobuf messages.

Scaling Solutions
  • Horizontal scaling: Add more instances of microservices behind load balancers to distribute RPC calls.
  • Connection pooling: Reuse gRPC connections to reduce overhead and improve throughput.
  • Load balancing: Use client-side or service mesh load balancing to evenly distribute requests.
  • Service discovery: Implement dynamic discovery to route calls efficiently.
  • Circuit breakers and retries: Prevent cascading failures and improve resilience.
  • Compression: Enable gRPC message compression to reduce bandwidth usage.
  • Sharding services: Partition services by function or data to reduce cross-service calls.
  • Use of service mesh: Tools like Istio or Linkerd can manage traffic, retries, and observability.
Back-of-Envelope Cost Analysis

Assuming 1M users generating 10 RPC calls per second on average:

  • Total RPC calls per second: 10M QPS
  • Each server handles ~3000 concurrent connections and ~5000 QPS
  • Number of servers needed: ~2000 instances (10M / 5000)
  • Network bandwidth per server: Assuming 1KB per RPC, 5000 QPS = ~5MB/s (~40Mbps)
  • Total bandwidth: 10M QPS * 1KB = ~10GB/s (~80Gbps)
  • Storage: Mostly ephemeral, but logs and metrics storage grows with traffic
Interview Tip

Start by explaining the typical load and traffic patterns for gRPC in microservices. Identify the first bottleneck clearly (usually network or CPU on servers). Then discuss practical scaling solutions like horizontal scaling, connection pooling, and service mesh. Always justify why each solution fits the bottleneck. End with cost and complexity trade-offs.

Self Check

Question: Your database handles 1000 QPS. Traffic grows 10x. What do you do first?

Answer: Since the database is the bottleneck, first add read replicas to distribute read traffic and implement caching to reduce load. For writes, consider sharding or write optimization.

Key Result
gRPC scales well with horizontal service instances and connection pooling, but network bandwidth and server CPU become bottlenecks at high traffic. Using service mesh and sharding helps maintain low latency and high throughput.

Practice

(1/5)
1. What is the main advantage of using gRPC for internal communication between microservices?
easy
A. It requires no predefined message formats.
B. It provides fast, efficient, and strongly typed communication.
C. It only works with services written in the same language.
D. It uses plain text messages for easy debugging.

Solution

  1. Step 1: Understand gRPC communication benefits

    gRPC uses Protocol Buffers which are compact and strongly typed, making communication fast and reliable.
  2. Step 2: Compare with other options

    Options B, C, and D are incorrect because gRPC requires predefined message formats, supports multiple languages, and uses binary messages, not plain text.
  3. Final Answer:

    It provides fast, efficient, and strongly typed communication. -> Option B
  4. Quick Check:

    gRPC speed and typing [OK]
Hint: gRPC is fast and typed, unlike plain text or language-specific methods [OK]
Common Mistakes:
  • Thinking gRPC uses plain text messages
  • Assuming gRPC works only with one language
  • Believing gRPC needs no message definitions
2. Which of the following is the correct way to define a gRPC service method in a .proto file?
easy
A. method GetUser returns UserResponse(UserRequest);
B. service GetUser { rpc UserRequest returns UserResponse; }
C. rpc GetUser (UserRequest) returns (UserResponse);
D. function GetUser(UserRequest): UserResponse;

Solution

  1. Step 1: Recall gRPC .proto syntax

    In .proto files, service methods are defined using the syntax: rpc MethodName (RequestType) returns (ResponseType);
  2. Step 2: Validate options

    rpc GetUser (UserRequest) returns (UserResponse); matches the correct syntax. Options B, C, and D do not follow the .proto syntax for defining rpc methods.
  3. Final Answer:

    rpc GetUser (UserRequest) returns (UserResponse); -> Option C
  4. Quick Check:

    .proto rpc syntax [OK]
Hint: Remember: rpc Method(Request) returns (Response); in .proto files [OK]
Common Mistakes:
  • Using 'service' keyword incorrectly for methods
  • Confusing method syntax with programming language functions
  • Omitting parentheses around request and response types
3. Given the following gRPC client call in Python, what will be the output if the server returns a UserResponse with name='Alice' and age=30?
response = stub.GetUser(UserRequest(id=123))
print(f"Name: {response.name}, Age: {response.age}")
medium
A. Name: Alice, Age: 30
B. Name: 123, Age: 0
C. Name: , Age:
D. Error: stub.GetUser is not a function

Solution

  1. Step 1: Understand the client call and server response

    The client calls GetUser with id=123. The server responds with UserResponse containing name='Alice' and age=30.
  2. Step 2: Analyze the print statement output

    The print statement accesses response.name and response.age, so it will output the values returned by the server.
  3. Final Answer:

    Name: Alice, Age: 30 -> Option A
  4. Quick Check:

    Client prints server response fields [OK]
Hint: Client prints server response fields directly as returned [OK]
Common Mistakes:
  • Assuming client sends back request data instead of server response
  • Confusing method call syntax causing errors
  • Expecting empty or default values without server response
4. A developer wrote this gRPC service definition but the client fails to connect:
service UserService {
  rpc GetUser UserRequest returns UserResponse;
}
What is the error in this definition?
medium
A. Missing parentheses around request and response types.
B. Service name should be lowercase.
C. rpc keyword should be capitalized as RPC.
D. UserRequest and UserResponse must be strings.

Solution

  1. Step 1: Check gRPC method syntax in .proto

    The correct syntax requires parentheses around request and response types: rpc MethodName (RequestType) returns (ResponseType);
  2. Step 2: Identify the error in the given code

    The code misses parentheses around UserRequest and UserResponse, causing client connection failure.
  3. Final Answer:

    Missing parentheses around request and response types. -> Option A
  4. Quick Check:

    Parentheses required in rpc method signature [OK]
Hint: Always use parentheses around request and response in rpc methods [OK]
Common Mistakes:
  • Ignoring parentheses in rpc method definitions
  • Thinking service names must be lowercase
  • Misunderstanding rpc keyword casing rules
5. You have multiple microservices written in different languages that need to communicate internally with low latency and strict message contracts. Which approach best fits this scenario?
hard
A. Use REST APIs with JSON for all communication.
B. Use message queues with XML messages.
C. Use plain TCP sockets with custom binary protocol.
D. Use gRPC with Protocol Buffers for internal communication.

Solution

  1. Step 1: Analyze requirements for low latency and strict contracts

    Low latency and strict message contracts require efficient, strongly typed communication.
  2. Step 2: Evaluate communication options

    REST with JSON is flexible but slower and less strict. Plain TCP with custom protocol is complex and error-prone. Message queues add latency and XML is verbose. gRPC with Protocol Buffers is designed for efficient, strongly typed, multi-language communication.
  3. Final Answer:

    Use gRPC with Protocol Buffers for internal communication. -> Option D
  4. Quick Check:

    Low latency + strict contracts = gRPC [OK]
Hint: gRPC + Protobuf = fast, typed, multi-language communication [OK]
Common Mistakes:
  • Choosing REST despite latency and typing needs
  • Using custom protocols without standard tooling
  • Ignoring message size and parsing overhead