| Users / Traffic | What Changes? |
|---|---|
| 100 users | Single microservice version runs; parallel running not needed. |
| 10,000 users | Start parallel running new microservice version alongside old for testing and smooth transition. |
| 1,000,000 users | Multiple parallel instances of old and new versions run; traffic split carefully; monitoring and rollback mechanisms critical. |
| 100,000,000 users | Parallel running at scale requires automated deployment, canary releases, feature flags; orchestration tools manage many versions and services. |
Parallel running in Microservices - Scalability & System Analysis
Start learning this pattern below
Jump into concepts and practice - no test required
The first bottleneck in parallel running is the increased resource usage on servers and network. Running multiple versions simultaneously doubles or triples CPU, memory, and bandwidth needs. This can overwhelm application servers and increase latency if not managed well.
- Horizontal scaling: Add more servers or containers to distribute load of parallel versions.
- Load balancing: Use smart load balancers to route traffic between versions efficiently.
- Feature flags and canary releases: Gradually shift traffic to new versions to reduce risk and resource spikes.
- Resource isolation: Use container orchestration (e.g., Kubernetes) to allocate resources per version and avoid interference.
- Monitoring and auto-scaling: Track resource usage and scale instances automatically to handle load.
Assuming 1 server handles ~3000 concurrent connections:
- At 10,000 users, running 2 versions in parallel needs ~7 servers (10,000 users * 2 versions / 3000 users per server).
- At 1,000,000 users, parallel running 2 versions requires ~667 servers.
- Network bandwidth doubles with parallel running; if each user request is 100KB, 1M users generate ~100GB/s total traffic.
- Storage for logs and metrics also doubles; plan for increased disk and database capacity.
When discussing parallel running scalability, start by explaining why parallel running is used (safe upgrades, testing). Then identify the resource overhead as the first bottleneck. Next, describe how horizontal scaling and orchestration tools help manage multiple versions. Finally, mention monitoring and gradual rollout strategies to minimize risk and cost.
Your database handles 1000 QPS. Traffic grows 10x due to parallel running of new microservice version. What do you do first?
Answer: Add read replicas and implement caching to reduce load on the primary database before scaling application servers. This addresses the database bottleneck caused by increased queries from parallel versions.
Practice
parallel running in microservices?Solution
Step 1: Understand the concept of parallel running
Parallel running means running old and new systems side by side to compare their outputs and ensure the new system works correctly.Step 2: Identify the purpose in microservices
This approach helps catch errors and ensures a smooth transition before fully switching to the new system.Final Answer:
To run old and new systems together to ensure smooth transition -> Option AQuick Check:
Parallel running = run old and new systems together [OK]
- Thinking parallel running means immediate replacement
- Confusing parallel running with running unrelated services
- Assuming old system is discarded immediately
Solution
Step 1: Understand deployment in parallel running
Parallel running requires both old and new versions to run simultaneously to compare results.Step 2: Identify correct routing method
Routing a copy of requests to both versions allows output comparison without disrupting users.Final Answer:
Deploy new microservice version alongside old one and route a copy of requests to both -> Option AQuick Check:
Parallel running = deploy both and route requests to both [OK]
- Stopping old service before testing new one
- Ignoring logs from old service
- Running new service only at specific times
Solution
Step 1: Understand output comparison in parallel running
Parallel running compares outputs to detect discrepancies between old and new services.Step 2: Decide action on output mismatch
If outputs differ, the system should log the difference and alert engineers to investigate before switching fully.Final Answer:
Log the difference and alert engineers for investigation -> Option DQuick Check:
Output mismatch = log and alert [OK]
- Ignoring output differences
- Stopping old service too early
- Switching back permanently without investigation
Solution
Step 1: Analyze routing in parallel running
For parallel running, requests must be routed to both old and new services simultaneously.Step 2: Identify why new service gets no requests
If new service never receives requests, routing likely sends all traffic only to old service.Final Answer:
The routing logic is only sending requests to the old service -> Option BQuick Check:
No requests to new service = routing issue [OK]
- Assuming new service crashed without checking logs
- Blaming old service logs
- Thinking speed affects request routing
Solution
Step 1: Understand gradual traffic shifting in parallel running
Gradually increasing traffic to the new service while comparing outputs reduces risk and performance impact.Step 2: Evaluate options for safety and performance
Routing a small portion initially and increasing after validation balances safety and system load.Final Answer:
Route 10% of traffic to new service and 90% to old service, compare outputs, then gradually increase new service traffic -> Option CQuick Check:
Gradual traffic shift with output comparison = safe and performant [OK]
- Switching 100% traffic immediately
- Skipping output comparison
- Stopping old service too early
