Bird
Raised Fist0
LLDsystem_design~10 mins

Delivery agent assignment in LLD - Scalability & System Analysis

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Scalability Analysis - Delivery agent assignment
Growth Table: Delivery Agent Assignment System
ScaleUsers (Orders/Day)Delivery AgentsKey Changes
Small10020Simple assignment logic, single server, direct DB queries
Medium10,0002,000Introduce caching, load balancing, asynchronous assignment
Large1,000,000200,000Microservices, sharded DB, geo-partitioning, message queues
Very Large100,000,00020,000,000Global distributed system, advanced sharding, AI-based assignment, CDN for static data
First Bottleneck

At small to medium scale, the database becomes the first bottleneck. This is because the system needs to quickly find available delivery agents and update their status for each order. The database handles many read and write queries per second, and as orders grow, it struggles to keep up.

Scaling Solutions
  • Database Read Replicas: Use replicas to handle read queries, reducing load on the primary database.
  • Caching: Cache delivery agent availability and location data to reduce frequent DB hits.
  • Load Balancing: Distribute incoming assignment requests across multiple application servers.
  • Asynchronous Processing: Use message queues to decouple order intake from assignment processing.
  • Sharding: Partition the database by geography or agent ID to spread load.
  • Microservices: Separate assignment logic into dedicated services for better scalability.
  • Geo-Partitioning: Assign orders to agents within the same region to reduce latency and data scope.
  • AI-based Assignment: Use machine learning to optimize agent selection and reduce assignment time.
Back-of-Envelope Cost Analysis
  • At 10,000 orders/day (~7 orders/min), assuming each order triggers 5 DB queries, total ~35 queries/min (~0.6 QPS) - easily handled by a single DB.
  • At 1,000,000 orders/day (~700 orders/min), queries rise to ~3500 QPS, nearing a single DB instance limit.
  • Storage: Each order record ~1 KB, 1M orders/day = ~1 GB/day storage; 100M orders = 100 GB/day, requiring archival strategies.
  • Network bandwidth: Assuming 1 KB per request/response, 1M orders/day = ~12 MB/s peak, manageable with standard infrastructure.
Interview Tip

Start by clarifying the scale and requirements. Discuss the current bottleneck and how it changes with growth. Propose incremental solutions: caching, load balancing, then sharding and microservices. Always justify why each step is needed and how it helps scalability.

Self Check

Your database handles 1000 QPS. Traffic grows 10x to 10,000 QPS. What do you do first?

Answer: Introduce read replicas and caching to reduce load on the primary database before considering sharding or adding more servers.

Key Result
The database is the first bottleneck as order volume grows; scaling solutions start with caching and read replicas, then move to sharding and microservices for very large scale.

Practice

(1/5)
1.

What is the primary goal of a delivery agent assignment system?

easy
A. Match orders to available delivery agents nearby
B. Track payment details of customers
C. Manage inventory in warehouses
D. Calculate delivery charges for orders

Solution

  1. Step 1: Understand the system purpose

    The delivery agent assignment system focuses on connecting orders with delivery agents who can fulfill them.
  2. Step 2: Identify the main function

    Matching orders to free agents nearby ensures timely delivery and efficient resource use.
  3. Final Answer:

    Match orders to available delivery agents nearby -> Option A
  4. Quick Check:

    Delivery agent assignment = Matching orders to agents [OK]
Hint: Focus on matching orders to agents, not payments or inventory [OK]
Common Mistakes:
  • Confusing delivery assignment with payment processing
  • Thinking inventory management is part of agent assignment
  • Assuming delivery charges calculation is the main goal
2.

Which data structure is best to quickly find the nearest free delivery agent for an order?

easy
A. Priority queue sorted by distance from order location
B. Stack of agents in order of registration
C. Hash map keyed by agent ID
D. Linked list of all agents

Solution

  1. Step 1: Identify the need for sorting by distance

    To find the nearest free agent, sorting agents by their distance to the order location is essential.
  2. Step 2: Choose a data structure supporting efficient nearest retrieval

    A priority queue can efficiently provide the closest agent by always giving the smallest distance first.
  3. Final Answer:

    Priority queue sorted by distance from order location -> Option A
  4. Quick Check:

    Nearest agent search = Priority queue [OK]
Hint: Use priority queue for nearest-first retrieval [OK]
Common Mistakes:
  • Using hash map which doesn't sort by distance
  • Using stack or linked list which are inefficient for nearest search
  • Ignoring the need to sort by distance
3.

Consider this pseudocode for assigning an agent:
for agent in agents:
  if agent.status == 'free' and distance(agent, order) < min_distance:
    min_distance = distance(agent, order)
    assigned_agent = agent
return assigned_agent

What will this code return if all agents are busy?

medium
A. The closest free agent
B. None or null
C. The first agent in the list
D. An error due to undefined variable

Solution

  1. Step 1: Check variable initializations

    The code does not initialize assigned_agent or min_distance before the loop.
  2. Step 2: Trace execution when all agents are busy

    The if condition's first part (agent.status == 'free') fails for all agents, so due to short-circuit evaluation of 'and', the second part (distance < min_distance) is never evaluated. The loop ends without ever setting assigned_agent. Returning an uninitialized assigned_agent causes an error due to undefined variable.
  3. Final Answer:

    An error due to undefined variable -> Option D
  4. Quick Check:

    No initialization + all busy = undefined variable error [OK]
Hint: No variable initialization leads to undefined variable error [OK]
Common Mistakes:
  • Thinking assigned_agent defaults to None or null
  • Assuming it returns the first agent regardless of status
  • Believing the code handles no free agents gracefully
4.

Given this snippet:
assigned_agent = None
for agent in agents:
  if agent.status = 'free':
    assigned_agent = agent
return assigned_agent

What is the main error in this code?

medium
A. Returning assigned_agent inside the loop
B. Not initializing assigned_agent before loop
C. Using assignment operator '=' instead of comparison '==' in if condition
D. Using 'free' instead of 'available' as status

Solution

  1. Step 1: Check the if condition syntax

    The condition uses '=' which assigns value instead of '==' which compares values.
  2. Step 2: Understand impact of wrong operator

    This causes a syntax error or unintended behavior because '=' cannot be used in conditions.
  3. Final Answer:

    Using assignment operator '=' instead of comparison '==' in if condition -> Option C
  4. Quick Check:

    Use '==' for comparison, not '=' [OK]
Hint: Use '==' for comparisons in conditions [OK]
Common Mistakes:
  • Confusing '=' with '==' in if statements
  • Thinking assigned_agent must be initialized inside loop
  • Assuming return inside loop is the error
5.

You want to design a scalable delivery agent assignment system for a city with thousands of agents and orders per minute. Which approach best improves scalability?

hard
A. Use a centralized server to check all agents for every order
B. Partition the city into zones and assign agents within zones only
C. Assign agents randomly without considering location
D. Assign the oldest registered agent to every order

Solution

  1. Step 1: Understand scalability challenges

    Checking all agents for every order is slow and resource-heavy at large scale.
  2. Step 2: Choose a partitioning strategy

    Dividing the city into zones limits search space, making assignment faster and scalable.
  3. Step 3: Evaluate other options

    Random or oldest agent assignment ignores location, causing delays and inefficiency.
  4. Final Answer:

    Partition the city into zones and assign agents within zones only -> Option B
  5. Quick Check:

    Zone partitioning = scalable assignment [OK]
Hint: Divide city into zones to limit search scope [OK]
Common Mistakes:
  • Using centralized server causing bottlenecks
  • Ignoring agent location in assignment
  • Assigning agents randomly or by registration time