HLDsystem_design~25 mins

Design a unique ID generator in HLD - System Design Exercise

Choose your learning style9 modes available

Learn Why Deep Arch Practice Challenge Design Recall Scale

Design: Unique ID Generator

Design covers the ID generation service, its architecture, data flow, and scaling. Does not cover client-side ID usage or storage.

Functional Requirements

FR1: Generate unique IDs that are globally unique across distributed systems

FR2: Support high request throughput of up to 100,000 ID requests per second

FR3: IDs should be sortable by creation time

FR4: IDs must be generated with low latency (p99 < 10ms)

FR5: System should be highly available with 99.9% uptime

FR6: Support multiple clients generating IDs concurrently

Non-Functional Requirements

NFR1: No single point of failure

NFR2: IDs must be collision-free even under network partitions

NFR3: System should scale horizontally

NFR4: Generated IDs should be compact (e.g., 64-bit or 128-bit)

NFR5: Avoid reliance on centralized databases for ID generation

Think Before You Design

Questions to Ask

❓ Question 1

❓ Question 2

❓ Question 3

❓ Question 4

❓ Question 5

❓ Question 6

Key Components

Timestamp provider

Node or machine identifier

Sequence number generator

API gateway or load balancer

Distributed coordination or consensus service (optional)

Cache or in-memory store for sequence state

Design Patterns

Snowflake ID generation pattern

UUID version 1 or 4

Database auto-increment with sharding

Timestamp + node ID + sequence number

Leader election for coordination

Reference Architecture

Load Balancer

↓

ID Generator Nodes

↓

Clients receive unique IDs

↓

Timestamp

Components

Load Balancer

Nginx or Cloud Load Balancer

Distributes incoming ID generation requests evenly across ID generator nodes

ID Generator Nodes

Custom service using Snowflake algorithm

Generate unique IDs using timestamp, node ID, and sequence number

Coordination Service

Apache ZooKeeper or etcd

Assigns unique node IDs and manages node membership to avoid collisions

Request Flow

1. Client sends request for unique ID to Load Balancer

2. Load Balancer forwards request to one of the ID Generator Nodes

3. ID Generator Node obtains current timestamp

4. Node uses its unique node ID assigned by Coordination Service

5. Node increments sequence number for IDs generated within the same millisecond

6. Node combines timestamp, node ID, and sequence number to form unique ID

7. Node returns generated unique ID to client

Database Schema

No persistent database required for ID generation. Coordination Service stores ephemeral node IDs and membership info.

Scaling Discussion

Bottlenecks

Sequence number overflow if too many IDs requested in one millisecond per node

Coordination service becoming a single point of failure

Load balancer overload under very high request rates

Clock synchronization issues causing duplicate or out-of-order IDs

Solutions

Increase number of ID generator nodes to distribute load

Use high-availability setup for coordination service with leader election

Implement client-side retries and backoff for load balancer failures

Use NTP or GPS for clock synchronization and monitor clock drift

If sequence overflows, wait for next millisecond or reject requests temporarily

Interview Tips

Time: Spend 10 minutes clarifying requirements and constraints, 20 minutes designing architecture and data flow, 10 minutes discussing scaling and failure handling, 5 minutes summarizing.

Explain why global uniqueness and ordering are important

Describe how timestamp, node ID, and sequence number combine to form unique IDs

Discuss trade-offs between centralized and decentralized approaches

Highlight how coordination service prevents node ID collisions

Address how system handles clock issues and high throughput

Mention scalability and fault tolerance strategies