0
0
HLDsystem_design~12 mins

Blob storage (S3, Azure Blob) in HLD - Architecture Diagram

Choose your learning style9 modes available
System Overview - Blob storage (S3, Azure Blob)

This system stores and manages large amounts of unstructured data like images, videos, and documents. It provides scalable, durable, and secure storage accessible over the internet with high availability and low latency.

Architecture Diagram
User
  |
  v
Internet CDN
  |
  v
Load Balancer
  |
  v
API Gateway
  |
  v
Blob Storage Service
  |
  +------------------+
  |                  |
  v                  v
Metadata DB       Object Storage
(Cache Layer)     (Durable Storage)
Components
User
client
End user or application accessing blob storage
Internet CDN
cdn
Caches frequently accessed blobs close to users for faster delivery
Load Balancer
load_balancer
Distributes incoming requests evenly to backend services
API Gateway
api_gateway
Handles authentication, authorization, and routes requests to blob storage service
Blob Storage Service
service
Manages blob upload, download, and metadata operations
Metadata DB (Cache Layer)
cache
Stores metadata like blob names, sizes, and access permissions for quick lookup
Object Storage (Durable Storage)
storage
Stores the actual blob data reliably and durably across multiple servers
Request Flow - 10 Hops
UserInternet CDN
Internet CDNLoad Balancer
Load BalancerAPI Gateway
API GatewayBlob Storage Service
Blob Storage ServiceMetadata DB (Cache Layer)
Blob Storage ServiceObject Storage (Durable Storage)
Blob Storage ServiceAPI Gateway
API GatewayLoad Balancer
Load BalancerInternet CDN
Internet CDNUser
Failure Scenario
Component Fails:Object Storage (Durable Storage)
Impact:Blob data cannot be read or written, causing failures in upload/download operations
Mitigation:Use data replication across multiple storage nodes and regions to ensure availability and durability
Architecture Quiz - 3 Questions
Test your understanding
Which component is responsible for authenticating user requests?
AAPI Gateway
BLoad Balancer
CMetadata DB
DInternet CDN
Design Principle
This architecture uses a layered approach with caching (CDN and metadata cache) to reduce latency and load on durable storage. It separates concerns by using an API Gateway for security and routing, and durable object storage for reliable data persistence. Load balancing ensures scalability and availability.