Overview - Read replicas

What is it?

Read replicas are copies of a primary database that handle read-only queries. They help spread out the load by letting many users read data without slowing down the main database. These replicas stay updated by copying changes from the primary database. This setup improves performance and availability for applications that read data often.

Why it matters

Without read replicas, all users would query the main database, causing slow responses and possible crashes during high traffic. Read replicas solve this by sharing the reading work, making apps faster and more reliable. This is crucial for websites and services with many users who mostly read data, like social media or online stores.

Where it fits

Before learning about read replicas, you should understand basic database concepts like primary databases and replication. After this, you can explore advanced topics like load balancing, caching, and multi-region database setups.

Mental Model

Core Idea

Read replicas are copies of a main database that handle read requests to reduce load and improve speed without affecting writes.

Think of it like...

Imagine a popular library with one main librarian who handles all book requests. To avoid long waits, the library makes several copies of popular books and places them on shelves around the building. Visitors can read these copies anytime without bothering the librarian, who focuses on adding new books and managing the collection.

Primary Database (Write + Read)
       │
       ├──> Read Replica 1 (Read-only)
       ├──> Read Replica 2 (Read-only)
       └──> Read Replica 3 (Read-only)

All write operations go to the Primary Database.
Read operations are distributed among Read Replicas.

Build-Up - 7 Steps

1

FoundationUnderstanding Primary Database Role

Concept: Learn what a primary database does and why it handles both reads and writes.

A primary database stores all the data and processes both read and write requests. It ensures data is accurate and consistent. However, when many users read data, the primary can become slow because it handles all requests.

Result

You understand that the primary database is the main source of truth but can become a bottleneck under heavy read load.

Knowing the primary database's dual role helps you see why separating reads can improve performance.

2

FoundationBasics of Database Replication

3

IntermediateHow Read Replicas Handle Read Traffic

4

IntermediateData Consistency and Replication Lag

5

IntermediateScaling with Multiple Read Replicas

6

AdvancedRead Replica Failover and High Availability

7

ExpertAdvanced Replication Techniques and Tradeoffs

Under the Hood

Read replicas work by copying the write operations from the primary database through a replication process. This can be done by streaming changes (like logs of updates) or by periodic snapshots. The replicas apply these changes to their own data stores, keeping them nearly in sync. The replication can be asynchronous, where the primary does not wait for replicas to confirm, or synchronous, where it does. Applications route read queries to replicas using load balancers or client logic, while writes always go to the primary.

Why designed this way?

Read replicas were designed to solve the problem of scaling database reads without overloading the primary. Early databases handled all reads and writes on one server, which limited performance. Replication allowed distributing read load cheaply and simply. Asynchronous replication was chosen to maximize write speed, accepting some delay in data freshness. Synchronous replication exists for cases needing strict consistency but at a performance cost.

┌─────────────────────┐        ┌─────────────────────┐
│   Primary Database   │───────▶│   Read Replica 1    │
│  (Writes + Reads)   │        │   (Read-only)       │
└─────────────────────┘        └─────────────────────┘
          │                            ▲
          │                            │
          │                            │
          ▼                            │
┌─────────────────────┐              │
│   Read Replica 2    │◀─────────────┘
│   (Read-only)       │
└─────────────────────┘

Replication stream flows from Primary to Replicas.
Reads are distributed to replicas.
Writes go only to Primary.

Myth Busters - 4 Common Misconceptions

Quick: Do read replicas handle write requests? Commit to yes or no.

Common Belief:Read replicas can handle both read and write requests just like the primary.

Tap to reveal reality

Quick: Are read replicas always perfectly up-to-date with the primary? Commit to yes or no.

Common Belief:Read replicas always have the exact same data as the primary at all times.

Tap to reveal reality

Quick: Does adding more read replicas always improve performance linearly? Commit to yes or no.

Common Belief:More read replicas always mean proportionally better read performance.

Tap to reveal reality

Quick: Can a read replica instantly replace the primary if it fails? Commit to yes or no.

Common Belief:Read replicas can immediately take over as primary without any delay or data loss.

Tap to reveal reality

Expert Zone

1

Some applications use read-after-write consistency by directing recent writes to the primary and older reads to replicas, balancing freshness and performance.

2

Network topology and geographic distance affect replication lag; placing replicas closer to users can improve read latency but complicate synchronization.

3

Monitoring replication lag and automating failover decisions are critical in production to avoid stale reads and downtime.

When NOT to use

Read replicas are not suitable when applications require strict, immediate consistency for all reads and writes. In such cases, consider single primary with strong consistency or distributed databases with consensus protocols like Paxos or Raft.

Production Patterns

In production, read replicas are combined with load balancers or proxy layers that route read queries intelligently. Systems often use multiple replicas across regions for disaster recovery and low latency. Monitoring tools track replication health and lag to trigger alerts or automated failover.

Connections

Caching

Both caching and read replicas reduce load on primary data sources by serving repeated reads from faster or distributed stores.

Understanding read replicas helps grasp caching strategies since both aim to improve read performance but differ in data freshness and complexity.

Content Delivery Networks (CDNs)

CDNs and read replicas both replicate data closer to users to reduce latency and improve availability.

Knowing how read replicas work clarifies CDN design, as both balance freshness, consistency, and performance tradeoffs.

Supply Chain Management

Read replicas resemble inventory distribution centers that hold copies of products to serve customers faster without waiting for the main warehouse.

This cross-domain link shows how distributing copies strategically improves service speed and reliability in both tech and logistics.

Common Pitfalls

#1Sending write queries to read replicas causing errors.

Wrong approach:INSERT INTO users (name) VALUES ('Alice'); -- sent to read replica

Correct approach:INSERT INTO users (name) VALUES ('Alice'); -- sent to primary database

Root cause:Misunderstanding that read replicas are read-only and cannot process writes.

#2Ignoring replication lag and reading stale data from replicas.

Wrong approach:SELECT balance FROM accounts; -- always from read replica without checking freshness

Correct approach:SELECT balance FROM accounts; -- from primary or replica with lag monitoring

Root cause:Assuming replicas are always perfectly up-to-date leads to incorrect application behavior.

#3Adding too many read replicas without managing replication overhead.

Wrong approach:Deploying 20 replicas for a small app expecting linear performance gains.

Correct approach:Deploying a balanced number of replicas based on load and monitoring replication health.

Root cause:Believing more replicas always equal better performance without considering system limits.

Key Takeaways

Read replicas copy data from a primary database to handle read requests and reduce load on the main system.

They improve performance and availability but may show slightly outdated data due to replication lag.

Writes always go to the primary database to maintain data consistency and avoid conflicts.

Choosing replication modes and the number of replicas involves tradeoffs between speed, consistency, and resource use.

Proper monitoring and routing logic are essential to use read replicas effectively in production.