Overview - Replication basics

What is it?

Replication in MySQL is a process where data from one database server (called the master) is copied automatically to another server (called the slave). This helps keep the data synchronized between servers. It is used to improve data availability, backup, and load balancing. Replication happens continuously and automatically once set up.

Why it matters

Without replication, if the main database server fails, data could be lost or unavailable, causing downtime and lost business. Replication ensures there is a backup copy of data that can take over quickly. It also allows spreading read requests across servers, improving performance and user experience.

Where it fits

Before learning replication, you should understand basic MySQL database concepts like tables, queries, and server setup. After mastering replication basics, you can learn advanced topics like replication filtering, multi-source replication, and failover strategies.

Mental Model

Core Idea

Replication is like having a trusted assistant who copies every change you make in your notebook to their own notebook in real time.

Think of it like...

Imagine you write notes in your diary every day. Your friend sits next to you and writes down exactly what you write, so they have the same diary. If you lose your diary, your friend’s copy keeps your notes safe.

Master Server
┌───────────────┐
│  Writes data  │
└──────┬────────┘
       │  sends changes
       ▼
Slave Server
┌───────────────┐
│  Receives and │
│  applies data │
└───────────────┘

Build-Up - 7 Steps

1

FoundationWhat is MySQL Replication?

Concept: Introduction to the basic idea of copying data from one server to another automatically.

MySQL replication means one server (master) sends all changes it makes to another server (slave). The slave keeps a copy of the master's data. This happens continuously without manual copying.

Result

You get two servers with the same data, where one is the main source and the other is a backup copy.

Understanding replication as automatic copying helps you see how data stays safe and available.

2

FoundationRoles: Master and Slave Servers

3

IntermediateHow Data Moves: Binary Log and Relay Log

4

IntermediateSetting Up Replication Step-by-Step

5

IntermediateRead-Only Slave and Load Balancing

6

AdvancedHandling Replication Delays and Conflicts

7

ExpertSemi-Synchronous Replication and Failover Strategies

Under the Hood

MySQL replication works by the master recording all data changes in a binary log file. The slave connects to the master and requests this log. The slave copies the binary log into its relay log and then applies each change to its own data. This process is asynchronous by default, meaning the master does not wait for the slave to finish applying changes before continuing. The replication protocol uses a client-server connection over TCP/IP, with the slave acting as a client reading the master's binary log.

Why designed this way?

Replication was designed to be asynchronous to maximize performance and reduce master load. Synchronous replication would slow down writes because the master would wait for slaves to confirm. Using logs decouples data changes from replication, allowing recovery and replay. The binary log format is compact and efficient, and the relay log on the slave ensures ordered application of changes even if the connection drops temporarily.

Master Server
┌───────────────┐
│ Binary Log    │
│ (records all  │
│ changes)      │
└──────┬────────┘
       │
       │ TCP/IP connection
       ▼
Slave Server
┌───────────────┐
│ Relay Log     │
│ (copies from  │
│ binary log)   │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ Applies data  │
│ changes to DB │
└───────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does replication guarantee zero delay between master and slave data? Commit to yes or no.

Common Belief:Replication means the slave always has exactly the same data as the master instantly.

Tap to reveal reality

Quick: Can slaves accept write queries from users? Commit to yes or no.

Common Belief:Slaves can be used just like masters and accept data changes from users.

Tap to reveal reality

Quick: Does replication automatically protect against all data loss? Commit to yes or no.

Common Belief:Replication always guarantees no data loss even if the master crashes.

Tap to reveal reality

Quick: Is replication setup the same for all MySQL versions? Commit to yes or no.

Common Belief:Replication setup and behavior are identical across all MySQL versions.

Tap to reveal reality

Expert Zone

1

Replication lag can be caused not only by network delay but also by heavy slave query load or slow disk I/O, which many overlook.

2

GTID-based replication simplifies failover and recovery but requires careful planning to avoid conflicts and ensure consistent server states.

3

Semi-synchronous replication improves durability but can reduce write throughput, so balancing safety and performance is a key expert decision.

When NOT to use

Replication is not suitable when you need immediate consistency across servers; in such cases, synchronous distributed databases or clustering solutions like Galera Cluster are better. Also, for very high write workloads, replication can become a bottleneck and alternative sharding or partitioning strategies might be preferred.

Production Patterns

In production, replication is often combined with monitoring tools to detect lag and failures automatically. Multi-slave setups distribute read load, and delayed slaves are used for point-in-time recovery. Experts use GTIDs for easier failover and semi-synchronous replication to reduce data loss risk. Backup strategies complement replication to ensure data safety.

Connections

Event Sourcing (Software Architecture)

Replication logs changes as events, similar to event sourcing recording state changes.

Understanding replication logs as event streams helps grasp how systems can rebuild state from change history.

Backup and Disaster Recovery

Replication provides a live copy of data, complementing traditional backups for faster recovery.

Knowing replication’s role alongside backups clarifies how to design resilient data systems.

Supply Chain Management

Replication’s master-slave data flow resembles supply chains where goods move from supplier to retailer.

Seeing replication as a flow of goods helps understand the importance of order, timing, and reliability in data synchronization.

Common Pitfalls

#1Not enabling binary logging on the master server.

Wrong approach:On master: skip setting 'log_bin=ON' in my.cnf and restart server.

Correct approach:On master: add 'log_bin=ON' in my.cnf and restart server to enable binary logging.

Root cause:Replication depends on the binary log; without it, slaves have no data to copy.

#2Starting slave replication without specifying the correct log file and position.

Wrong approach:On slave: RUN 'CHANGE MASTER TO MASTER_HOST='master', MASTER_USER='replica', MASTER_PASSWORD='pass'; START SLAVE;' without log file and position.

Correct approach:On slave: RUN 'CHANGE MASTER TO MASTER_HOST='master', MASTER_USER='replica', MASTER_PASSWORD='pass', MASTER_LOG_FILE='mysql-bin.000001', MASTER_LOG_POS=154; START SLAVE;' with correct log file and position.

Root cause:Slave needs to know where to start reading the master's binary log to avoid data inconsistency.

#3Allowing writes on the slave server.

Wrong approach:On slave: leave 'read_only=OFF' and run INSERT or UPDATE queries directly.

Correct approach:On slave: set 'read_only=ON' in my.cnf to prevent accidental writes.

Root cause:Writes on slaves cause conflicts and break replication consistency.

Key Takeaways

MySQL replication copies data changes from a master server to one or more slave servers automatically using binary and relay logs.

Replication roles are clearly defined: the master writes data and logs changes, while slaves read and apply those changes to stay synchronized.

Replication is asynchronous by default, so slaves may lag behind the master, which affects data freshness for read queries.

Proper setup requires enabling binary logging on the master, configuring slaves with correct log positions, and usually setting slaves to read-only mode.

Advanced replication modes and failover strategies improve data safety and availability but require careful planning and monitoring.