Overview - Sentinel architecture

What is it?

Sentinel architecture is a system in Redis that helps manage and monitor Redis servers automatically. It watches over Redis instances to detect failures and can promote a backup server to replace a failed main server. This keeps the Redis service running smoothly without manual intervention.

Why it matters

Without Sentinel, if a Redis server fails, someone must notice and fix it manually, causing downtime and lost data access. Sentinel automates this process, making Redis highly available and reliable, which is crucial for applications that need fast and continuous data access.

Where it fits

Before learning Sentinel, you should understand basic Redis server setup and replication. After Sentinel, you can explore Redis Cluster for scaling and advanced fault tolerance.

Mental Model

Core Idea

Sentinel acts like a watchful guardian that monitors Redis servers and steps in automatically to fix failures by promoting backups.

Think of it like...

Imagine a team of lifeguards watching swimmers in a pool. If one swimmer (server) starts struggling or disappears, the lifeguards quickly spot it and send a backup swimmer to take their place, keeping the swim race going without interruption.

┌─────────────┐       ┌─────────────┐       ┌─────────────┐
│  Redis      │       │  Redis      │       │  Redis      │
│  Master     │◄──────│  Replica 1  │       │  Replica 2  │
└─────────────┘       └─────────────┘       └─────────────┘
       ▲                    ▲                     ▲
       │                    │                     │
       │                    │                     │
┌───────────────────────────────────────────────┐
│                 Sentinel Nodes                 │
│  (Monitor, Detect Failures, Promote Backup)   │
└───────────────────────────────────────────────┘

Build-Up - 7 Steps

1

FoundationWhat is Redis Sentinel

Concept: Introduction to Sentinel as a monitoring and failover system for Redis.

Redis Sentinel is a separate system that watches Redis servers. It checks if the main Redis server (called master) is working well. If the master stops working, Sentinel can promote one of the backup servers (called replicas) to become the new master automatically.

Result

You understand Sentinel's role as a helper that keeps Redis running without manual fixes.

Understanding Sentinel as a guardian clarifies why it is essential for high availability in Redis.

2

FoundationBasic Redis Replication Setup

3

IntermediateSentinel Monitoring and Quorum

4

IntermediateAutomatic Failover Process

5

IntermediateSentinel Configuration and Deployment

6

AdvancedHandling Split-Brain and Network Partitions

7

ExpertSentinel Internals and Leader Election

Under the Hood

Sentinel nodes continuously send PING messages to Redis servers and each other to check health. They maintain state about which servers are up or down. When a master is suspected down, Sentinels exchange votes to reach quorum. Upon quorum, they elect a leader Sentinel that performs failover by sending commands to promote a replica and update configurations.

Why designed this way?

Sentinel was designed to provide automatic failover without a single point of failure. Using multiple Sentinels and quorum voting prevents false failovers caused by network glitches. Leader election ensures coordinated actions. Alternatives like manual failover or single-node monitoring were too slow or unreliable for production needs.

┌───────────────┐       ┌───────────────┐       ┌───────────────┐
│ Sentinel Node │◄──────│ Sentinel Node │──────►│ Sentinel Node │
└───────┬───────┘       └───────┬───────┘       └───────┬───────┘
        │                       │                       │
        ▼                       ▼                       ▼
┌─────────────┐         ┌─────────────┐         ┌─────────────┐
│ Redis Master│         │ Redis Replica│         │ Redis Replica│
└─────────────┘         └─────────────┘         └─────────────┘

Sentinels monitor Redis servers and communicate to agree on failures.
Leader Sentinel coordinates failover commands.

Myth Busters - 4 Common Misconceptions

Quick: Does Sentinel automatically scale Redis by adding more servers? Commit yes or no.

Common Belief:Sentinel automatically adds more Redis servers to handle more data or traffic.

Tap to reveal reality

Quick: Can a single Sentinel node safely decide to promote a replica alone? Commit yes or no.

Common Belief:One Sentinel node can detect failure and promote a replica by itself.

Tap to reveal reality

Quick: After failover, do clients automatically connect to the new master without any changes? Commit yes or no.

Common Belief:Clients always automatically find the new master after failover without configuration changes.

Tap to reveal reality

Quick: Is it impossible for two masters to exist at the same time with Sentinel? Commit yes or no.

Common Belief:Sentinel guarantees no split-brain; two masters can never exist simultaneously.

Tap to reveal reality

Expert Zone

1

Sentinel's failover timing balances speed and safety; too fast can cause false failovers, too slow increases downtime.

2

Sentinel nodes themselves can fail; deploying an odd number of Sentinels helps maintain quorum and availability.

3

Sentinel uses a gossip-like protocol to share state, which can cause delays in failure detection under heavy network load.

When NOT to use

Sentinel is not suitable for scaling Redis horizontally or sharding data; for that, use Redis Cluster. Also, in environments with extremely unstable networks, Sentinel's failover may cause split-brain; consider external orchestration tools.

Production Patterns

In production, teams deploy at least three Sentinel nodes on separate machines or data centers. Clients use Sentinel APIs to discover the current master dynamically. Monitoring and alerting are set up on Sentinel health to detect issues early.

Connections

Distributed Consensus Algorithms

Sentinel's leader election and quorum voting are examples of distributed consensus.

Understanding consensus algorithms like Raft or Paxos helps grasp how Sentinel coordinates failover safely.

High Availability Systems

Sentinel is a practical implementation of high availability principles in databases.

Knowing general HA concepts clarifies why Sentinel uses monitoring, failover, and redundancy.

Emergency Response Teams

Sentinel's role is similar to emergency teams that monitor and respond quickly to incidents.

Seeing Sentinel as an emergency response system highlights the importance of quick detection and coordinated action.

Common Pitfalls

#1Running only one Sentinel node for monitoring.

Wrong approach:sentinel monitor mymaster 127.0.0.1 6379 2 # Only one Sentinel node running

Correct approach:Run at least three Sentinel nodes on different machines: sentinel monitor mymaster 127.0.0.1 6379 2 # Deployed on three separate servers

Root cause:Misunderstanding that Sentinel needs quorum from multiple nodes to safely decide failover.

#2Clients connecting directly to Redis master without Sentinel support.

Wrong approach:redis-cli -h 127.0.0.1 -p 6379 # Client connects directly to master IP

Correct approach:Use Sentinel-aware clients or query Sentinel for master address: redis-cli -p 26379 sentinel get-master-addr-by-name mymaster # Then connect to returned master

Root cause:Not realizing clients must discover the current master dynamically after failover.

#3Setting quorum too low in Sentinel configuration.

Wrong approach:sentinel monitor mymaster 127.0.0.1 6379 1 # Quorum set to 1

Correct approach:Set quorum to majority of Sentinel nodes, e.g., 2 if 3 Sentinels: sentinel monitor mymaster 127.0.0.1 6379 2

Root cause:Underestimating the need for multiple votes to avoid false failover.

Key Takeaways

Redis Sentinel is an automatic system that monitors Redis servers and promotes backups if the main server fails.

Sentinel uses multiple nodes and quorum voting to safely detect failures and avoid mistakes.

Clients must be configured to use Sentinel to find the current master after failover.

Sentinel improves Redis availability but does not handle scaling or data sharding.

Understanding Sentinel's leader election and failover process is key to deploying reliable Redis systems.