
Hot-warm-cold architecture in Elasticsearch - Deep Dive

Overview - Hot-warm-cold architecture
What is it?
Hot-warm-cold architecture is a way to organize data storage in Elasticsearch based on how often data is used and how fast it needs to be accessed. It divides data into three layers: hot for new and frequently accessed data, warm for less active data, and cold for old data that is rarely accessed but still kept. This helps manage resources efficiently and keeps search fast.
Why it matters
Without this architecture, all data would be treated the same, making searches slower and more expensive as data grows. It solves the problem of balancing speed and cost by storing data in the right place depending on its age and usage. This means businesses can keep large amounts of data without slowing down or spending too much on storage.
Where it fits
Before learning this, you should understand basic Elasticsearch concepts like indices and nodes. After this, you can explore advanced data lifecycle management and performance tuning in Elasticsearch clusters.
Mental Model
Core Idea
Hot-warm-cold architecture organizes data by usage frequency and age to optimize cost and performance in Elasticsearch.
Think of it like...
It's like a library where new popular books are kept on the front shelves (hot), older but still sometimes read books are on side shelves (warm), and rarely read books are stored in the basement (cold).
┌────────────┐   ┌────────────┐   ┌────────────┐
│    HOT     │──▶│    WARM    │──▶│    COLD    │
│ (Fast,     │   │ (Slower,   │   │ (Slowest,  │
│  expensive)│   │  cheaper)  │   │  cheapest) │
└────────────┘   └────────────┘   └────────────┘
      ▲                ▲                ▲
      │                │                │
   New &          Less active      Rarely used
   frequently     data             data
   accessed
Build-Up - 7 Steps
1
Foundation: Understanding Elasticsearch data nodes
Concept: Learn what data nodes are and how they store data in Elasticsearch.
Elasticsearch stores data in units called indices, which are split into shards. These shards live on data nodes, which are servers that hold and manage the data. Each node can have different roles, but data nodes specifically hold the actual data and respond to search requests.
Result
You know that data nodes are the servers where your data lives and that they handle search and indexing.
Understanding data nodes is key because hot-warm-cold architecture depends on placing data on different nodes based on usage.
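In practice, a node's role is declared in its configuration file. A minimal sketch (the role list is illustrative; a dedicated data node lists only data roles):

```
# elasticsearch.yml — sketch of a dedicated data node
node.roles: [ data ]
```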
2
Foundation: What is data aging in Elasticsearch?
Concept: Data aging means that how often data is accessed changes over time.
When data is new, it is accessed often for searching or updating. Over time, it becomes less important and is accessed less. Eventually, it might only be kept for records and rarely searched. This natural change is called data aging.
Result
You understand that data usage changes over time, which creates the need to treat data differently.
Knowing data ages helps explain why storing all data the same way is inefficient.
3
Intermediate: Defining hot, warm, and cold tiers
🤔 Before reading on: do you think hot data is stored on the fastest or slowest hardware? Commit to your answer.
Concept: Hot, warm, and cold tiers are categories of data storage based on access speed and cost.
Hot tier stores new, frequently accessed data on fast, expensive hardware for quick searches. Warm tier holds older data accessed less often on slower, cheaper hardware. Cold tier keeps old, rarely accessed data on the slowest and cheapest hardware but still searchable.
Result
You can classify data into three tiers and understand their tradeoffs between speed and cost.
Recognizing these tiers helps optimize resource use by matching data needs to hardware capabilities.
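In recent Elasticsearch versions, these tiers map to built-in node roles (data_hot, data_warm, data_cold), and an index can state which tier it prefers. A sketch, assuming a hypothetical index named my-index:

```
PUT /my-index/_settings
{
  "index.routing.allocation.include._tier_preference": "data_warm,data_hot"
}
```

The preference list falls back left to right: the index lands on warm nodes if any exist, otherwise on hot nodes.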
4
Intermediate: How data moves between tiers
🤔 Before reading on: do you think data moves automatically between tiers or must be moved manually? Commit to your answer.
Concept: Data moves from hot to warm to cold as it ages, often automatically using policies.
Elasticsearch can use Index Lifecycle Management (ILM) policies to automatically move data between tiers based on age or size. For example, after 7 days in hot, data moves to warm; after 30 days, it moves to cold. This keeps the system efficient without manual work.
Result
You understand that data migration between tiers can be automated to save effort and maintain performance.
Knowing automatic data movement prevents manual errors and ensures data is always stored optimally.
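The 7-day and 30-day example above can be expressed as an ILM policy. A sketch (the policy name is hypothetical; with built-in data tiers, entering the warm or cold phase triggers an automatic migrate action by default):

```
PUT /_ilm/policy/example_policy
{
  "policy": {
    "phases": {
      "hot":  { "actions": { "rollover": { "max_age": "7d" } } },
      "warm": { "min_age": "7d",  "actions": {} },
      "cold": { "min_age": "30d", "actions": {} }
    }
  }
}
```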
5
Intermediate: Hardware and configuration differences per tier
Concept: Each tier uses different hardware and settings to balance cost and performance.
Hot nodes use SSDs and more CPU for fast indexing and searching. Warm nodes use slower disks like HDDs and less CPU since data is less active. Cold nodes use even slower storage and minimal CPU because data is rarely searched. Settings like replica count and refresh intervals also differ per tier.
Result
You can plan hardware and settings for each tier to optimize costs and speed.
Understanding hardware differences helps design clusters that meet performance needs without overspending.
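Replica count and refresh interval are ordinary index settings, so they can be relaxed as an index ages. A sketch for an index already sitting on warm or cold nodes (the index name is hypothetical):

```
PUT /logs-000001/_settings
{
  "number_of_replicas": 0,
  "refresh_interval": "5m"
}
```

By contrast, hot indices typically keep at least one replica and a short refresh interval so new documents become searchable quickly.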
6
Advanced: Tradeoffs and challenges in tiered storage
🤔 Before reading on: do you think cold data is always instantly searchable or might it have delays? Commit to your answer.
Concept: Tiered storage improves cost but can introduce delays and complexity.
Cold data may take longer to search because it’s on slower disks or even frozen (offline) storage. Managing ILM policies and ensuring data integrity across tiers adds complexity. Also, wrong tiering can cause performance issues or higher costs if data is misclassified.
Result
You see that tiered storage is a balance, not a perfect solution, requiring careful planning.
Knowing these tradeoffs prepares you to design and maintain efficient, reliable Elasticsearch clusters.
7
Expert: Optimizing hot-warm-cold in large clusters
🤔 Before reading on: do you think all indices should have the same lifecycle policy in large clusters? Commit to your answer.
Concept: Large clusters need tailored policies and monitoring for each data type and index.
In big systems, different data sets have different lifecycles and access patterns. Experts create custom ILM policies per index, monitor node health closely, and use shard allocation awareness to keep data balanced. They also tune refresh rates and replica counts per tier to optimize performance and cost.
Result
You understand how to scale hot-warm-cold architecture for complex, real-world Elasticsearch deployments.
Knowing how to customize and monitor tiering policies is essential for maintaining performance and cost efficiency at scale.
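Shard allocation awareness works by tagging each node with an attribute and telling the cluster to balance shard copies across its values. A sketch (the attribute name and zone value are illustrative):

```
# elasticsearch.yml on each node — "zone" and "zone-a" are example values
node.attr.zone: zone-a
cluster.routing.allocation.awareness.attributes: zone
```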
Under the Hood
Elasticsearch uses node attributes and shard allocation rules to place indices on nodes tagged as hot, warm, or cold. ILM policies automate index rollover and shard movement by changing index settings and triggering shard relocation. The cluster state tracks node roles and index locations, coordinating searches and writes accordingly.
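The node attributes mentioned here are plain key-value tags in each node's configuration. A sketch; the attribute name "data" and value "warm" are conventions chosen to match allocation rules such as allocate.require, not built-in settings:

```
# elasticsearch.yml on a warm node
node.attr.data: warm
```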
Why designed this way?
This design arose to handle growing data volumes without linear cost increases. Early Elasticsearch clusters treated all data equally, causing performance bottlenecks and high costs. Separating data by usage allowed better hardware utilization and predictable performance. Alternatives like single-tier storage were too expensive or slow at scale.
┌───────────────┐       ┌───────────────┐       ┌───────────────┐
│   Hot Nodes   │──────▶│   Warm Nodes  │──────▶│   Cold Nodes  │
│ (Fast SSDs)   │       │ (Slower HDDs) │       │ (Slowest Disk)│
│  Index writes │       │  Read mostly  │       │  Archive data │
│ Freq. queries │       │  Few writes   │       │ Rare queries  │
└───────────────┘       └───────────────┘       └───────────────┘
        ▲                      ▲                       ▲
        │                      │                       │
   New data             Aging data              Old data
   indexed here         moved here             moved here
Myth Busters - 4 Common Misconceptions
Quick: Is cold data in Elasticsearch always offline and inaccessible? Commit to yes or no.
Common Belief: Cold data is offline and cannot be searched until manually restored.
Reality: Cold data is still searchable but may have slower response times because it is stored on slower hardware or frozen nodes.
Why it matters: Believing cold data is offline can lead to unnecessary manual steps and delays in accessing important historical data.
Quick: Does hot-warm-cold architecture mean you must buy three different types of hardware? Commit to yes or no.
Common Belief: You must have three completely separate hardware setups for hot, warm, and cold tiers.
Reality: While hardware is often different, many clusters use mixed nodes or cloud storage tiers to reduce complexity and cost.
Why it matters: Thinking you need separate hardware can discourage adoption or lead to over-provisioning.
Quick: Does data automatically move between tiers without any configuration? Commit to yes or no.
Common Belief: Data moves between hot, warm, and cold tiers automatically without any setup.
Reality: You must configure Index Lifecycle Management policies to automate data movement; otherwise, data stays where it is.
Why it matters: Assuming automatic movement can cause data to remain on expensive hot nodes longer than needed, increasing costs.
Quick: Is hot data always the largest portion of your Elasticsearch storage? Commit to yes or no.
Common Belief: Hot data makes up most of the storage because it is the newest and most important.
Reality: Hot data is usually a small fraction; warm and cold data often make up the majority of storage as data ages.
Why it matters: Misunderstanding data distribution can lead to poor capacity planning and unexpected costs.
Expert Zone
1
ILM policies can include custom actions like force merging or shrinking indices to optimize storage in warm and cold tiers.
2
Shard allocation awareness can be used to keep replicas on different tiers or availability zones for fault tolerance.
3
Cold tier nodes can be configured as frozen nodes, which keep data on remote storage and load it into memory only when searched.
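A force merge and shrink in the warm phase can be expressed directly in an ILM policy. A sketch (the policy name is hypothetical):

```
PUT /_ilm/policy/optimize_warm
{
  "policy": {
    "phases": {
      "warm": {
        "min_age": "7d",
        "actions": {
          "shrink": { "number_of_shards": 1 },
          "forcemerge": { "max_num_segments": 1 }
        }
      }
    }
  }
}
```

Shrinking reduces per-shard overhead once an index stops receiving writes, and force merging to one segment speeds up reads on read-only data.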
When NOT to use
Hot-warm-cold architecture is less suitable for small clusters or use cases where all data is equally critical and frequently accessed. In such cases, a single-tier cluster or time-series optimized indices might be better.
Production Patterns
In production, teams often combine hot-warm-cold with rollover indices and snapshot backups. They monitor ILM execution closely and adjust policies based on query patterns and storage costs. Some use cloud storage for cold tier to reduce on-premises hardware needs.
Connections
Data Lifecycle Management
Hot-warm-cold architecture builds on data lifecycle management principles.
Understanding lifecycle management helps grasp how data moves through hot, warm, and cold tiers automatically.
Cache Hierarchy in Computer Architecture
Both organize data storage by speed and cost tradeoffs.
Knowing cache layers (L1, L2, L3) helps understand why Elasticsearch separates data into hot, warm, and cold tiers for efficiency.
Library Book Organization
Both arrange items by usage frequency and accessibility.
Seeing how libraries keep popular books handy and archives in storage clarifies why data is tiered in Elasticsearch.
Common Pitfalls
#1 Not setting ILM policies causes data to stay on hot nodes indefinitely.
Wrong approach:
PUT /_template/my_template
{
  "index_patterns": ["logs-*"],
  "settings": { "number_of_shards": 1 }
}
Correct approach:
PUT /_ilm/policy/logs_policy
{
  "policy": {
    "phases": {
      "hot":  { "actions": { "rollover": { "max_age": "7d" } } },
      "warm": { "actions": { "allocate": { "require": { "data": "warm" } } } },
      "cold": { "actions": { "allocate": { "require": { "data": "cold" } } } }
    }
  }
}
PUT /_template/my_template
{
  "index_patterns": ["logs-*"],
  "settings": {
    "number_of_shards": 1,
    "index.lifecycle.name": "logs_policy",
    "index.lifecycle.rollover_alias": "logs"
  }
}
Root cause: Learners may not realize that ILM policies must be explicitly created and linked to indices to automate tier movement.
#2 Using the same hardware and settings for all tiers wastes resources.
Wrong approach: All nodes use SSDs and high CPU with frequent refresh intervals regardless of data age.
Correct approach: Hot nodes use SSDs and fast CPUs with frequent refreshes; warm nodes use slower disks and less CPU; cold nodes use the cheapest storage and minimal CPU with infrequent refreshes.
Root cause: Misunderstanding that different data ages require different hardware and settings leads to inefficient cluster design.
#3 Assuming cold tier data is instantly searchable like hot data.
Wrong approach: Expecting sub-second search times on cold nodes without tuning or understanding delays.
Correct approach: Plan for slower search times on cold nodes and configure frozen indices or caching to improve performance when needed.
Root cause: Not recognizing the performance tradeoffs of storing data on slower hardware causes unrealistic expectations.
Key Takeaways
Hot-warm-cold architecture organizes Elasticsearch data by usage frequency and age to balance speed and cost.
Data moves through tiers automatically using Index Lifecycle Management policies based on age or size.
Each tier uses different hardware and settings optimized for its access patterns and cost constraints.
Understanding this architecture helps design scalable, efficient Elasticsearch clusters that handle large data volumes.
Misconfigurations or misunderstandings can lead to higher costs, slower searches, or wasted resources.