
Cache eviction policies (LRU, LFU, TTL) in HLD - Deep Dive

Overview - Cache eviction policies (LRU, LFU, TTL)
What is it?
Cache eviction policies are rules that decide which data to remove from a cache when it is full. They help keep the cache efficient by removing less useful data to make space for new data. Common policies include LRU (Least Recently Used), LFU (Least Frequently Used), and TTL (Time To Live). Each policy uses a different way to pick what to evict.
Why it matters
Without eviction policies, caches would fill up and stop working well, slowing down systems that rely on fast data access. Good eviction policies keep the cache fresh and relevant, improving speed and reducing load on slower storage or databases. This makes apps feel faster and saves resources.
Where it fits
Learners should first understand what a cache is and why caching improves performance. After learning eviction policies, they can explore cache implementation details, distributed caching, and how eviction affects system scalability and consistency.
Mental Model
Core Idea
Cache eviction policies decide which stored data to remove when space runs out, balancing freshness and usefulness to keep the cache effective.
Think of it like...
Imagine a small backpack with limited space. You decide what to take out when it’s full: either the items you haven’t used recently (LRU), the items you rarely use (LFU), or the items that have expired (TTL).
┌───────────────┐
│   Cache Full  │
└──────┬────────┘
       │
       ▼
┌──────────────────────────────┐
│   Eviction Policy Decision   │
│ ┌─────────┬─────────┬──────┐ │
│ │   LRU   │   LFU   │ TTL  │ │
│ └─────────┴─────────┴──────┘ │
└───────────────┬──────────────┘
                │
                ▼
       ┌─────────────────┐
       │ Remove Selected │
       │   Cache Entry   │
       └─────────────────┘
Build-Up - 7 Steps
1
Foundation: What is a Cache and Why Evict
🤔
Concept: Introduce the basic idea of a cache and why eviction is necessary.
A cache is a small, fast storage that keeps copies of data to speed up access. Since cache size is limited, it cannot hold everything. When full, it must remove some data to make room for new data. This removal process is called eviction.
Result
Learners understand that eviction is essential to keep caches working efficiently.
Knowing why eviction is needed helps learners appreciate the role of eviction policies in maintaining cache performance.
2
Foundation: Basic Cache Eviction Concepts
🤔
Concept: Explain what eviction policies are and their purpose.
Eviction policies are rules that decide which cache entries to remove when space is needed. They aim to keep the most useful data in the cache. Different policies use different criteria like recent use, frequency, or time.
Result
Learners grasp that eviction policies guide cache data removal to optimize usefulness.
Understanding eviction policies as decision rules clarifies how caches stay effective under space limits.
3
Intermediate: Least Recently Used (LRU) Policy
🤔Before reading on: do you think LRU removes the oldest data or the least recently used data? Commit to your answer.
Concept: Introduce LRU, which removes the data not used for the longest time.
LRU tracks when each cache entry was last accessed. When eviction is needed, it removes the entry that was used longest ago. This assumes recently used data is more likely to be used again soon.
Result
Learners understand how LRU prioritizes recent usage to keep cache relevant.
Knowing LRU’s focus on recency helps predict cache behavior in workloads with temporal locality.
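The recency tracking described above can be sketched in Python with `collections.OrderedDict`, which keeps entries in usage order so the least recently used one is always at the front. This is a minimal illustration under assumed names (`LRUCache`, `get`, `put`), not a production implementation:

```python
from collections import OrderedDict

class LRUCache:
    """Minimal LRU cache: evicts the least recently used entry when full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()  # insertion order doubles as recency order

    def get(self, key):
        if key not in self.data:
            return None
        self.data.move_to_end(key)  # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict the least recently used entry

cache = LRUCache(2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")         # "a" is now the most recently used entry
cache.put("c", 3)      # cache is full, so "b" (least recently used) is evicted
print(cache.get("b"))  # None
print(cache.get("a"))  # 1
```

Note that `get` counts as a use: accessing "a" saved it from eviction even though it was added before "b". That is exactly the recency-over-age behavior LRU is designed for.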
4
Intermediate: Least Frequently Used (LFU) Policy
🤔Before reading on: does LFU remove data used least recently or least often? Commit to your answer.
Concept: Explain LFU, which removes data accessed the fewest times.
LFU counts how often each cache entry is accessed. When space is needed, it evicts the entry with the lowest access count. This assumes frequently used data is more valuable to keep.
Result
Learners see how LFU focuses on long-term popularity rather than recent use.
Understanding LFU’s frequency-based approach reveals its strength in workloads with stable popular data.
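A minimal LFU sketch in Python (class name and API are illustrative). The O(n) scan for the victim is deliberately naive; it is exactly the kind of cost that real systems replace with frequency lists, as the later steps discuss:

```python
from collections import defaultdict

class LFUCache:
    """Minimal LFU cache: evicts the entry with the lowest access count."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = {}
        self.freq = defaultdict(int)  # access count per key

    def get(self, key):
        if key not in self.data:
            return None
        self.freq[key] += 1
        return self.data[key]

    def put(self, key, value):
        if key not in self.data and len(self.data) >= self.capacity:
            # naive O(n) scan for the least frequently used key
            victim = min(self.data, key=lambda k: self.freq[k])
            del self.data[victim]
            del self.freq[victim]
        self.data[key] = value
        self.freq[key] += 1  # a write also counts as an access in this sketch

cache = LFUCache(2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")         # "a" now has count 2, "b" has count 1
cache.put("c", 3)      # "b" is the least frequently used and is evicted
print(cache.get("b"))  # None
```

Unlike the LRU example, "b" here is evicted because of its low total count, not because of when it was last touched.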
5
Intermediate: Time To Live (TTL) Policy
🤔Before reading on: does TTL evict based on usage or time? Commit to your answer.
Concept: Introduce TTL, which removes data after a fixed time expires.
TTL assigns an expiration time to each cache entry when added. Once the time passes, the entry is removed regardless of usage. This ensures data freshness and avoids stale information.
Result
Learners understand TTL’s role in controlling data age in caches.
Knowing TTL’s time-based eviction helps manage cache freshness in dynamic data environments.
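A minimal TTL sketch using lazy expiration: expired entries are dropped only when they are next accessed. The class name and API are illustrative, and `time.monotonic()` is used so expiry is unaffected by wall-clock changes:

```python
import time

class TTLCache:
    """Minimal TTL cache with lazy expiration on access."""

    def __init__(self, default_ttl):
        self.default_ttl = default_ttl  # seconds
        self.data = {}                  # key -> (value, expires_at)

    def put(self, key, value, ttl=None):
        lifetime = ttl if ttl is not None else self.default_ttl
        self.data[key] = (value, time.monotonic() + lifetime)

    def get(self, key):
        entry = self.data.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if time.monotonic() >= expires_at:
            del self.data[key]  # lazy expiration: drop the stale entry now
            return None
        return value

cache = TTLCache(default_ttl=0.05)
cache.put("session", "abc123")
print(cache.get("session"))  # "abc123" while still fresh
time.sleep(0.1)
print(cache.get("session"))  # None: expired regardless of usage
```

Notice that the entry disappears even though it was accessed moments before: TTL ignores usage entirely, which is the key contrast with LRU and LFU.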
6
Advanced: Comparing Eviction Policies and Tradeoffs
🤔Before reading on: which policy do you think works best for all cases? Commit to your answer.
Concept: Discuss strengths and weaknesses of LRU, LFU, and TTL and when to use each.
LRU works well when recent data is likely reused but can fail if old data is still important. LFU favors long-term popular data but can keep stale entries. TTL ensures freshness but may evict useful data prematurely. Hybrid approaches combine policies for better results.
Result
Learners appreciate that no single policy fits all scenarios and tradeoffs exist.
Understanding tradeoffs guides choosing or designing eviction policies tailored to specific workloads.
7
Expert: Implementing Efficient Eviction in Large Systems
🤔Before reading on: do you think tracking exact usage for all entries is cheap or costly? Commit to your answer.
Concept: Explore challenges and solutions for implementing eviction policies efficiently at scale.
Tracking exact usage or frequency for millions of entries is costly. Systems use approximations like counters with limited bits, sampling, or segmented caches. TTL requires timers or lazy expiration. Efficient data structures like linked lists or heaps help maintain eviction order with minimal overhead.
Result
Learners understand practical constraints and optimizations in real-world cache eviction.
Knowing implementation challenges prevents naive designs that hurt performance and scalability.
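The sampling approximation mentioned above can be sketched as follows: instead of maintaining a full recency order over millions of entries, pick a few keys at random and evict the least recently used among that sample. Redis uses a similar sampled-LRU strategy, though with more refinements; the function name and parameters here are illustrative:

```python
import random

def evict_sampled_lru(cache, last_access, sample_size=5):
    """Approximate LRU: sample a few keys and evict the one with the
    oldest last-access time, avoiding a full recency-ordered structure."""
    sample = random.sample(list(cache), min(sample_size, len(cache)))
    victim = min(sample, key=lambda k: last_access[k])
    del cache[victim]
    del last_access[victim]
    return victim

cache = {"a": 1, "b": 2, "c": 3}
last_access = {"a": 100.0, "b": 50.0, "c": 200.0}  # fake timestamps
evicted = evict_sampled_lru(cache, last_access, sample_size=3)
print(evicted)  # "b": the sample covers all keys here, so the oldest is chosen
```

With a small `sample_size` the result is only probably the globally least recently used key, but the bookkeeping cost per eviction stays constant no matter how large the cache grows.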
Under the Hood
Cache eviction policies work by maintaining metadata about each cache entry, such as last access time (LRU), access count (LFU), or expiration timestamp (TTL). When the cache is full, the policy uses this metadata to select entries to remove. Data structures like doubly linked lists (for LRU), frequency lists (for LFU), and priority queues or timers (for TTL) support efficient updates and lookups.
Why designed this way?
These policies were designed to balance cache hit rate and resource use. LRU leverages temporal locality common in many workloads. LFU captures long-term popularity but is more complex. TTL addresses data freshness needs. Alternatives like random eviction exist but perform worse. The chosen designs reflect tradeoffs between accuracy, complexity, and overhead.
Cache Storage
┌─────────────────────────────┐
│  ┌───────────────┐          │
│  │ Cache Entry 1 │◄─────────┤
│  ├───────────────┤          │
│  │ Cache Entry 2 │◄─────────┤  Metadata
│  ├───────────────┤          │  ┌─────────────┐
│  │ Cache Entry 3 │◄─────────┤  │ Last Access │
│  └───────────────┘          │  │ Count       │
│                             │  │ Expiry      │
│                             │  └─────────────┘
└──────────────┬──────────────┘
               │
               ▼
       Eviction Decision
┌──────────────────────────────┐
│ Use Metadata to Select Entry │
│   to Remove Based on Policy  │
└──────────────────────────────┘
Myth Busters - 4 Common Misconceptions
Quick: Does LRU always remove the oldest data in the cache? Commit yes or no.
Common Belief: LRU removes the oldest data stored in the cache regardless of usage.
Reality: LRU removes the data that was least recently used, not necessarily the oldest stored. Data accessed recently stays even if it was added long ago.
Why it matters: Confusing age with usage can lead to wrong expectations about cache behavior and poor tuning.
Quick: Does LFU always keep the freshest data? Commit yes or no.
Common Belief: LFU keeps the freshest data because it tracks usage frequency.
Reality: LFU tracks how often data is used, not how fresh it is. It can keep stale data if it was popular before but is no longer relevant.
Why it matters: Assuming LFU ensures freshness can cause stale data to persist, harming application correctness.
Quick: Does TTL evict data based on usage patterns? Commit yes or no.
Common Belief: TTL evicts data based on how often it is used.
Reality: TTL evicts data strictly based on time expiration, ignoring usage frequency or recency.
Why it matters: Misunderstanding TTL can lead to unexpected cache misses or stale data if expiration times are not set properly.
Quick: Is it cheap to track exact usage for millions of cache entries? Commit yes or no.
Common Belief: Tracking exact usage for all cache entries is cheap and easy.
Reality: Tracking exact usage or frequency for large caches is costly in memory and CPU, requiring approximations or special data structures.
Why it matters: Ignoring implementation costs can cause performance bottlenecks or system crashes in production.
Expert Zone
1
LRU can be approximated with CLOCK algorithms to reduce overhead while maintaining similar eviction behavior.
2
LFU often requires aging mechanisms to prevent cache pollution by old popular items that are no longer relevant.
3
TTL eviction can be implemented lazily during access or eagerly with timers, each with tradeoffs in complexity and accuracy.
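Point 1 above, the CLOCK approximation of LRU, can be sketched as a circular buffer of slots with reference bits: a rotating hand clears bits as it passes and evicts the first slot whose bit is already 0, so recently touched entries get a "second chance". This is an illustrative sketch (class name and API assumed), not a production implementation:

```python
class ClockCache:
    """Approximate LRU via the CLOCK (second-chance) algorithm."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.slots = []  # each slot is [key, value, ref_bit]
        self.index = {}  # key -> slot position
        self.hand = 0

    def get(self, key):
        pos = self.index.get(key)
        if pos is None:
            return None
        self.slots[pos][2] = 1  # mark as recently referenced
        return self.slots[pos][1]

    def put(self, key, value):
        if key in self.index:
            pos = self.index[key]
            self.slots[pos][1] = value
            self.slots[pos][2] = 1
            return
        if len(self.slots) < self.capacity:
            self.index[key] = len(self.slots)
            self.slots.append([key, value, 1])
            return
        # advance the hand, clearing bits, until a slot with ref_bit 0 is found
        while self.slots[self.hand][2] == 1:
            self.slots[self.hand][2] = 0  # second chance
            self.hand = (self.hand + 1) % self.capacity
        del self.index[self.slots[self.hand][0]]
        self.slots[self.hand] = [key, value, 1]
        self.index[key] = self.hand
        self.hand = (self.hand + 1) % self.capacity

cache = ClockCache(3)
for k, v in [("a", 1), ("b", 2), ("c", 3)]:
    cache.put(k, v)
cache.put("d", 4)  # one full sweep clears all bits, then evicts "a"
cache.get("b")     # sets "b"'s reference bit: a second chance
cache.put("e", 5)  # the hand skips "b" (bit set) and evicts "c"
```

Compared with a true LRU list, CLOCK stores only one bit per entry and touches no shared ordering structure on reads, which is why it scales better in page replacement and large caches.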
When NOT to use
Avoid LRU in workloads with cyclic or scan-heavy access patterns where it performs poorly; prefer LFU or hybrid policies. Avoid LFU when data popularity changes rapidly; TTL or LRU may be better. TTL is unsuitable when data usage patterns matter more than freshness; use LRU or LFU instead.
Production Patterns
Real systems often combine policies, like LFU with TTL, or use segmented caches separating hot and cold data. Distributed caches use consistent hashing with local eviction policies. Approximate counters and sampling reduce overhead. Monitoring eviction rates helps tune policies dynamically.
Connections
Operating System Page Replacement
Similar eviction strategies are used to decide which memory pages to swap out.
Understanding cache eviction helps grasp OS memory management, as both optimize limited fast storage.
Garbage Collection in Programming Languages
Both remove unused or less useful data to free resources based on usage patterns.
Knowing eviction policies clarifies how garbage collectors decide which objects to reclaim.
Inventory Management in Retail
Deciding which products to remove or discount based on sales frequency or shelf life mirrors eviction policies.
Recognizing this connection shows how eviction balances freshness and demand in diverse fields.
Common Pitfalls
#1 Evicting cache entries randomly without a policy.
Wrong approach: When the cache is full, remove any entry without checking usage or time.
Correct approach: Use a defined eviction policy such as LRU, LFU, or TTL to select entries to remove.
Root cause: Failing to recognize that eviction must be strategic to keep the cache effective.
#2 Implementing LRU by scanning all entries to find the least recently used.
Wrong approach: On eviction, loop through the entire cache to find the oldest access time.
Correct approach: Maintain a linked list or queue that tracks usage order for O(1) eviction decisions.
Root cause: Not using proper data structures leads to inefficient eviction and slow cache performance.
#3 Setting TTL values too long or too short without workload analysis.
Wrong approach: Assign a fixed TTL such as 24 hours to all cache entries regardless of the data's nature.
Correct approach: Tune TTL based on data freshness needs and access patterns, possibly varying it per entry type.
Root cause: Ignoring workload characteristics causes stale data or excessive cache misses.
Key Takeaways
Cache eviction policies are essential to keep caches efficient by removing less useful data when space runs out.
LRU evicts data not used recently, LFU evicts data used least often, and TTL evicts data after a set time.
No single eviction policy fits all workloads; understanding tradeoffs helps choose or design the right one.
Efficient implementation of eviction policies requires careful data structures and approximations at scale.
Misunderstanding eviction policies leads to poor cache performance, stale data, or system bottlenecks.