Data Structures Theory · Knowledge · ~15 mins

LRU cache design with hash map and doubly linked list in Data Structures Theory - Deep Dive

Overview - LRU cache design with hash map and doubly linked list
What is it?
An LRU cache is a bounded store that keeps track of how recently each item was used. It holds a limited number of items and evicts the least recently used one when full. This design pairs a hash map for quick access with a doubly linked list that records usage order. Together they make lookups and updates fast while tracking which item to evict next.
Why it matters
Without an LRU cache, programs might waste time and resources by repeatedly loading or computing data that was recently used. This slows down performance and increases costs. The LRU cache solves this by remembering recent data and quickly discarding old, unused data. It makes apps and systems faster and more efficient, especially when memory is limited.
Where it fits
Before learning LRU cache design, you should understand basic data structures like hash maps (for fast lookup) and linked lists (for ordered data). After this, you can explore other cache strategies or advanced memory management techniques. This topic fits into learning about efficient data storage and retrieval in computer science.
Mental Model
Core Idea
An LRU cache uses a hash map for instant access and a doubly linked list to track usage order, so it quickly finds data and removes the least recently used item when full.
Think of it like...
Imagine a small desk drawer where you keep your most used tools. You have a list that shows which tool you used last and which one you haven't touched for a while. When the drawer is full and you need a new tool, you remove the one you haven't used for the longest time.
┌───────────────┐       ┌───────────────┐
│   Hash Map    │──────▶│ Doubly Linked │
│ (key → node)  │       │    List       │
└───────────────┘       └───────────────┘
       │                        ▲
       │                        │
       ▼                        │
  Fast lookup             Tracks usage order
  of cache items          (most recent at head,
                          least recent at tail)
Build-Up - 6 Steps
1. Foundation: Understanding cache basics
Concept: What a cache is and why it helps speed up data access.
A cache is a small, fast storage that keeps copies of data you use often. Instead of fetching data from a slow place every time, you check the cache first. If the data is there (a cache hit), you get it quickly. If not (a cache miss), you fetch it from the slow place and store it in the cache for next time.
Result
You learn why caching improves speed and reduces repeated work.
Understanding caching basics is essential because it explains why we need smart ways to decide what data to keep or remove.
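The hit/miss flow described above can be sketched in a few lines of JavaScript. Here `slowFetch` is a hypothetical stand-in for any expensive operation, such as a database query:

```javascript
// Minimal cache sketch: check the cache first, fall back to the slow path on a miss.
const cache = new Map();
let slowCalls = 0; // counts how often the "slow" path actually runs

function slowFetch(key) {
  slowCalls++;                  // stands in for a database query or heavy computation
  return key.toUpperCase();
}

function getWithCache(key) {
  if (cache.has(key)) {         // cache hit: return the stored copy
    return cache.get(key);
  }
  const value = slowFetch(key); // cache miss: do the slow work once...
  cache.set(key, value);        // ...and remember the result for next time
  return value;
}

getWithCache('a'); // miss: slowFetch runs
getWithCache('a'); // hit: served from the cache, slowFetch does not run again
```

Note this sketch has no size limit yet; deciding what to evict when space runs out is exactly what the LRU policy adds.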
2. Foundation: Basics of hash maps and linked lists
Concept: How hash maps and doubly linked lists work individually.
A hash map stores key-value pairs and lets you find a value quickly by its key. A doubly linked list is a chain of nodes where each node knows its previous and next node, allowing easy insertion and removal from anywhere in the list.
Result
You know how to quickly find data and how to keep data in order with easy updates.
Knowing these data structures separately prepares you to combine their strengths for efficient caching.
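A minimal sketch of each structure on its own (names here are illustrative):

```javascript
// A Map gives O(1) average-time lookup by key.
const index = new Map();
index.set('x', 42);

// A doubly linked list node knows both of its neighbors, so it can be
// unlinked from the middle of a list without scanning from the head.
function makeNode(key, value) {
  return { key, value, prev: null, next: null };
}

const a = makeNode('a', 1);
const b = makeNode('b', 2);
a.next = b; // link a <-> b
b.prev = a;

// Removing b touches only its neighbors' pointers:
if (b.prev) b.prev.next = b.next; // a now skips past b
if (b.next) b.next.prev = b.prev; // (b has no next node here)
```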
3. Intermediate: Combining a hash map with a doubly linked list
🤔 Before reading on: do you think a singly linked list can efficiently support LRU cache operations? Commit to your answer.
Concept: Using a hash map for fast lookup and a doubly linked list to track usage order together.
The hash map stores keys pointing to nodes in the doubly linked list. The list keeps items in order from most recently used at the front to least recently used at the back. When you access or add an item, you move it to the front. When the cache is full, you remove the item at the back.
Result
You get a system where finding, adding, and removing items all happen quickly.
Combining these structures lets you achieve both fast access and quick updates to usage order, which is key for an efficient LRU cache.
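One way to sketch the combination: the map stores list nodes (not just values), and dummy head/tail nodes bracket the usage-ordered list. Names are illustrative:

```javascript
// The Map points at list nodes; dummy head/tail bracket the ordered list.
const head = { prev: null, next: null }; // most recently used side
const tail = { prev: null, next: null }; // least recently used side
head.next = tail;
tail.prev = head;
const map = new Map();                   // key -> node

function addToFront(node) {              // splice in right after head
  node.prev = head;
  node.next = head.next;
  head.next.prev = node;
  head.next = node;
}

function unlink(node) {                  // neighbors skip over the node
  node.prev.next = node.next;
  node.next.prev = node.prev;
}

// Insert two keys, then touch 'a': it moves back to the front.
for (const [k, v] of [['a', 1], ['b', 2]]) {
  const node = { key: k, value: v, prev: null, next: null };
  map.set(k, node);
  addToFront(node);
}
unlink(map.get('a'));     // access 'a': detach it...
addToFront(map.get('a')); // ...and re-insert at the front
```

After the access, `'a'` sits at the head (most recent) and `'b'` has drifted toward the tail, which is exactly the node that would be evicted next.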
4. Intermediate: Handling cache hits and misses
🤔 Before reading on: when a cache miss occurs, do you think the cache always grows in size? Commit to your answer.
Concept: How the cache updates on finding or not finding an item.
On a cache hit, you move the accessed node to the front of the list to mark it as recently used. On a cache miss, you add a new node at the front. If the cache is full, you remove the node at the tail before adding the new one. The hash map updates accordingly.
Result
The cache always keeps the most recent items and removes the oldest when needed.
Understanding this process explains how the cache maintains its size and freshness without slowing down.
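A small sketch of this hit/miss flow, answering the prediction above: eviction happens before insertion, so the size never exceeds the limit. To keep the focus on sizing rather than pointers, this sketch tracks recency via the insertion order a JavaScript Map preserves:

```javascript
const CAPACITY = 2;
const cache = new Map();   // oldest entry first, newest last
const sizesSeen = [];

function access(key) {
  if (cache.has(key)) {                        // hit: refresh recency
    const v = cache.get(key);
    cache.delete(key);
    cache.set(key, v);                         // re-insert as newest
  } else {                                     // miss
    if (cache.size >= CAPACITY) {
      cache.delete(cache.keys().next().value); // evict the LRU key first...
    }
    cache.set(key, key.length);                // ...then insert the new entry
  }
  sizesSeen.push(cache.size);
}

['a', 'b', 'c', 'd', 'b'].forEach(k => access(k));
// Despite repeated misses after filling up, the size never exceeds 2.
```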
5. Advanced: Optimizing node removal and insertion
🤔 Before reading on: do you think removing a node from the middle of a doubly linked list requires traversing from the head? Commit to your answer.
Concept: Using pointers in doubly linked lists to remove or insert nodes in constant time.
Each node has pointers to its previous and next nodes. To remove a node, you update its neighbors to skip it. To insert at the front, you adjust the head pointer and the new node's neighbors. This avoids scanning the list, keeping operations fast.
Result
Cache operations remain efficient even as the cache grows.
Knowing how pointers work in doubly linked lists is crucial for maintaining O(1) time complexity in cache updates.
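A minimal illustration of the constant-time splice: removing a middle node touches only two pointers, with no traversal at all.

```javascript
function makeNode(value) {
  return { value, prev: null, next: null };
}

// Build a three-node chain: n1 <-> n2 <-> n3
const n1 = makeNode(1), n2 = makeNode(2), n3 = makeNode(3);
n1.next = n2; n2.prev = n1;
n2.next = n3; n3.prev = n2;

// Unlink n2 in O(1): its neighbors simply skip over it.
n2.prev.next = n2.next;
n2.next.prev = n2.prev;
```

The cost is the same no matter how long the list is, which is why access and eviction stay fast as the cache grows.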
6. Expert: Handling concurrency and thread safety
🤔 Before reading on: do you think a simple LRU cache design works safely in multi-threaded programs without changes? Commit to your answer.
Concept: Challenges and solutions for using LRU cache in environments with multiple threads accessing it simultaneously.
In multi-threaded programs, simultaneous access can cause race conditions. To prevent this, locks or atomic operations are used to protect the hash map and linked list updates. Advanced designs may use lock-free data structures or partition the cache to reduce contention.
Result
The cache remains correct and efficient even when many threads use it at once.
Understanding concurrency issues is vital for applying LRU caches in real-world, high-performance systems.
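JavaScript is single-threaded per event loop, so the races described above arise most directly in threaded languages; still, the locking idea can be sketched as a promise-based queue that serializes async cache operations. This is an illustrative pattern, not a complete solution:

```javascript
// A promise-chain lock: callers queue up so only one critical section
// touches the cache's internals at a time. (In threaded languages the
// same role is played by a mutex around the map and list updates.)
let queueTail = Promise.resolve(); // end of the queue of pending operations

function withLock(fn) {
  const run = queueTail.then(fn);            // run after all queued callers
  queueTail = run.then(() => {}, () => {});  // keep the chain alive on errors
  return run;
}

const cache = new Map();

async function putSafe(key, value) {
  return withLock(async () => {
    // Everything in here is serialized with respect to other withLock callers,
    // so the map is never observed in a half-updated state.
    cache.set(key, value);
    return cache.size;
  });
}
```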
Under the Hood
The hash map stores keys linked to nodes in the doubly linked list, allowing O(1) access to cache entries. The doubly linked list maintains the order of usage, with the head as most recently used and tail as least. On access, nodes are moved to the head by updating pointers. When capacity is exceeded, the tail node is removed and its key deleted from the hash map. This combination ensures all operations—lookup, insertion, update, and eviction—run in constant time.
Why designed this way?
This design balances speed and order tracking. Hash maps alone provide fast lookup but no order. Linked lists track order but are slow to search. Combining them leverages their strengths. Alternatives like singly linked lists or arrays either slow down operations or complicate updates. The doubly linked list allows quick removal and insertion anywhere, essential for moving nodes on access.
┌───────────────┐          ┌───────────────┐
│   Hash Map    │          │ Doubly Linked │
│  (key → node) │─────────▶│    List       │
└───────────────┘          ├───────────────┤
                           │ Head (MRU)    │
                           │  ↕            │
                           │  ↕            │
                           │ Tail (LRU)    │
                           └───────────────┘

Operations:
- Access key: hash map finds node → move node to head
- Insert key: add node at head, update hash map
- Evict: remove tail node, delete from hash map
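Putting the pieces together, here is one possible sketch of the full design in JavaScript (method names are illustrative):

```javascript
// Map for O(1) lookup; doubly linked list (with dummy head/tail) for usage order.
class LRUCache {
  constructor(capacity) {
    this.capacity = capacity;
    this.map = new Map(); // key -> node
    this.head = {};       // dummy: most recently used side
    this.tail = {};       // dummy: least recently used side
    this.head.next = this.tail;
    this.tail.prev = this.head;
  }

  _unlink(node) {         // O(1): neighbors skip over the node
    node.prev.next = node.next;
    node.next.prev = node.prev;
  }

  _addToFront(node) {     // O(1): splice in right after the dummy head
    node.prev = this.head;
    node.next = this.head.next;
    this.head.next.prev = node;
    this.head.next = node;
  }

  get(key) {
    const node = this.map.get(key);
    if (!node) return undefined;  // miss
    this._unlink(node);           // hit: mark as most recently used
    this._addToFront(node);
    return node.value;
  }

  put(key, value) {
    let node = this.map.get(key);
    if (node) {
      node.value = value;             // update an existing entry
      this._unlink(node);
    } else {
      if (this.map.size >= this.capacity) {
        const lru = this.tail.prev;   // evict the least recently used node
        this._unlink(lru);
        this.map.delete(lru.key);     // keep the map in sync (see Pitfall #2)
      }
      node = { key, value };
      this.map.set(key, node);
    }
    this._addToFront(node);           // either way, it is now most recent
  }
}
```

Every public operation does a constant number of map operations and pointer updates, which is where the O(1) guarantee comes from.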
Myth Busters - 4 Common Misconceptions
Quick: Does the LRU cache always remove the oldest inserted item? Commit to yes or no.
Common Belief: LRU cache removes the oldest item added to the cache.
Reality: LRU cache removes the least recently used item, which may not be the oldest inserted if it was accessed recently.
Why it matters: Confusing insertion time with usage time can lead to wrong assumptions about cache behavior and poor performance tuning.
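A quick demonstration of the difference. To stay short, this sketch tracks recency via a JavaScript Map's insertion order rather than a linked list:

```javascript
const lru = new Map();
const CAPACITY = 2;

function touch(key) {            // record an access: re-insert as newest
  const value = lru.get(key);
  lru.delete(key);
  lru.set(key, value);
}

function put(key, value) {
  if (lru.size >= CAPACITY && !lru.has(key)) {
    const least = lru.keys().next().value; // evict least recently *used*
    lru.delete(least);
  }
  lru.set(key, value);
}

put('a', 1);
put('b', 2);
touch('a');  // 'a' is the oldest *inserted*, but now the most recently used
put('c', 3); // eviction removes 'b', not 'a'
```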
Quick: Can a singly linked list efficiently support all LRU cache operations? Commit to yes or no.
Common Belief: A singly linked list is enough to track usage order in an LRU cache.
Reality: A singly linked list cannot remove nodes from the middle efficiently because it lacks backward pointers, making operations slower.
Why it matters: Using a singly linked list causes cache operations to become slower than expected, hurting performance.
Quick: Does the hash map alone maintain the order of usage in an LRU cache? Commit to yes or no.
Common Belief: The hash map keeps track of usage order by itself.
Reality: Hash maps do not maintain order; the doubly linked list is needed to track usage order.
Why it matters: Relying only on hash maps leads to inability to identify which item to evict, breaking the LRU policy.
Quick: Is the basic LRU cache design safe to use as-is in multi-threaded programs? Commit to yes or no.
Common Belief: The simple LRU cache design works safely in multi-threaded environments without changes.
Reality: Without synchronization, concurrent access can corrupt the cache's data structures, causing errors.
Why it matters: Ignoring thread safety can cause crashes or incorrect cache behavior in real applications.
Expert Zone
1. The order of operations when moving nodes matters; removing before inserting prevents temporary list corruption.
2. Using dummy head and tail nodes simplifies edge cases in the doubly linked list, reducing bugs.
3. Eviction policies can be combined with LRU, like time-based expiration, for more complex cache behavior.
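A sketch of why the dummy-node trick (point 2 above) removes edge cases: with sentinels, every real node always has non-null neighbors, so unlinking needs no null checks or special handling of "first" and "last" positions.

```javascript
function makeList() {
  const head = {}, tail = {}; // sentinels carry no data
  head.next = tail;
  tail.prev = head;
  return { head, tail };
}

function unlink(node) {
  // Safe even for the only real node in the list: its neighbors are
  // the sentinels, never null, so no branching is required.
  node.prev.next = node.next;
  node.next.prev = node.prev;
}

const { head, tail } = makeList();
const only = { key: 'x', prev: head, next: tail };
head.next = only;
tail.prev = only;
unlink(only); // no edge-case branch needed; list is cleanly empty again
```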
When NOT to use
LRU caches are not ideal when access patterns have little temporal locality (for example, a large sequential scan that cycles through more items than the cache can hold) or when the cost of eviction is high. Alternatives like LFU (Least Frequently Used) or ARC (Adaptive Replacement Cache) may perform better in those cases.
Production Patterns
In real systems, LRU caches often include size limits in bytes, not just item counts, and integrate with persistent storage. They also use locks or atomic operations for thread safety and may shard caches to reduce contention.
Connections
Operating System Page Replacement
LRU cache design is the basis for page replacement algorithms in OS memory management.
Understanding LRU cache helps grasp how operating systems decide which memory pages to swap out, improving system performance.
Database Indexing
Both use hash maps for fast lookup, but databases add complex structures for range queries.
Knowing LRU cache design clarifies how fast key-based lookups work, which is foundational for database indexing.
Human Short-Term Memory
LRU cache mimics how humans tend to remember recent information and forget older, unused details.
This cross-domain link shows how computer memory management can be inspired by cognitive science principles.
Common Pitfalls
#1 Removing a node by searching from the head each time.
Wrong approach: function removeNode(key) { let current = head; while (current !== null) { if (current.key === key) { /* remove node */ break; } current = current.next; } }
Correct approach: function removeNode(node) { node.prev.next = node.next; node.next.prev = node.prev; }
Root cause: Not using the hash map to get the node directly causes inefficient O(n) removal instead of O(1).
#2 Not updating the hash map when evicting the least recently used item.
Wrong approach: function evict() { let tailNode = tail.prev; removeNode(tailNode); /* forgot to delete from hash map */ }
Correct approach: function evict() { let tailNode = tail.prev; removeNode(tailNode); hashMap.delete(tailNode.key); }
Root cause: Forgetting to keep the hash map in sync leads to stale entries and incorrect cache state.
#3 Using a singly linked list and trying to remove nodes from the middle.
Wrong approach: function removeNode(node) { /* no prev pointer, must traverse from head */ let current = head; while (current.next !== node) { current = current.next; } current.next = node.next; }
Correct approach: Use a doubly linked list where each node has prev and next pointers for O(1) removal.
Root cause: Choosing the wrong data structure causes inefficient operations and complexity.
Key Takeaways
An LRU cache combines a hash map and a doubly linked list to achieve fast access and efficient tracking of usage order.
The hash map provides O(1) lookup by key, while the doubly linked list maintains the order from most to least recently used.
On cache hits, items move to the front of the list; on misses, new items are added and the least used item is removed if full.
Proper pointer updates in the doubly linked list ensure all operations run in constant time, critical for performance.
In multi-threaded environments, additional synchronization is needed to keep the cache consistent and safe.