
work_mem and effective_cache_size Tuning in PostgreSQL - Deep Dive

Overview - work_mem and effective_cache_size tuning
What is it?
work_mem and effective_cache_size are two key PostgreSQL settings that govern how much memory the database uses for query operations and how it models caching. work_mem caps the memory available to each internal operation, such as a sort or a hash build, during query execution. effective_cache_size is the planner's estimate of how much memory is available for caching data; it helps the planner decide the best way to run a query. Together, they let PostgreSQL balance memory use against speed.
Why it matters
Without tuning work_mem and effective_cache_size, PostgreSQL might use too little memory, causing slow queries due to excessive disk access, or too much memory, leading to system crashes or swapping. Proper tuning improves query speed, reduces server load, and makes the database more responsive, which is crucial for applications that rely on fast data access.
Where it fits
Before tuning these settings, you should understand basic PostgreSQL configuration and how queries work. After learning this, you can explore advanced performance tuning topics like autovacuum tuning, indexing strategies, and parallel query execution.
Mental Model
Core Idea
work_mem controls memory for individual query operations, while effective_cache_size tells the planner how much cache memory is likely available so it can optimize query plans.
Think of it like...
Imagine a kitchen where work_mem is the size of your cutting board for preparing each dish, and effective_cache_size is how much pantry space you have to keep ingredients handy. A bigger cutting board lets you prepare ingredients faster, and knowing your pantry size helps you plan meals efficiently.
┌────────────────────────────────────────────┐
│             PostgreSQL Memory              │
├─────────────────────┬──────────────────────┤
│      work_mem       │ effective_cache_size │
│ (per-operation      │ (planner estimate of │
│  memory for sorts,  │  OS cache memory)    │
│  hashes)            │                      │
└─────────────────────┴──────────────────────┘

Query Execution:
[Query] → uses work_mem for sorting/hashing
Planner → uses effective_cache_size to choose plan
Build-Up - 7 Steps
1
Foundation: Understanding work_mem basics
Concept: Introduce work_mem as memory for query operations like sorting and hashing.
The work_mem setting defines how much memory PostgreSQL may use for an internal operation within a single query step. When sorting data or building hash tables, PostgreSQL holds the data in memory up to work_mem; if the data does not fit, it spills to temporary files on disk, which slows the query down.
Result
Queries that require sorting or hashing run faster when work_mem is set appropriately, reducing disk I/O.
Understanding that work_mem applies per operation makes clear that a complex query with many operations can consume several multiples of it if the setting is too high.
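A minimal sketch of this effect, using a throwaway table built from generate_series (sizes are illustrative; exact plan output varies by version and hardware):

```sql
-- Build ~1M rows of sortable data in a temp table.
CREATE TEMP TABLE t AS
SELECT g AS id, md5(g::text) AS payload
FROM generate_series(1, 1000000) g;

-- With a tiny work_mem, the sort spills to disk:
SET work_mem = '1MB';
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM t ORDER BY payload;
-- Look for:  Sort Method: external merge  Disk: ...kB

-- With enough work_mem, the same sort stays in RAM:
SET work_mem = '256MB';
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM t ORDER BY payload;
-- Look for:  Sort Method: quicksort  Memory: ...kB
```

The "Sort Method" line in EXPLAIN ANALYZE output is the quickest way to see whether a given query fits in work_mem.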
2
Foundation: Grasping the role of effective_cache_size
Concept: Explain effective_cache_size as an estimate of OS-level cache available for PostgreSQL.
effective_cache_size tells PostgreSQL's query planner how much memory is likely available for caching data pages in the operating system's file system cache. It does not allocate memory; it only helps the planner guess whether data will be found in memory or will need disk access. A higher effective_cache_size encourages the planner to use index scans and join methods that assume data is cached.
Result
The planner makes better decisions about query plans, improving performance by choosing faster methods when enough cache is available.
Knowing effective_cache_size is an estimate, not a memory limit, prevents confusion about its role in memory allocation.
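A short sketch showing that the setting is only a cost-model input, not an allocation (the orders table and customer_id column are hypothetical placeholders):

```sql
SHOW effective_cache_size;           -- e.g. 4GB (the default)

-- Session-level change: no memory is reserved by this statement;
-- only the planner's cost estimates shift.
SET effective_cache_size = '12GB';

EXPLAIN SELECT * FROM orders WHERE customer_id = 42;
-- With a larger value, index scans look cheaper relative to
-- sequential scans, so the chosen plan may change.
```

Because it is session-settable, you can safely experiment with different values and compare EXPLAIN output before changing anything globally.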
3
Intermediate: Balancing work_mem for query complexity
🤔 Before reading on: do you think setting work_mem very high always improves query speed? Commit to your answer.
Concept: Discuss how work_mem is used per operation and the risk of setting it too high.
Each query can have multiple operations that use work_mem separately. For example, a query with several sorts or joins can multiply the memory usage by the number of operations. Setting work_mem too high risks exhausting server memory if many queries run simultaneously. The goal is to find a balance where queries run efficiently without causing memory pressure.
Result
Properly balanced work_mem improves query speed without risking server stability.
Understanding that work_mem multiplies per operation and per concurrent query is key to avoiding out-of-memory errors.
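One query can contain several nodes that each draw on work_mem independently. A sketch, with illustrative table and column names:

```sql
-- Each Sort/Hash node in the plan below may use up to work_mem on its own.
EXPLAIN
SELECT c.region, o.status, count(*)
FROM orders o
JOIN customers c ON c.id = o.customer_id   -- Hash (join build): work_mem
GROUP BY c.region, o.status                -- HashAggregate: work_mem
ORDER BY count(*) DESC;                    -- Sort: work_mem
-- With work_mem = 100MB, this single query could touch roughly 300MB,
-- and N concurrent copies multiply that again.
```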
4
Intermediate: Using effective_cache_size to guide the planner
🤔 Before reading on: does effective_cache_size allocate memory or just inform the planner? Commit to your answer.
Concept: Clarify that effective_cache_size is a planner hint, not a memory allocation.
effective_cache_size helps the planner estimate how much data is likely cached by the OS. If set too low, the planner may avoid index scans in favor of slower sequential scans; if set too high, it may assume data is cached when it is not and choose plans that perform poorly. A common starting point is about one half to three quarters of total RAM.
Result
The planner chooses query plans that better match actual cache availability, improving performance.
Knowing effective_cache_size is a hint prevents misconfigurations that cause inefficient query plans.
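To apply the guideline persistently, a sketch for a dedicated 16 GB server (the 12GB figure is illustrative, roughly 75% of RAM):

```sql
-- ALTER SYSTEM writes the value to postgresql.auto.conf.
ALTER SYSTEM SET effective_cache_size = '12GB';
SELECT pg_reload_conf();     -- planner parameter: reload suffices, no restart
SHOW effective_cache_size;   -- confirm the new value is in effect
```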
5
Advanced: Tuning work_mem for concurrent workloads
🤔 Before reading on: do you think work_mem should be the same for all workloads? Commit to your answer.
Concept: Explain how workload concurrency affects work_mem tuning.
In systems with many simultaneous queries, total memory used by work_mem can be very large if set too high. For example, 100 concurrent queries each using 10MB work_mem can consume 1GB of RAM just for query operations. Monitoring typical concurrency and adjusting work_mem accordingly prevents memory exhaustion. Sometimes, setting work_mem lower and optimizing queries is better than increasing it blindly.
Result
Memory usage stays within safe limits while maintaining query performance under load.
Recognizing the impact of concurrency on memory usage helps avoid server crashes and slowdowns.
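One common pattern is a conservative global default plus a larger per-role override for low-concurrency work; a sketch (the reporting role name is illustrative):

```sql
-- Budget check: ~100 concurrent queries x 10MB work_mem ≈ 1GB just for
-- sorts/hashes, and each query may use several multiples of work_mem.
ALTER SYSTEM SET work_mem = '10MB';           -- safe global default
ALTER ROLE reporting SET work_mem = '256MB';  -- roomier for a role that
                                              -- runs few, heavy queries
SELECT pg_reload_conf();
```

Per-role (or per-database) overrides let a handful of analytical sessions sort in memory without multiplying that allowance across every OLTP connection.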
6
Advanced: Estimating effective_cache_size correctly
Concept: Teach how to estimate effective_cache_size based on system memory and OS cache behavior.
effective_cache_size should reflect the amount of memory the OS can devote to caching PostgreSQL data files. On a dedicated database server this is most of the RAM minus what the OS and other processes need; on a shared server it should be lower. Linux tools like free or vmstat help estimate the current cache size. Setting this value too high or too low misguides the planner.
Result
Better query plans that match real cache availability, improving overall performance.
Knowing how to estimate effective_cache_size based on actual OS cache usage leads to more accurate planner decisions.
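A sketch of turning that observation into configuration (the numbers are illustrative, not a recommendation):

```sql
-- Suppose `free -m` on a dedicated 16 GB Linux server shows roughly
-- 12000 MB sitting in buff/cache. Feed that observation back to the
-- planner as its cache estimate:
ALTER SYSTEM SET effective_cache_size = '12GB';
SELECT pg_reload_conf();
-- On a shared host, subtract what other services' working sets consume
-- before choosing the value.
```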
7
Expert: Surprising effects of the work_mem and cache interplay
🤔 Before reading on: do you think increasing work_mem always reduces disk I/O? Commit to your answer.
Concept: Reveal how work_mem and effective_cache_size interact in unexpected ways affecting query plans and disk usage.
Increasing work_mem reduces disk usage for sorting and hashing but can lead the planner to choose plans that assume more data fits in memory. If effective_cache_size is set too low, the planner may avoid index scans even if work_mem is large, causing more disk reads. Conversely, a high effective_cache_size with low work_mem can cause inefficient plans that spill to disk. Balancing both is crucial. Also, some operations like hash joins use work_mem differently, affecting memory pressure.
Result
Understanding this interplay helps avoid tuning traps that degrade performance despite increasing memory settings.
Knowing the subtle interaction between these settings prevents common tuning mistakes and leads to optimal query performance.
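The interplay can be explored safely inside a transaction, since SET LOCAL resets at commit or rollback; a sketch with hypothetical table and column names:

```sql
BEGIN;
SET LOCAL work_mem = '4MB';
SET LOCAL effective_cache_size = '512MB';  -- pessimistic cache estimate
EXPLAIN SELECT * FROM orders WHERE customer_id = 42;
-- May pick a sequential scan: random index I/O looks expensive.

SET LOCAL effective_cache_size = '12GB';   -- optimistic cache estimate
EXPLAIN SELECT * FROM orders WHERE customer_id = 42;
-- May switch to an index scan: the planner now assumes pages are cached.
ROLLBACK;  -- both SET LOCAL values are discarded here
```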
Under the Hood
Work_mem allocates memory for each sort or hash operation during query execution. If the data fits in this memory, operations happen in RAM; otherwise, PostgreSQL writes temporary files to disk. Effective_cache_size is a planner parameter that estimates how much data the OS cache can hold, influencing the planner's choice between sequential scans, index scans, and join methods. The planner uses this estimate to predict query costs and select the fastest plan.
Why designed this way?
PostgreSQL separates these settings to give fine control over memory use and planning. Work_mem controls actual memory allocation per operation to prevent overuse, while effective_cache_size is a heuristic to guide planning without reserving memory. This design balances flexibility, safety, and performance, allowing administrators to tune based on workload and hardware.
┌────────────────────────────────────────────┐
│              Query Execution               │
├─────────────────────┬──────────────────────┤
│      work_mem       │ effective_cache_size │
│ (memory used per    │ (planner's estimate  │
│  operation for      │  of OS cache memory) │
│  sorting, hashing)  │                      │
└──────────┬──────────┴──────────┬───────────┘
           │                     │
           ▼                     ▼
   ┌───────────────┐     ┌───────────────┐
   │ In-memory     │     │ Planner uses  │
   │ operations or │     │ estimate to   │
   │ temp files on │     │ choose query  │
   │ disk          │     │ plan          │
   └───────────────┘     └───────────────┘
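Two ways to observe the work_mem spill path described above, using built-in statistics views (no extensions needed):

```sql
-- Log every temporary file a query writes, with its size in kB:
SET log_temp_files = 0;

-- Cumulative spill volume per database since last stats reset:
SELECT datname, temp_files, temp_bytes
FROM pg_stat_database
WHERE temp_bytes > 0;
```

Rising temp_bytes for a database is a strong hint that some workload there would benefit from a higher work_mem or from query rewrites.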
Myth Busters - 4 Common Misconceptions
Quick: Does increasing work_mem always improve query speed? Commit to yes or no.
Common Belief: Increasing work_mem always makes queries faster because more memory means less disk usage.
Reality: Increasing work_mem too much can cause excessive total memory use when many queries run concurrently, leading to swapping or crashes.
Why it matters: Ignoring concurrency effects can cause server instability and worse performance despite higher work_mem.
Quick: Does effective_cache_size allocate memory for caching? Commit to yes or no.
Common Belief: effective_cache_size reserves memory for caching data in PostgreSQL.
Reality: effective_cache_size is only a planner hint; it does not allocate or reserve memory.
Why it matters: Misunderstanding this leads to setting it incorrectly, causing poor query plans and performance.
Quick: If effective_cache_size is set too high, will queries always be faster? Commit to yes or no.
Common Belief: Setting effective_cache_size very high always improves query plans and speeds up queries.
Reality: An overestimated effective_cache_size can cause the planner to choose plans that assume data is cached when it is not, leading to slow queries.
Why it matters: Overestimating cache size can degrade performance by causing inefficient query plans.
Quick: Does work_mem apply to the entire query or per operation? Commit to one.
Common Belief: work_mem is a total memory limit for the whole query.
Reality: work_mem applies per operation (sort, hash) within a query, so total memory used can be much higher.
Why it matters: Underestimating total memory use can cause unexpected memory exhaustion.
Expert Zone
1
work_mem is allocated per operation and per parallel worker, so parallel queries can multiply memory usage unexpectedly.
2
effective_cache_size should account not only for PostgreSQL but also for other processes and OS cache behavior, which can vary with workload.
3
Some query plans use work_mem differently, for example, hash joins vs. sorts, affecting how memory pressure manifests.
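The parallel-worker multiplier from point 1 can be sketched like this (table and column names are illustrative):

```sql
-- A parallel sort with 4 workers plus the leader can use up to about
-- (workers + 1) x work_mem for that one Sort node alone.
SET max_parallel_workers_per_gather = 4;
SET work_mem = '64MB';
EXPLAIN (ANALYZE)
SELECT * FROM big_table ORDER BY some_col;
-- Budget roughly 5 x 64MB = 320MB for this node when planning capacity.
```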
When NOT to use
Avoid setting very high work_mem on systems with many concurrent connections; instead, optimize queries or use connection pooling. Effective_cache_size is not useful on systems with unpredictable OS cache behavior, such as virtualized environments with limited control; in such cases, rely more on monitoring and adaptive query tuning.
Production Patterns
In production, DBAs monitor query plans and memory usage, adjusting work_mem per workload type (e.g., higher for reporting queries, lower for OLTP). Effective_cache_size is set based on server RAM and OS cache behavior, often around 50-75% of total RAM. Tools like pg_stat_statements help identify queries benefiting from work_mem tuning.
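A sketch of the triage query DBAs often start from, assuming the pg_stat_statements extension mentioned above is installed:

```sql
-- Statements writing the most temp blocks are the best candidates
-- for a targeted work_mem increase (or a query rewrite).
SELECT query, calls, temp_blks_written
FROM pg_stat_statements
ORDER BY temp_blks_written DESC
LIMIT 10;
```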
Connections
Operating System File Cache
effective_cache_size models OS file cache behavior
Understanding OS caching helps grasp why effective_cache_size is a planner hint, not a memory allocation.
Query Planner Cost Estimation
effective_cache_size influences cost estimates in the planner
Knowing how the planner estimates costs clarifies why tuning effective_cache_size changes query plans.
Resource Management in Operating Systems
work_mem tuning relates to managing limited memory resources among processes
Understanding OS resource management helps appreciate why per-operation memory limits prevent system overload.
Common Pitfalls
#1Setting work_mem too high without considering concurrency
Wrong approach: SET work_mem = '500MB'; -- for all sessions, without checking concurrency
Correct approach: SET work_mem = '50MB'; -- balanced for the expected concurrency
Root cause: Misunderstanding that work_mem applies per operation and per query, leading to excessive total memory use.
#2Setting effective_cache_size too low causing poor plans
Wrong approach: SET effective_cache_size = '128MB'; -- too low for a server with 16GB RAM
Correct approach: SET effective_cache_size = '12GB'; -- realistic estimate of OS cache
Root cause: Not realizing effective_cache_size guides planner decisions, so too-low values cause inefficient query plans.
#3Confusing effective_cache_size with actual memory allocation
Wrong approach: Expecting effective_cache_size to reserve memory and adjusting other settings accordingly
Correct approach: Treat effective_cache_size as a planner hint only, not a memory reservation
Root cause:Misunderstanding the role of effective_cache_size in PostgreSQL's memory management.
Key Takeaways
work_mem controls memory used for individual query operations like sorting and hashing, and is allocated per operation and per query.
effective_cache_size is a planner hint estimating how much OS cache is available, guiding query plan choices but not allocating memory.
Setting work_mem too high without considering concurrency risks exhausting server memory and causing crashes.
Setting effective_cache_size too low or too high misguides the planner, leading to inefficient query plans and slower queries.
Balancing work_mem and effective_cache_size based on workload and system memory is key to optimizing PostgreSQL performance.