Overview - Table capacity modes (on-demand vs provisioned)

What is it?

Table capacity modes in DynamoDB determine how the database handles read and write throughput. There are two main modes: on-demand and provisioned. On-demand mode automatically adjusts capacity based on traffic, while provisioned mode requires you to specify capacity ahead of time. This choice affects cost, performance, and scalability.

Why it matters

Choosing the right capacity mode helps control costs and ensures your application runs smoothly. Without these modes, you might pay too much for unused capacity or face slowdowns during traffic spikes. Proper capacity management prevents wasted resources and keeps user experience consistent.

Where it fits

Before learning about capacity modes, you should understand basic DynamoDB concepts like tables, items, and throughput. After this, you can explore advanced topics like auto-scaling, cost optimization, and performance tuning.

Mental Model

Core Idea

Table capacity modes decide whether DynamoDB automatically adjusts throughput or requires manual capacity settings to balance cost and performance.

Think of it like...

It's like choosing between a taxi that waits and charges by the ride (on-demand) versus renting a car with a fixed monthly fee regardless of how much you drive (provisioned).

┌───────────────────────────────┐
│       Table Capacity Modes     │
├───────────────┬───────────────┤
│ On-Demand     │ Provisioned   │
├───────────────┼───────────────┤
│ Auto scales   │ Fixed capacity│
│ Pay per use   │ Pay fixed rate│
│ Handles spikes│ Needs planning│
└───────────────┴───────────────┘

Build-Up - 7 Steps

1

FoundationUnderstanding DynamoDB Throughput Basics

Concept: Learn what throughput means in DynamoDB and why it matters.

Throughput is how much data your table can read or write per second. DynamoDB measures this in Read Capacity Units (RCUs) and Write Capacity Units (WCUs). Each RCU lets you read up to 4 KB per second, and each WCU lets you write up to 1 KB per second. If your app tries to read or write more than your capacity, requests get throttled (slowed down).

Result

You understand that throughput limits how fast your app can access data and that exceeding it causes delays.

Knowing throughput basics helps you see why capacity modes exist: to manage these limits and avoid slowdowns.

2

FoundationWhat Is Provisioned Capacity Mode?

3

IntermediateHow On-Demand Capacity Mode Works

4

IntermediateComparing Costs Between Modes

5

IntermediateWhen to Use Auto-Scaling with Provisioned Mode

6

AdvancedLimits and Throttling Behavior in Each Mode

7

ExpertChoosing Capacity Modes for Global and Multi-Region Tables

Under the Hood

DynamoDB capacity modes control how the service allocates resources for read and write operations. Provisioned mode reserves fixed throughput capacity on backend servers, ensuring predictable performance but requiring manual adjustments. On-demand mode uses a serverless model that dynamically allocates resources per request, scaling instantly but with internal soft limits to protect system stability.

Why designed this way?

Provisioned mode was the original design to give users control and predictable billing. On-demand was introduced later to simplify scaling and handle unpredictable workloads without manual intervention. The tradeoff is between cost predictability and operational simplicity. AWS designed on-demand to protect backend stability with soft limits, balancing flexibility and reliability.

┌───────────────┐       ┌─────────────────────┐
│ Client App    │──────▶│ DynamoDB Frontend    │
└───────────────┘       └─────────┬───────────┘
                                   │
               ┌───────────────────┴───────────────────┐
               │                                       │
       ┌───────▼────────┐                     ┌────────▼────────┐
       │ Provisioned     │                     │ On-Demand       │
       │ Capacity Layer  │                     │ Capacity Layer  │
       │ (Fixed RCUs/WCUs)│                    │ (Dynamic scaling)│
       └─────────────────┘                     └─────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does on-demand mode guarantee zero throttling? Commit yes or no.

Common Belief:On-demand mode never throttles because it scales automatically.

Tap to reveal reality

Quick: Is provisioned mode always cheaper for high traffic? Commit yes or no.

Common Belief:Provisioned mode is always cheaper for high traffic workloads.

Tap to reveal reality

Quick: Can you switch capacity modes instantly without downtime? Commit yes or no.

Common Belief:You can switch between on-demand and provisioned modes instantly without impact.

Tap to reveal reality

Quick: Does auto-scaling eliminate all manual capacity management? Commit yes or no.

Common Belief:Auto-scaling removes the need to plan capacity limits in provisioned mode.

Tap to reveal reality

Expert Zone

1

Provisioned capacity mode allows fine-tuned control over throughput but requires careful monitoring to avoid throttling or wasted capacity.

2

On-demand mode's internal soft limits protect DynamoDB's stability but can cause throttling during very rapid traffic spikes, which is often overlooked.

3

Global tables can have mixed capacity modes per region, requiring complex cost and performance balancing strategies.

When NOT to use

Avoid on-demand mode for consistently high and predictable workloads where provisioned mode with auto-scaling is more cost-effective. Avoid provisioned mode for unpredictable or very spiky workloads where manual capacity planning is impractical. For extremely latency-sensitive applications, consider provisioned mode for guaranteed throughput.

Production Patterns

Many production systems start with on-demand mode during development or unpredictable traffic phases, then switch to provisioned with auto-scaling for cost savings at scale. Global applications often use provisioned mode in primary regions and on-demand in less critical regions to balance cost and availability.

Connections

Serverless Computing

On-demand capacity mode is a serverless model that abstracts resource management.

Understanding serverless principles helps grasp how on-demand mode automatically scales without manual intervention.

Cloud Cost Optimization

Capacity modes directly impact cloud spending and budgeting strategies.

Knowing capacity modes aids in designing cost-efficient cloud architectures and avoiding surprise bills.

Traffic Engineering in Networks

Managing capacity modes is similar to controlling bandwidth allocation and handling traffic bursts in networks.

Recognizing this connection helps apply traffic shaping and load balancing concepts to database throughput management.

Common Pitfalls

#1Setting provisioned capacity too low causes throttling during traffic spikes.

Wrong approach:CREATE TABLE MyTable (id STRING) WITH PROVISIONED_CAPACITY (read=5, write=5);

Correct approach:CREATE TABLE MyTable (id STRING) WITH PROVISIONED_CAPACITY (read=50, write=50);

Root cause:Underestimating traffic volume leads to insufficient capacity and request throttling.

#2Switching capacity mode without planning causes temporary errors.

Wrong approach:ALTER TABLE MyTable SET CAPACITY_MODE = 'ON_DEMAND'; -- done instantly without checks

Correct approach:Plan switch during low traffic, monitor for throttling, and allow time for mode change to complete.

Root cause:Ignoring transition time and impact causes unexpected downtime or throttling.

#3Using on-demand mode for steady high traffic leads to high costs.

Wrong approach:CREATE TABLE MyTable WITH ON_DEMAND_CAPACITY; -- for a table with constant heavy traffic

Correct approach:CREATE TABLE MyTable WITH PROVISIONED_CAPACITY (read=1000, write=1000) AND AUTO_SCALING;

Root cause:Not matching capacity mode to traffic pattern causes unnecessary expenses.

Key Takeaways

DynamoDB capacity modes control how throughput is managed and billed, balancing cost and performance.

Provisioned mode requires manual capacity settings and is best for predictable workloads with steady traffic.

On-demand mode automatically scales capacity and suits unpredictable or spiky workloads but can cost more for steady high traffic.

Auto-scaling in provisioned mode helps adjust capacity automatically within limits but still needs planning.

Understanding throttling limits and mode switching impacts is essential to avoid downtime and optimize costs.