Overview - Single-table design methodology

What is it?

Single-table design is a way of organizing data in one table instead of many. Items of different types (users, orders, products) live side by side, distinguished and grouped by structured key values and attributes. Related items can then be fetched together in a single request instead of being joined across tables. The approach is most closely associated with Amazon DynamoDB, a managed NoSQL database service.

Why it matters

DynamoDB has no join operation, so an application that spreads related data across many tables has to issue several requests and combine the results itself, which adds latency and read cost. Single-table design addresses this by keeping related items under shared keys so that one query can return them together. Fewer round trips help applications respond quickly under heavy load and can lower cost.

Where it fits

Before learning single-table design, you should understand basic database concepts like tables, keys, and queries. After mastering it, you can explore advanced DynamoDB features like secondary indexes, transactions, and data modeling for complex applications.

Mental Model

Core Idea

Single-table design stores all related entity types in one table and uses structured keys to find and link items without joins.

Think of it like...

Imagine a well-organized filing cabinet where all documents are stored in one drawer, but each document has a clear label and folder code so you can find any paper quickly without opening multiple drawers.

┌───────────────────────────────┐
│         Single Table          │
├─────────────┬───────────────┤
│ PartitionKey│ SortKey       │
├─────────────┼───────────────┤
│ User#123    │ Profile#123   │
│ User#123    │ Order#456     │
│ User#123    │ Address#789   │
│ Product#001 │ Info#Product  │
│ Product#001 │ Review#User123│
└─────────────┴───────────────┘
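
The layout above can be sketched in plain Python. This is an in-memory stand-in rather than a DynamoDB client: the `TABLE` list, the `query` helper, and the extra attributes (`name`, `total`, and so on) are illustrative assumptions, and only the key values come from the diagram.

```python
# In-memory stand-in for the single table in the diagram above.
# Attributes other than the two keys are made up for illustration.
TABLE = [
    {"PartitionKey": "User#123", "SortKey": "Profile#123", "name": "Ada"},
    {"PartitionKey": "User#123", "SortKey": "Order#456", "total": 42},
    {"PartitionKey": "User#123", "SortKey": "Address#789", "city": "Oslo"},
    {"PartitionKey": "Product#001", "SortKey": "Info#Product", "title": "Lamp"},
    {"PartitionKey": "Product#001", "SortKey": "Review#User123", "stars": 5},
]


def query(table, partition_key, sort_key_prefix=""):
    """Return items sharing one partition key, optionally narrowed by a
    sort-key prefix, ordered by sort key (mimics a key-condition query)."""
    matches = [
        item for item in table
        if item["PartitionKey"] == partition_key
        and item["SortKey"].startswith(sort_key_prefix)
    ]
    return sorted(matches, key=lambda item: item["SortKey"])


# Everything stored under user 123 in one lookup, no join needed.
user_items = query(TABLE, "User#123")
# Only that user's orders.
user_orders = query(TABLE, "User#123", "Order#")
```

Against a real table, the second call corresponds to a Query whose key condition combines equality on the partition key with `begins_with` on the sort key.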

Build-Up - 7 Steps

1

Foundation: Understanding DynamoDB Basics

Concept: Learn what DynamoDB tables, partition keys, and sort keys are.

DynamoDB stores data in tables. Each table has a primary key made of a partition key and, optionally, a sort key. A hash of the partition key determines where an item is stored, and the sort key orders the items that share a partition key value. Together they let DynamoDB locate items without scanning the table.

Result

You know how DynamoDB organizes data and how keys help locate items quickly.

Understanding keys is essential because single-table design relies on using keys cleverly to store and retrieve different data types efficiently.
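
As a rough illustration of a partition key plus sort key, here is a key schema written as a plain dictionary in the shape of DynamoDB's CreateTable parameters. The table name `AppTable` and the attribute names `PK` and `SK` are hypothetical choices for this sketch, and nothing is sent to AWS.

```python
# Parameters in the shape accepted by DynamoDB's CreateTable operation.
# "AppTable", "PK", and "SK" are hypothetical names chosen for this sketch.
create_table_params = {
    "TableName": "AppTable",
    "AttributeDefinitions": [
        {"AttributeName": "PK", "AttributeType": "S"},  # S = string
        {"AttributeName": "SK", "AttributeType": "S"},
    ],
    "KeySchema": [
        {"AttributeName": "PK", "KeyType": "HASH"},   # partition key
        {"AttributeName": "SK", "KeyType": "RANGE"},  # sort key
    ],
    "BillingMode": "PAY_PER_REQUEST",  # on-demand capacity
}


def key_roles(params):
    """Map each key attribute to its role in the primary key."""
    roles = {"HASH": "partition key", "RANGE": "sort key"}
    return {k["AttributeName"]: roles[k["KeyType"]] for k in params["KeySchema"]}
```

With boto3 installed and credentials configured, these parameters could be passed as `boto3.client("dynamodb").create_table(**create_table_params)`.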

2

Foundation: Why Multiple Tables Can Slow You Down

3

Intermediate: Designing Composite Keys for Multiple Entities

4

Intermediate: Using Secondary Indexes to Access Data Differently

5

Intermediate: Modeling Relationships Without Joins

6

Advanced: Handling Access Patterns and Query Efficiency

7

Expert: Surprising Limits and Trade-offs of Single-Table Design

Under the Hood

DynamoDB assigns each item to a storage partition by hashing its partition key. Items that share a partition key value are stored together, ordered by sort key. Single-table design uses composite keys with type prefixes (such as User# or Order#) to group and identify different entity types. Queries use these keys to locate items directly instead of scanning the whole table. Secondary indexes maintain alternate views of the same items under different keys, which supports additional query patterns.
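
The mechanics described here can be approximated with a toy model. Everything in it is an assumption made for illustration: DynamoDB's real hash function and partition count are internal, so `partition_for`, `NUM_PARTITIONS`, and the `ToyTable` class only imitate the general idea of hashing to a partition and keeping sort keys ordered.

```python
import bisect
import hashlib
from collections import defaultdict

NUM_PARTITIONS = 4  # assumed; DynamoDB manages the real number itself


def partition_for(partition_key):
    """Pick a partition from a hash of the partition key (illustrative;
    DynamoDB's internal hash function is not public)."""
    digest = hashlib.sha256(partition_key.encode("utf-8")).hexdigest()
    return int(digest, 16) % NUM_PARTITIONS


class ToyTable:
    def __init__(self):
        # partition number -> {partition key: sorted list of sort keys}
        self.partitions = defaultdict(lambda: defaultdict(list))
        self.items = {}  # (partition key, sort key) -> attributes

    def put(self, pk, sk, attributes):
        if (pk, sk) not in self.items:
            bisect.insort(self.partitions[partition_for(pk)][pk], sk)
        self.items[(pk, sk)] = attributes

    def query(self, pk, sk_prefix=""):
        """Visit only the partition that owns pk and return the sort keys
        starting with sk_prefix, in order."""
        sort_keys = self.partitions[partition_for(pk)].get(pk, [])
        start = bisect.bisect_left(sort_keys, sk_prefix)
        result = []
        for sk in sort_keys[start:]:
            if not sk.startswith(sk_prefix):
                break  # sorted order: no later key can match
            result.append(sk)
        return result


table = ToyTable()
table.put("User#123", "Order#456", {"total": 42})
table.put("User#123", "Profile#123", {"name": "Ada"})
table.put("User#123", "Order#100", {"total": 7})
order_keys = table.query("User#123", "Order#")
```

Because sort keys are kept in order, a prefix lookup is a binary search followed by a short forward walk, which is why key-based queries avoid reading unrelated items.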

Why designed this way?

DynamoDB was built for predictable speed at scale, and it leaves out joins because they are hard to keep fast as data grows. Single-table design fits this model by using keys to organize related data in one table, reducing the need for application-side joins and scans. The trade-off is more modeling complexity in exchange for fast, predictable performance.

┌───────────────┐
│   Client App  │
└──────┬────────┘
       │ Query/Write
       ▼
┌───────────────┐
│ DynamoDB Table│
│ ┌───────────┐ │
│ │Partition  │ │
│ │Key Hashes │ │
│ └───────────┘ │
│ ┌───────────┐ │
│ │Sort Keys  │ │
│ └───────────┘ │
└──────┬────────┘
       │ Uses keys to locate
       ▼
┌───────────────┐
│ Secondary     │
│ Indexes (GSI) │
└───────────────┘
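
To make the role of a global secondary index more concrete, the sketch below re-keys the same items under alternate key attributes. It is an in-memory approximation: `GSI1PK` and `GSI1SK` are hypothetical attribute names, `build_index` is not a DynamoDB API, and the second product and its review are invented sample data.

```python
from collections import defaultdict

# Base-table items. GSI1PK / GSI1SK are hypothetical attribute names that
# a global secondary index could be defined on.
ITEMS = [
    {"PK": "Product#001", "SK": "Info#Product", "title": "Lamp"},
    {"PK": "Product#001", "SK": "Review#User123", "stars": 5,
     "GSI1PK": "User#123", "GSI1SK": "Review#Product#001"},
    {"PK": "Product#002", "SK": "Review#User123", "stars": 3,
     "GSI1PK": "User#123", "GSI1SK": "Review#Product#002"},
]


def build_index(items, pk_attr, sk_attr):
    """Re-key items under alternate key attributes. Items lacking either
    attribute are left out of the index."""
    index = defaultdict(list)
    for item in items:
        if pk_attr in item and sk_attr in item:
            index[item[pk_attr]].append(item)
    for entries in index.values():
        entries.sort(key=lambda item: item[sk_attr])
    return index


gsi1 = build_index(ITEMS, "GSI1PK", "GSI1SK")
# The base table answers "reviews of a product"; the index answers
# "reviews written by a user".
reviews_by_user = gsi1["User#123"]
```

The base table and the index hold the same reviews, each arranged for a different question.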

Myth Busters - 4 Common Misconceptions

Quick: Does single-table design mean putting all data in one big, flat list? Commit to yes or no.

Common Belief: Single-table design means dumping all data into one table without structure.

Quick: Can you always add new access patterns easily in single-table design? Commit to yes or no.

Common Belief: You can add any new query pattern anytime without redesigning the table.

Quick: Is single-table design always cheaper and faster than multiple tables? Commit to yes or no.

Common Belief: Single-table design always improves cost and performance.

Quick: Does single-table design eliminate the need to understand your application's data access? Commit to yes or no.

Common Belief: You can design a single table without knowing how your app queries data.

Expert Zone

1

A sparse index contains only the items that carry the index's key attributes, so setting those attributes only on the items you want to find keeps the index small and its queries fast.

2

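Every partition key value is subject to per-partition throughput limits, so a key that receives a disproportionate share of traffic can become a hot partition and be throttled. Tables with local secondary indexes also cap each item collection (all items sharing a partition key value) at 10 GB.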

3

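The attributes projected into a secondary index affect storage, write cost, and read cost: projecting too few forces extra reads against the base table, while projecting everything duplicates data. Choose projections to match what each query actually returns.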
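
The first and third points can be sketched together. This is an in-memory illustration under assumptions: `OpenSince` is a hypothetical attribute set only on unfulfilled orders, the dates and totals are invented, and `sparse_index` is a helper written for this example rather than a DynamoDB API.

```python
# Sample orders; only unfulfilled orders carry the OpenSince attribute.
ORDERS = [
    {"PK": "User#123", "SK": "Order#456", "total": 42, "OpenSince": "2024-05-01"},
    {"PK": "User#123", "SK": "Order#457", "total": 10},  # fulfilled
    {"PK": "User#124", "SK": "Order#458", "total": 99, "OpenSince": "2024-04-20"},
]


def sparse_index(items, index_key, projected=()):
    """Index only items that carry index_key, copying the table keys, the
    index key, and any extra projected attributes."""
    keep = ("PK", "SK", index_key) + tuple(projected)
    entries = [
        {name: item[name] for name in keep if name in item}
        for item in items
        if index_key in item
    ]
    return sorted(entries, key=lambda entry: entry[index_key])


# Keys-only projection: small, but "total" needs a second read.
open_orders = sparse_index(ORDERS, "OpenSince")
# Wider projection: larger entries, but the query answers by itself.
open_with_total = sparse_index(ORDERS, "OpenSince", ["total"])
```

Removing `OpenSince` from an order when it is fulfilled would drop it from the index, which is what keeps a sparse index small.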

When NOT to use

Single-table design is not ideal when your data model is simple, when access patterns are unpredictable, or when you need complex transactions and joins that relational databases handle better. In such cases, consider multiple tables or a relational database such as Amazon Aurora or PostgreSQL.

Production Patterns

In production, teams use single-table design to model user profiles, orders, and events in one table with GSIs for different queries. They automate schema validation and use infrastructure as code to manage complexity. Monitoring partition key usage helps avoid hot spots.
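
One possible form of automated schema validation is a check that every item's keys match the format expected for its entity type. The sketch below is an assumption about how such a check might look: the entity names, the `PK`/`SK` attribute names, and the regular expressions are hypothetical and modeled on the key examples used in this document.

```python
import re

# Hypothetical key formats: one (PK pattern, SK pattern) pair per entity type.
KEY_PATTERNS = {
    "profile": (r"User#\d+", r"Profile#\d+"),
    "order": (r"User#\d+", r"Order#\d+"),
    "review": (r"Product#\d+", r"Review#User\d+"),
}


def validate_item(entity_type, item):
    """Return a list of problems; an empty list means the keys look valid."""
    if entity_type not in KEY_PATTERNS:
        return [f"unknown entity type: {entity_type}"]
    pk_pattern, sk_pattern = KEY_PATTERNS[entity_type]
    problems = []
    if not re.fullmatch(pk_pattern, item.get("PK", "")):
        problems.append(f"PK does not match {pk_pattern}")
    if not re.fullmatch(sk_pattern, item.get("SK", "")):
        problems.append(f"SK does not match {sk_pattern}")
    return problems
```

A check like this could run in tests or just before writes, so malformed keys are caught before they reach the table.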

Connections

Normalization in Relational Databases

Opposite approach; normalization splits data into many tables, while single-table design combines data into one table.

Understanding normalization helps appreciate why single-table design denormalizes data for speed and scalability.

Hashing in Computer Science

Single-table design relies on partition key hashing to distribute data evenly across storage nodes.

Knowing hashing principles explains how DynamoDB achieves fast data access and load balancing.

Library Cataloging Systems

Both organize diverse items with unique codes to find them quickly without searching everything.

Seeing how libraries use classification systems helps understand how composite keys organize data in single-table design.

Common Pitfalls

#1 Using simple keys without prefixes causes data mix-up.

Wrong approach: PartitionKey: '123', SortKey: '456' for all items without type identifiers.

Correct approach: PartitionKey: 'User#123', SortKey: 'Order#456' to separate data types clearly.

Root cause: Not distinguishing data types in keys leads to overlapping and confusing queries.

#2 Ignoring access patterns and designing keys randomly.

Wrong approach: Choosing keys without considering how the app queries data, e.g., random keys.

Correct approach: Design keys based on expected queries, like grouping all user orders under 'User#ID'.

Root cause: Lack of planning causes inefficient queries and higher costs.

#3 Overloading partition keys causing hot partitions.

Wrong approach: Using a single partition key for millions of items, e.g., 'User#all'.

Correct approach: Distribute data with unique partition keys like 'User#123', 'User#124'.

Root cause: Not understanding partition key distribution leads to performance bottlenecks.
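
The first and third pitfalls can be illustrated with a short sketch. The helper names (`user_pk`, `order_sk`, `largest_share`) and the generated sample of 100 users with 10 orders each are assumptions made for this example.

```python
from collections import Counter


def user_pk(user_id):
    """Prefixed partition key, so user IDs cannot collide with other types."""
    return f"User#{user_id}"


def order_sk(order_id):
    """Prefixed sort key that marks the item as an order."""
    return f"Order#{order_id}"


def largest_share(partition_keys):
    """Fraction of items that land under the most common partition key."""
    counts = Counter(partition_keys)
    return max(counts.values()) / len(partition_keys)


# Generated sample: 100 users with 10 orders each.
orders = [(user_id, order_id) for user_id in range(100) for order_id in range(10)]

# Pitfall #3: every order under one key value concentrates all items.
overloaded = ["User#all" for _ in orders]
# Per-user keys spread the same orders over 100 partition key values.
spread = [user_pk(user_id) for user_id, _ in orders]

overloaded_share = largest_share(overloaded)
spread_share = largest_share(spread)
```

A share close to 1.0 means one key value holds nearly everything, which is the pattern that tends to produce hot partitions.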

Key Takeaways

Single-table design stores multiple related data types in one table using composite keys to organize and access data efficiently.

Designing keys and indexes based on application access patterns is critical for performance and cost savings.

While powerful, single-table design adds complexity and requires careful planning to avoid pitfalls like hot partitions and difficult maintenance.

Understanding DynamoDB's internal partitioning and indexing helps create scalable and fast data models.

Single-table design is a trade-off between simplicity in querying and complexity in data modeling, best used when access patterns are well known.