Overview - Tables, items, and attributes

What is it?

In Amazon DynamoDB, data is stored in tables. Each table holds multiple items, which are like rows in a spreadsheet. Each item is made up of attributes, the individual named pieces of data, comparable to columns. Unlike spreadsheet columns, attributes are not fixed for the whole table, so the structure stays flexible while still letting you organize and access data efficiently.

Why it matters

Tables, items, and attributes are how DynamoDB organizes data so that it can be stored and retrieved quickly at scale. They let you find what you need by key, even when your data grows very large. Without this organization, apps would become slow or unreliable when handling many users or large volumes of data.

Where it fits

You should first understand basic database ideas like tables and records. After this, you can learn about querying DynamoDB, indexing, and data modeling to use these building blocks effectively.

Mental Model

Core Idea

A DynamoDB table holds many items, and each item is a collection of attributes that describe it.

Think of it like...

Think of a table as a filing cabinet, each item as a folder inside it, and attributes as the papers inside the folder with details about that item.

┌─────────────────┐
│      Table      │
│ ┌─────────────┐ │
│ │    Item     │ │
│ │ ┌─────────┐ │ │
│ │ │Attribute│ │ │
│ │ └─────────┘ │ │
│ └─────────────┘ │
└─────────────────┘

Build-Up - 7 Steps

1

Foundation - Understanding DynamoDB Tables

Concept: Tables are containers for data in DynamoDB, similar to tables in other databases.

A DynamoDB table is where you store your data. It has a name and a primary key that uniquely identifies each item. You create a table before adding any data. Think of it as an empty box ready to hold many items.

Result

You have a named container ready to hold data items, each uniquely identified.

Knowing that tables are the top-level container helps you organize your data storage and plan how to access it.
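As a minimal sketch of what "a name and a primary key" looks like in practice, the block below builds the request parameters for DynamoDB's CreateTable operation as a plain Python dict. The table name `Users` and the key attribute `userId` are hypothetical, and the helper `key_attribute_names` is for illustration only. Nothing is sent to AWS here.

```python
# Request parameters for DynamoDB's CreateTable operation, built as a plain dict.
# "Users" and "userId" are hypothetical names chosen for this example.
create_table_params = {
    "TableName": "Users",
    # KeySchema names the primary key attribute(s); HASH marks the partition key.
    "KeySchema": [{"AttributeName": "userId", "KeyType": "HASH"}],
    # Only key attributes are declared up front; "S" means string.
    "AttributeDefinitions": [{"AttributeName": "userId", "AttributeType": "S"}],
    # On-demand capacity, so no throughput numbers are needed in this sketch.
    "BillingMode": "PAY_PER_REQUEST",
}


def key_attribute_names(params):
    """Return the attribute names that make up the table's primary key."""
    return [entry["AttributeName"] for entry in params["KeySchema"]]


# With boto3 installed and AWS credentials configured, the call would be:
#   import boto3
#   boto3.client("dynamodb").create_table(**create_table_params)
print(key_attribute_names(create_table_params))
```

Note that only the key attribute is declared. Every other attribute is supplied later, item by item.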

2

Foundation - What Are Items in DynamoDB?

3

Intermediate - Attributes: The Data Pieces

4

Intermediate - Primary Keys and Uniqueness

5

Intermediate - Flexible Schema with Attributes

6

Advanced - Handling Large Items and Attribute Limits

7

Expert - Sparse Indexes and Attribute Absence

Under the Hood

DynamoDB splits each table into partitions that are spread across many servers. Each item is stored as a key-value pair where the key is the primary key and the value is a JSON-like document of attributes. DynamoDB hashes the partition key (the first part of the primary key) to locate the partition that holds the item, so a lookup by key does not need to search the whole table. Attributes are stored flexibly, so different items can have different sets of attributes without a fixed schema.
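The idea of hashing a key to find a partition can be sketched with a toy in-memory model. This is not DynamoDB's actual implementation: the hash function (MD5 here), the fixed partition count, and the names `partition_for`, `put_item`, and `get_item` are all assumptions made for illustration.

```python
import hashlib

NUM_PARTITIONS = 4  # hypothetical; DynamoDB manages the real partition count


def partition_for(partition_key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a partition key value to a partition number via a stable hash."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


# Each partition is a dict from primary key value to the item's attributes.
partitions = [dict() for _ in range(NUM_PARTITIONS)]


def put_item(item: dict) -> None:
    """Store the item in the partition chosen by hashing its key."""
    partitions[partition_for(item["userId"])][item["userId"]] = item


def get_item(user_id: str):
    """Look in exactly one partition; the others are never touched."""
    return partitions[partition_for(user_id)].get(user_id)


# Items with different attribute sets live side by side.
put_item({"userId": "u-1", "name": "Alice", "age": 30})
put_item({"userId": "u-2", "name": "Bob", "address": "123 St"})
print(get_item("u-2"))
```

Because the hash is deterministic, a read recomputes the same partition number the write used, which is what makes key lookups fast regardless of table size.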

Why designed this way?

DynamoDB was designed for high scalability and flexibility. Fixed schemas slow down scaling and make evolving data models hard. Using flexible attributes and primary keys allows DynamoDB to distribute data efficiently and adapt to changing application needs. Alternatives like rigid relational schemas were rejected to achieve speed and scale.

┌───────────────┐
│   DynamoDB    │
│     Table     │
│ ┌───────────┐ │
│ │ Partition │ │
│ │  Server   │ │
│ └───────────┘ │
│       │       │
│ ┌───────────┐ │
│ │   Item    │ │
│ │ (Primary  │ │
│ │   Key +   │ │
│ │Attributes)│ │
│ └───────────┘ │
└───────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Can two items in the same DynamoDB table share the same primary key? Commit to yes or no.

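Common Belief: Yes, items can share the same primary key as long as other attributes differ.

Reality: No. The primary key must be unique within a table. Writing an item whose primary key already exists replaces the existing item rather than adding a second one. With a composite primary key, two items may share a partition key as long as their sort keys differ.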
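A toy model of primary key uniqueness, assuming a plain dict as a stand-in for a table (the `put_item` helper and its `only_if_new` flag are hypothetical). A second write with the same key replaces the first item. In DynamoDB itself, a write that refuses to overwrite is expressed with a condition expression such as `attribute_not_exists(userId)` on the put request.

```python
table = {}  # toy stand-in for a table: primary key value -> item


def put_item(item, only_if_new=False):
    """Store an item under its userId; an existing item with that key is replaced.

    only_if_new=True mimics a conditional write that refuses to overwrite.
    Returns True if the item was written, False if the write was rejected.
    """
    key = item["userId"]
    if only_if_new and key in table:
        return False
    table[key] = item
    return True


put_item({"userId": "u-1", "name": "Alice"})
put_item({"userId": "u-1", "name": "Alicia", "age": 31})  # same key: replaces
rejected = not put_item({"userId": "u-1", "name": "Eve"}, only_if_new=True)
print(len(table), table["u-1"]["name"], rejected)
```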

Quick: Do all items in a DynamoDB table need to have the same attributes? Commit to yes or no.

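Common Belief: Yes, all items must have the same attributes like columns in a traditional database.

Reality: No. Only the primary key attributes are required on every item. All other attributes can differ from item to item.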

Quick: Does DynamoDB automatically index all attributes in a table? Commit to yes or no.

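Common Belief: Yes, all attributes are indexed by default for fast queries.

Reality: No. Only the primary key is indexed by default. To query efficiently on any other attribute, you must create a secondary index on it.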

Quick: Can DynamoDB items be larger than 400 KB? Commit to yes or no.

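Common Belief: Yes, DynamoDB can store items of any size.

Reality: No. An item can be at most 400 KB, counting both attribute names and attribute values. Larger objects are typically stored in Amazon S3 and referenced from the item.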

Expert Zone

1

Sparse indexes only include items that have the indexed attribute, saving storage and improving query speed.

2

Composite primary keys enable sorting and range queries within partitions, which is key for complex access patterns.

3

Attributes can be nested documents (maps) or lists, allowing rich data structures without multiple tables.
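The first point, sparse indexes, can be illustrated with a toy model. The sketch assumes an in-memory list of items and a hypothetical helper `build_sparse_index`; the attribute names `orderId`, `status`, and `flaggedAt` are invented for the example. Items that lack the index key attribute simply never appear in the index.

```python
items = [
    {"orderId": "o-1", "status": "shipped"},
    {"orderId": "o-2", "status": "open", "flaggedAt": "2024-05-01"},
    {"orderId": "o-3", "status": "open"},
    {"orderId": "o-4", "status": "open", "flaggedAt": "2024-04-12"},
]


def build_sparse_index(items, index_key):
    """Index only the items that carry index_key, ordered by its value."""
    present = [item for item in items if index_key in item]
    return sorted(present, key=lambda item: item[index_key])


flagged_index = build_sparse_index(items, "flaggedAt")
# Only two of the four items are in the index, so reading it touches less data.
print([item["orderId"] for item in flagged_index])
```

Reading the whole index here is equivalent to asking "which orders are flagged?" without examining the unflagged ones.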

When NOT to use

Avoid DynamoDB for highly relational data that depends on complex joins or ad hoc queries across many entities; DynamoDB has no joins, and its transactions cover only a limited number of items, so a relational database such as Amazon RDS is a better fit. Also, if individual objects would exceed the 400 KB item limit, store them in Amazon S3 and keep only a reference in the DynamoDB item.
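The "store large objects elsewhere and reference them" idea might look like the sketch below. Several assumptions apply: `object_store` is a plain dict standing in for an S3 bucket, `rough_item_size` uses JSON length as an approximation rather than DynamoDB's exact size accounting, the limit is taken as 400 * 1024 bytes, and the attribute names are hypothetical.

```python
import json

MAX_ITEM_BYTES = 400 * 1024  # the 400 KB item limit, taken here as 400 * 1024 bytes


def rough_item_size(item: dict) -> int:
    """Rough estimate: UTF-8 bytes of the JSON form. Not DynamoDB's exact accounting."""
    return len(json.dumps(item).encode("utf-8"))


object_store = {}  # hypothetical stand-in for an S3 bucket: object key -> bytes


def prepare_item(item: dict, large_field: str) -> dict:
    """If the item looks too big, move large_field out and keep only a pointer to it."""
    if rough_item_size(item) <= MAX_ITEM_BYTES:
        return item
    object_key = f"{item['docId']}/{large_field}"
    object_store[object_key] = item[large_field].encode("utf-8")
    slim = {k: v for k, v in item.items() if k != large_field}
    slim[f"{large_field}ObjectKey"] = object_key
    return slim


small = prepare_item({"docId": "d-1", "body": "hello"}, "body")
large = prepare_item({"docId": "d-2", "body": "x" * 500_000}, "body")
print(sorted(small), sorted(large))
```

The small item is stored unchanged, while the oversized one keeps its key plus a pointer attribute that a reader would follow to fetch the body.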

Production Patterns

In production, tables are designed with careful primary key selection to distribute load evenly. Items use sparse attributes to optimize indexes. Data modeling often involves denormalization and composite keys to support fast queries without joins.
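A toy model of a composite key with denormalized items, assuming an in-memory dict keyed first by partition key and then by sort key. The names `customerId`, `orderDate`, `customerName`, and `query_between` are hypothetical. It shows how a range query stays inside one partition and returns items in sort key order, with the customer name copied onto each order so no join is needed.

```python
from bisect import bisect_left, bisect_right

table = {}  # partition key value -> {sort key value -> item}


def put_item(item):
    """File the item under its partition key, then its sort key."""
    table.setdefault(item["customerId"], {})[item["orderDate"]] = item


def query_between(customer_id, start, end):
    """Return one customer's items whose sort key lies in [start, end], in key order."""
    rows = table.get(customer_id, {})
    sort_keys = sorted(rows)
    lo = bisect_left(sort_keys, start)
    hi = bisect_right(sort_keys, end)
    return [rows[sk] for sk in sort_keys[lo:hi]]


# customerName is duplicated on every order (denormalization) to avoid a join.
put_item({"customerId": "c-1", "orderDate": "2024-07-20", "customerName": "Alice", "total": 15})
put_item({"customerId": "c-1", "orderDate": "2024-01-15", "customerName": "Alice", "total": 40})
put_item({"customerId": "c-1", "orderDate": "2024-03-02", "customerName": "Alice", "total": 25})
put_item({"customerId": "c-2", "orderDate": "2024-03-05", "customerName": "Bob", "total": 60})

spring = query_between("c-1", "2024-02-01", "2024-06-30")
print([(o["orderDate"], o["total"]) for o in spring])
```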

Connections

Relational Database Tables and Rows

Similar structure, but DynamoDB tables are schema-less apart from the primary key and are therefore more flexible.

Understanding relational tables helps grasp DynamoDB tables, but knowing DynamoDB's flexibility shows how NoSQL differs.

JSON Documents

Attributes in DynamoDB items can be nested like JSON objects.

Knowing JSON structure helps understand how DynamoDB stores complex nested data within attributes.
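To make the JSON connection concrete, here is a small sketch that converts a nested Python value into DynamoDB's typed attribute-value form, where each value is wrapped in a type tag such as `S` (string), `N` (number), `BOOL`, `L` (list), or `M` (map). The function `to_attribute_value` is a hypothetical helper that covers only a subset of types; it is not part of any AWS library.

```python
def to_attribute_value(value):
    """Convert a Python value to DynamoDB's typed attribute-value form (subset only)."""
    if isinstance(value, bool):  # check bool before int: bool is an int subclass
        return {"BOOL": value}
    if isinstance(value, str):
        return {"S": value}
    if isinstance(value, (int, float)):
        return {"N": str(value)}  # numbers are carried as strings
    if isinstance(value, list):
        return {"L": [to_attribute_value(v) for v in value]}
    if isinstance(value, dict):
        return {"M": {k: to_attribute_value(v) for k, v in value.items()}}
    raise TypeError(f"unsupported type: {type(value).__name__}")


# A nested "profile" attribute: a map containing a string, a list, a number, a bool.
profile = {"city": "Oslo", "tags": ["admin", "beta"], "logins": 3, "active": True}
print(to_attribute_value(profile))
```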

Filing Systems

DynamoDB tables and items function like filing cabinets and folders organizing information.

This connection helps appreciate how data organization impacts retrieval speed and flexibility.

Common Pitfalls

#1 Using a partition key whose value is the same for most or all items, so they all land in one partition.

Wrong approach: PrimaryKey: { "userId": "sameValue" } // every item uses the same key value

Correct approach: PrimaryKey: { "userId": "uniqueValue" } // a distinct value per user

Root cause: Not realizing that partition key values must spread items evenly across partitions to avoid hot partitions.

#2 Assuming all items must have the same attributes and forcing a fixed schema.

Wrong approach: Expecting a differing attribute set to fail: Item1: { "name": "Alice", "age": 30 } Item2: { "name": "Bob", "age": 25, "address": "123 St" } // Assumed error: schema mismatch

Correct approach: Writing both items as they are: Item1: { "name": "Alice", "age": 30 } Item2: { "name": "Bob", "age": 25, "address": "123 St" } // Allowed in DynamoDB

Root cause: Applying relational database schema rules to DynamoDB leads to unnecessary constraints.

#3 Not creating secondary indexes for attributes used in queries.

Wrong approach: Looking items up by the non-key attribute 'email' with no index, which forces a Scan that reads the entire table.

Correct approach: Create a Global Secondary Index on the 'email' attribute, then query that index.

Root cause: Assuming DynamoDB automatically indexes all attributes leads to inefficient queries.
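A sketch of the correct approach for pitfall #3, written as plain request dicts rather than live calls. The table name `Users`, the index name `email-index`, and the helper `query_by_email` are hypothetical. The parameter names follow DynamoDB's CreateTable and Query operations; a provisioned-capacity table would also need throughput settings on the index, and `email` would need an entry in the table's attribute definitions.

```python
# Definition of a global secondary index keyed on "email". A dict of this shape
# goes in the GlobalSecondaryIndexes list of a CreateTable request.
email_index = {
    "IndexName": "email-index",  # hypothetical name
    "KeySchema": [{"AttributeName": "email", "KeyType": "HASH"}],
    "Projection": {"ProjectionType": "ALL"},  # copy all attributes into the index
}


def query_by_email(email):
    """Build Query parameters that read from the index instead of scanning the table."""
    return {
        "TableName": "Users",  # hypothetical table
        "IndexName": email_index["IndexName"],
        "KeyConditionExpression": "email = :e",
        "ExpressionAttributeValues": {":e": {"S": email}},
    }


params = query_by_email("alice@example.com")
# With boto3 installed and AWS credentials configured, the call would be:
#   import boto3
#   boto3.client("dynamodb").query(**params)
print(params["IndexName"], params["KeyConditionExpression"])
```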

Key Takeaways

DynamoDB organizes data into tables, items, and attributes, where tables hold items and items hold attributes.

Each item must have a unique primary key to identify it and enable fast access.

Attributes are flexible and can vary between items, allowing schema-less data models.

Understanding primary keys and attribute flexibility is essential for designing scalable and efficient DynamoDB tables.

Knowing DynamoDB's limits and indexing behavior prevents common mistakes and improves application performance.