Overview - Put, get, and query operations

What is it?

Put, get, and query operations are basic ways to work with data in cloud databases like AWS DynamoDB. 'Put' means adding or replacing an item. 'Get' means retrieving a specific item by its key. 'Query' means searching for items that match certain conditions. These operations help store, find, and manage data efficiently.

Why it matters

Without these operations, you couldn't save or find data in cloud databases easily. Imagine a library without a way to add books, find a specific book, or search for books by topic. These operations make data handling fast and reliable, which is crucial for apps and services that millions of people use every day.

Where it fits

Before learning these, you should understand basic cloud storage and database concepts. After mastering them, you can learn about advanced data operations like scans, transactions, and indexing for better performance and complex queries.

Mental Model

Core Idea

Put adds or updates data, get retrieves data by key, and query searches data by conditions in a cloud database.

Think of it like...

Think of a post office: 'Put' is like dropping a letter into a mailbox, 'Get' is picking up a letter addressed to you, and 'Query' is asking the clerk to find all letters sent from a certain city.

┌─────────────┐      ┌─────────────┐      ┌─────────────┐
│   Put       │─────▶│   Database  │◀─────│    Get      │
│ (Add/Update)│      │   Storage   │      │ (Retrieve)  │
└─────────────┘      └─────────────┘      └─────────────┘
                           ▲
                           │
                      ┌─────────────┐
                      │   Query     │
                      │ (Search by  │
                      │  conditions)│
                      └─────────────┘

Build-Up - 6 Steps

1

FoundationUnderstanding Put Operation Basics

Concept: Learn how to add or replace data items in a cloud database using the put operation.

The put operation stores a new item or replaces an existing item with the same key. You provide the item's key and attributes. If the key exists, the old item is overwritten. This is like saving a file on your computer: if the file name exists, it replaces the old file.

Result

Data is saved or updated in the database immediately.

Knowing that put replaces existing items helps avoid accidental data loss when updating.

2

FoundationBasics of Get Operation

3

IntermediateQuery Operation for Searching Data

4

IntermediateDifference Between Query and Scan

5

AdvancedConditional Put and Get Operations

6

ExpertQuery Pagination and Performance Optimization

Under the Hood

Put, get, and query operations interact with the database's storage engine. Put writes data to partitions based on keys, overwriting if needed. Get retrieves data by directly accessing the partition and item key, making it very fast. Query uses partition keys and indexes to scan only relevant partitions and sort keys, avoiding full database scans. Internally, the database uses hash functions and B-tree indexes to organize and find data quickly.

Why designed this way?

These operations were designed to balance speed, cost, and flexibility. Put and get focus on quick, direct access by key, which is common in many apps. Query adds flexible searching without scanning everything, using indexes to keep performance high. Alternatives like full scans were avoided for efficiency. This design supports massive scale and low latency needed in cloud environments.

┌─────────────┐       ┌───────────────┐       ┌─────────────┐
│   Client    │──────▶│  Database API │──────▶│ Storage Node│
└─────────────┘       └───────────────┘       └─────────────┘
       │                     │                       │
       │ Put/Get/Query call   │                       │
       │─────────────────────▶│                       │
       │                     │ Hash partitioning      │
       │                     │───────────────────────▶│
       │                     │                       │
       │                     │   Read/Write data      │
       │                     │◀──────────────────────│
       │                     │                       │
       │◀────────────────────┤                       │

Myth Busters - 4 Common Misconceptions

Quick: Does a put operation always add a new item without replacing existing data? Commit yes or no.

Common Belief:Put operation only adds new items and never overwrites existing ones.

Tap to reveal reality

Quick: Can query search any attribute without specifying a key? Commit yes or no.

Common Belief:Query can search any attribute in the database without restrictions.

Tap to reveal reality

Quick: Does get operation return partial data if you ask for only some attributes? Commit yes or no.

Common Belief:Get always returns the full item regardless of requested attributes.

Tap to reveal reality

Quick: Is scan always better than query for flexible searches? Commit yes or no.

Common Belief:Scan is better because it can search all data without restrictions.

Tap to reveal reality

Expert Zone

1

Query results are paginated and may not return all matching items in one call, requiring careful handling of continuation tokens.

2

Conditional expressions in put and get operations can prevent race conditions in concurrent environments, but must be designed carefully to avoid deadlocks.

3

Choosing the right partition key and sort key design directly impacts query efficiency and overall database performance.

When NOT to use

Avoid using query when you don't know the partition key; instead, consider using scans with filters or redesign your data model. Put operations should not be used for partial updates; use update operations instead. For complex multi-item transactions, use transactional APIs rather than multiple put or get calls.

Production Patterns

In production, put is often used with conditional expressions to prevent overwriting data unintentionally. Get is used for fast lookups by primary key, such as user profiles. Query is used to retrieve related items, like all orders for a customer, using well-designed partition and sort keys. Pagination is implemented to handle large result sets efficiently.

Connections

Indexing in Databases

Query operations build on indexing concepts to find data efficiently.

Understanding how indexes work helps grasp why query is faster than scan and how to design keys.

Caching Systems

Get operations are similar to cache lookups where data is retrieved by key.

Knowing caching principles clarifies why get is the fastest operation and how to reduce database load.

Postal Service Logistics

Put, get, and query mirror sending, receiving, and searching mail in postal systems.

Seeing data operations as mail handling reveals the importance of addressing (keys) and sorting (indexes) for speed.

Common Pitfalls

#1Overwriting data unintentionally with put operation.

Wrong approach:PutItem({Key: '123', Attributes: {...}}) without condition

Correct approach:PutItem({Key: '123', Attributes: {...}}, ConditionExpression: 'attribute_not_exists(Key)')

Root cause:Not using conditional expressions leads to overwriting existing items without warning.

#2Using query without specifying partition key.

Wrong approach:Query({FilterExpression: 'Status = :val'})

Correct approach:Query({KeyConditionExpression: 'PartitionKey = :pk', FilterExpression: 'Status = :val'})

Root cause:Query requires partition key to efficiently locate data; omitting it causes errors or full scans.

#3Fetching all attributes when only some are needed.

Wrong approach:GetItem({Key: '123'})

Correct approach:GetItem({Key: '123', ProjectionExpression: 'Name, Email'})

Root cause:Not using projection expressions wastes bandwidth and slows response.

Key Takeaways

Put operation adds or replaces entire items and can overwrite data if not used carefully.

Get operation retrieves a single item quickly by its unique key and can return partial data to save resources.

Query operation searches for multiple items using partition keys and indexes, making it efficient for related data retrieval.

Using conditional expressions in put and get prevents data conflicts and maintains integrity in concurrent environments.

Understanding pagination and key design is essential for building scalable and cost-effective cloud database applications.