PyTorch · ~15 mins

Indexing and slicing in PyTorch - Deep Dive

Overview - Indexing and slicing
What is it?
Indexing and slicing in PyTorch means selecting parts of a tensor, like picking pieces from a big cake. Indexing picks single elements or specific positions, while slicing grabs a range or section of the tensor. This helps you work with just the data you need without changing the whole tensor.
Why it matters
Without indexing and slicing, you would have to use the entire tensor every time, which is slow and wasteful. Being able to quickly grab parts of data lets you train models faster and handle complex data easily. It’s like having a sharp knife to cut exactly the piece you want instead of eating the whole cake.
Where it fits
Before learning indexing and slicing, you should understand what tensors are and how they store data. After mastering this, you can learn advanced tensor operations like broadcasting, reshaping, and masking to manipulate data efficiently.
Mental Model
Core Idea
Indexing and slicing let you pick and cut parts of a tensor to focus on specific data without copying or changing the whole thing.
Think of it like...
Imagine a tensor as a big chocolate bar divided into squares. Indexing is like picking one square, and slicing is like breaking off a row or a chunk of squares to eat.
Tensor (3x4):
┌───────────────┐
│ 0  1  2  3    │
│ 4  5  6  7    │
│ 8  9 10 11    │
└───────────────┘

Indexing: tensor[1,2] → 6
Slicing: tensor[0:2, 1:3] → [[1, 2], [5, 6]]
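The diagram above maps directly onto code; a minimal check of both operations:

```python
import torch

# The 3x4 tensor from the diagram, filled with 0..11
t = torch.arange(12).reshape(3, 4)

print(t[1, 2])      # indexing a single element → tensor(6)
print(t[0:2, 1:3])  # slicing a 2x2 block → [[1, 2], [5, 6]]
```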
Build-Up - 7 Steps
1
Foundation: Understanding PyTorch tensor basics
Concept: Learn what tensors are and how they store data in multiple dimensions.
A tensor is like a multi-dimensional array. For example, a 2D tensor looks like a matrix with rows and columns. You can create a tensor in PyTorch using torch.tensor(). Example:

import torch
x = torch.tensor([[1, 2, 3], [4, 5, 6]])
print(x)

This prints a 2x3 tensor.
Result
You get a tensor printed as rows and columns, showing the data stored.
Understanding tensors as containers of data in multiple dimensions is key before learning how to pick parts of them.
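A quick sketch of inspecting a tensor's basic properties (shape, dimensionality, dtype), which you will rely on when indexing:

```python
import torch

# A 2D tensor: 2 rows, 3 columns
x = torch.tensor([[1, 2, 3], [4, 5, 6]])

print(x)
print(x.shape)  # torch.Size([2, 3])
print(x.ndim)   # 2 dimensions
print(x.dtype)  # torch.int64, inferred from the Python ints
```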
2
Foundation: Basic indexing to access elements
Concept: Learn how to access single elements in a tensor using indices.
Indexing uses square brackets with positions starting at zero. For a 2D tensor, tensor[row, column] picks one element. Example:

x = torch.tensor([[10, 20], [30, 40]])
print(x[0, 1])   # prints 20

Negative indices count from the end, so x[-1, 0] is 30.
Result
You get the single element at the specified position.
Knowing how to pick single elements lets you read or modify specific data points in tensors.
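Indexing works for both reading and writing single positions; a minimal sketch:

```python
import torch

x = torch.tensor([[10, 20], [30, 40]])

print(x[0, 1])   # row 0, column 1 → tensor(20)
print(x[-1, 0])  # last row, first column → tensor(30)

# Indexing can also write to a single position
x[0, 0] = 99
print(x[0, 0])   # tensor(99)
```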
3
Intermediate: Slicing tensors to get sub-tensors
🤔 Before reading on: do you think slicing copies data or creates a view? Commit to your answer.
Concept: Slicing extracts a range of elements along each dimension, creating a smaller tensor view.
Use a colon ':' to slice ranges. For example, tensor[0:2, 1:3] picks rows 0 and 1, columns 1 and 2. Example:

x = torch.tensor([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
sub = x[0:2, 1:3]   # avoid the name 'slice', which shadows a Python builtin
print(sub)

This prints [[2, 3], [5, 6]]. Slicing creates a view, not a copy, so changes to the slice affect the original tensor.
Result
You get a smaller tensor showing the selected section.
Understanding slicing as creating views helps avoid bugs when modifying data unintentionally.
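The view behavior is easy to demonstrate: writing through a slice shows up in the original tensor.

```python
import torch

x = torch.tensor([[1, 2, 3], [4, 5, 6], [7, 8, 9]])

sub = x[0:2, 1:3]  # rows 0-1, columns 1-2
print(sub)         # [[2, 3], [5, 6]]

# A slice is a view: writing through it changes the original
sub[0, 0] = 0
print(x[0, 1])     # tensor(0) -- the original changed too
```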
4
Intermediate: Using advanced indexing with lists and masks
🤔 Before reading on: can you use a list of indices to pick non-contiguous elements? Yes or no?
Concept: Advanced indexing lets you pick elements using lists of indices or boolean masks to select complex patterns.
You can pass lists or tensors of indices to pick specific rows or columns. Example:

x = torch.tensor([[10, 20], [30, 40], [50, 60]])
rows = torch.tensor([0, 2])
print(x[rows])   # picks rows 0 and 2

Boolean masks select elements where the condition is True:

mask = x > 25
print(x[mask])   # elements greater than 25, flattened into a 1D tensor
Result
You get tensors with only the selected elements or rows.
Advanced indexing allows flexible data selection beyond simple slices, essential for filtering and conditional operations.
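Both forms of advanced indexing side by side, as a small runnable sketch:

```python
import torch

x = torch.tensor([[10, 20], [30, 40], [50, 60]])

# Index tensor: pick non-contiguous rows 0 and 2
rows = torch.tensor([0, 2])
print(x[rows])   # [[10, 20], [50, 60]]

# Boolean mask: same shape as x, True where the condition holds
mask = x > 25
print(mask)
print(x[mask])   # 1D tensor of matching elements: [30, 40, 50, 60]
```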
5
Intermediate: Indexing with ellipsis and new axes
Concept: Learn how to use ellipsis (...) to simplify indexing and add new dimensions with None.
Ellipsis (...) means 'all remaining dimensions'. For example, tensor[..., 0] picks the first element along the last dimension regardless of the tensor's shape. Adding a new axis with None or unsqueeze() changes the tensor's shape:

x = torch.tensor([1, 2, 3])
print(x[None, :])   # shape changes from (3,) to (1, 3)

This helps prepare tensors for operations needing specific shapes.
Result
You get tensors with selected dimensions or expanded shapes.
Using ellipsis and new axes simplifies working with tensors of varying dimensions and prepares data for broadcasting.
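A sketch of how ellipsis and None change shapes, using a 3D tensor so the ellipsis has something to stand in for:

```python
import torch

x = torch.zeros(2, 3, 4)

# Ellipsis stands in for "all remaining dimensions"
print(x[..., 0].shape)       # torch.Size([2, 3]): index only the last dim

v = torch.tensor([1, 2, 3])
print(v[None, :].shape)      # torch.Size([1, 3]): new leading axis
print(v[:, None].shape)      # torch.Size([3, 1]): new trailing axis
print(v.unsqueeze(0).shape)  # same effect as v[None, :]
```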
6
Advanced: Memory sharing and copying in slicing
🤔 Before reading on: does slicing always create a new copy of data or share memory? Commit to your answer.
Concept: Slicing usually creates a view sharing memory with the original tensor, but some operations force copies.
When you slice a tensor, PyTorch creates a view pointing to the same data, so modifying the slice changes the original tensor. Example:

x = torch.tensor([1, 2, 3, 4])
s = x[1:3]
s[0] = 100
print(x)   # prints [1, 100, 3, 4]

To create an independent copy, use .clone():

c = x[1:3].clone()
c[0] = 200
print(x)   # original unchanged
Result
You see that slices share data unless explicitly cloned.
Knowing when data is shared prevents bugs where changes unexpectedly affect original tensors.
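The view/clone contrast in one runnable sketch:

```python
import torch

x = torch.tensor([1, 2, 3, 4])

s = x[1:3]          # view: shares memory with x
s[0] = 100
print(x)            # tensor([  1, 100,   3,   4])

c = x[1:3].clone()  # independent copy
c[0] = 200
print(x)            # unchanged by the write to c
```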
7
Expert: Indexing performance and advanced tricks
🤔 Before reading on: do you think fancy indexing is faster or slower than slicing? Commit to your answer.
Concept: Different indexing methods have different performance costs; understanding this helps optimize code.
Simple slicing is very fast because it creates views without copying. Fancy indexing (using lists or masks) creates copies and is slower. Example:

import time
x = torch.arange(1000000)

start = time.time()
y = x[100:200]            # slicing
print('Slice time:', time.time() - start)

start = time.time()
y = x[[100, 200, 300]]    # fancy indexing
print('Fancy index time:', time.time() - start)

Prefer slicing when possible. For large data, torch.index_select and torch.masked_select can be more efficient and more explicit than fancy indexing.
Result
You observe slicing is faster than fancy indexing.
Understanding performance differences guides writing efficient tensor operations in real projects.
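A rough benchmarking sketch; wall-clock timings for such tiny operations are noisy, so treat the printed numbers as indicative only. It also shows torch.index_select producing the same result as fancy indexing:

```python
import time
import torch

x = torch.arange(1_000_000)

start = time.time()
view = x[100:200]                         # slicing: returns a view, no copy
slice_t = time.time() - start

idx = torch.tensor([100, 200, 300])
start = time.time()
fancy = x[idx]                            # fancy indexing: allocates a copy
fancy_t = time.time() - start

print(f"slice: {slice_t:.6f}s  fancy: {fancy_t:.6f}s")

# torch.index_select is an explicit alternative for gathering along a dim
sel = torch.index_select(x, 0, idx)
print(torch.equal(sel, fancy))            # same elements either way
```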
Under the Hood
PyTorch tensors store data in contiguous memory blocks. Indexing with slices creates views by adjusting metadata like shape and strides without copying data. Fancy indexing creates new tensors by copying selected elements. This memory sharing is efficient but requires care to avoid unintended side effects.
Why designed this way?
The design balances speed and flexibility. Views avoid copying for performance, while copies ensure safe independent data. Alternatives like always copying would slow down operations; always sharing would risk bugs. This hybrid approach is a practical tradeoff.
Tensor data memory:
┌─────────────────────────┐
│ [data block in memory]  │
│ 1  2  3  4  5  6  7  8  │
└─────────────────────────┘

Slicing view (shares memory):
┌────────────────┐
│ shape, strides │
│ points to data │
└────────────────┘

Fancy indexing (new tensor):
┌───────────────┐
│ new data copy │
└───────────────┘
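The metadata-versus-copy distinction can be observed directly through data_ptr() and stride(); a small sketch, assuming a recent PyTorch:

```python
import torch

x = torch.arange(8)
s = x[2:6]

# A slice reuses x's storage: same buffer, shifted by an offset
print(s.data_ptr() - x.data_ptr())  # 2 elements * x.element_size() bytes

# Fancy indexing allocates fresh storage
f = x[torch.tensor([2, 3, 4, 5])]
print(f.data_ptr() == x.data_ptr())  # False: new memory

# Views are described by shape + strides over the shared buffer
m = torch.arange(12).reshape(3, 4)
print(m.stride())          # (4, 1): 4 elements per row step, 1 per column
print(m[:, 1:3].stride())  # still (4, 1): same buffer, new shape/offset
```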
Myth Busters - 4 Common Misconceptions
Quick: Does slicing always create a new copy of the tensor data? Commit yes or no.
Common Belief:Slicing always makes a new copy, so changing a slice won't affect the original tensor.
Reality:Slicing creates a view sharing the same memory, so modifying the slice changes the original tensor.
Why it matters:Assuming slicing copies data can lead to bugs where changes unexpectedly affect the original tensor, causing hard-to-find errors.
Quick: Can you use negative indices in PyTorch indexing? Commit yes or no.
Common Belief:Negative indices are not allowed in PyTorch indexing.
Reality:PyTorch supports negative indices to count from the end of a dimension.
Why it matters:Not knowing this limits your ability to write concise and flexible code for accessing tensor elements.
Quick: Does fancy indexing return a view or a copy? Commit your answer.
Common Belief:Fancy indexing returns a view like slicing.
Reality:Fancy indexing returns a new tensor copy, not a view.
Why it matters:Misunderstanding this can cause performance issues and unexpected behavior when modifying tensors.
Quick: Is ellipsis (...) required to index all dimensions? Commit yes or no.
Common Belief:You must specify all dimensions explicitly; ellipsis is just a shortcut with no real effect.
Reality:Ellipsis lets you skip specifying some dimensions, making code cleaner and more flexible.
Why it matters:Ignoring ellipsis leads to verbose and less maintainable code, especially with high-dimensional tensors.
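All four myths above can be checked in a few lines; a minimal verification sketch:

```python
import torch

x = torch.tensor([1, 2, 3, 4])

# Myth 1: slicing copies -- no, it shares memory
x[0:2][0] = 9
print(x[0])     # tensor(9): the original changed

# Myth 2: negative indices unsupported -- they work fine
print(x[-1])    # tensor(4)

# Myth 3: fancy indexing returns a view -- no, it copies
f = x[torch.tensor([1, 2])]
f[0] = 0
print(x[1])     # tensor(2): unchanged by the write to f

# Myth 4: ellipsis is cosmetic -- it indexes all skipped dims
y = torch.zeros(2, 3, 4)
print(torch.equal(y[..., 0], y[:, :, 0]))  # True
```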
Expert Zone
1
Slicing creates views that share memory, but operations like transpose or permute produce non-contiguous views; some downstream operations (such as view()) then require a contiguous copy via .contiguous().
2
Fancy indexing can trigger unexpected data copies and slowdowns, so profiling code to choose between slicing, index_select, or masked_select is crucial in performance-critical applications.
3
Using None (or unsqueeze) to add dimensions is essential for broadcasting, but forgetting to do so can cause silent shape mismatches or inefficient computations.
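Point 1 above is worth seeing concretely: transpose returns a view with swapped strides, which is no longer contiguous, and view() then needs a contiguous copy first. A minimal sketch:

```python
import torch

m = torch.arange(6).reshape(2, 3)
t = m.t()                   # transpose: a view with swapped strides

print(m.is_contiguous())    # True
print(t.is_contiguous())    # False: strides no longer match row-major order

# view() requires contiguous memory; .contiguous() copies if needed
flat = t.contiguous().view(-1)
print(flat)                 # tensor([0, 3, 1, 4, 2, 5])

# reshape() handles this automatically, copying only when it must
print(t.reshape(-1))
```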
When NOT to use
Avoid fancy indexing when working with very large tensors or in tight loops due to copying overhead; instead, use slicing or torch.index_select. For complex conditional selection, consider boolean masks or torch.where. When you need independent data, explicitly clone slices to avoid side effects.
Production Patterns
In production, slicing is used for batching data and extracting features efficiently. Fancy indexing is common in data augmentation and filtering. Experts combine indexing with broadcasting and masking to write concise, fast data pipelines and model inputs.
Connections
Broadcasting
Indexing with new axes (None) prepares tensors for broadcasting by aligning dimensions.
Understanding indexing helps you reshape tensors correctly to leverage broadcasting for efficient arithmetic operations.
Database querying
Indexing and slicing in tensors is similar to selecting rows and columns in database tables.
Knowing how databases filter and select data helps grasp tensor indexing as a way to query multi-dimensional data efficiently.
Photography cropping
Slicing a tensor is like cropping a photo to focus on a part of the image.
This connection shows how selecting parts of data is a universal concept across fields, helping understand the purpose of slicing.
Common Pitfalls
#1Modifying a slice expecting original tensor to stay unchanged.
Wrong approach:

x = torch.tensor([1, 2, 3, 4])
s = x[1:3]
s[0] = 100
print(x)   # expects [1, 2, 3, 4], but prints [1, 100, 3, 4]

Correct approach:

x = torch.tensor([1, 2, 3, 4])
s = x[1:3].clone()
s[0] = 100
print(x)   # prints [1, 2, 3, 4]
Root cause:Not realizing slicing creates a view sharing memory, so changes affect the original tensor.
#2Using fancy indexing inside a loop causing slowdowns.
Wrong approach:

for i in range(1000):
    y = x[[i, i+1]]   # fancy indexing inside a loop: 1000 small copies

Correct approach:

indices = torch.arange(0, 1001)
y = x[indices]   # one batched indexing call outside the loop
Root cause:Not understanding that fancy indexing copies data and is slower than slicing.
#3Forgetting to add new axis when needed for broadcasting.
Wrong approach:

x = torch.tensor([1, 2, 3])
y = torch.ones(3, 4)
result = x + y   # RuntimeError: shape (3,) does not broadcast against (3, 4)

Correct approach:

x = torch.tensor([1, 2, 3])
result = x[:, None] + y   # x becomes (3, 1) and broadcasts to (3, 4)
Root cause:Not using None or unsqueeze to align tensor shapes for broadcasting.
Key Takeaways
Indexing and slicing let you select parts of tensors efficiently without copying data unnecessarily.
Slicing creates views sharing memory, so modifying slices affects the original tensor unless you clone explicitly.
Fancy indexing returns copies and is slower, so use it carefully in performance-critical code.
Using ellipsis and new axes simplifies working with tensors of different shapes and prepares data for broadcasting.
Understanding these concepts deeply helps write faster, cleaner, and bug-free tensor manipulation code in PyTorch.