PyTorch Tensor Indexing: From 1D Slices to N-Dimensional Views
Tensor indexing feels natural once you see how dimensions line up. This walkthrough starts with 1D arrays and climbs to N-dimensional tensors, highlighting how PyTorch treats slices as views (no copies) and how to mutate data safely. To keep the shape intuition crisp, the early examples are paired with small diagrams illustrating axes and selected elements.
1D Vectors: The Basics
import torch
x = torch.arange(10) # tensor([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
print(x[0]) # scalar tensor(0)
print(x[-1]) # tensor(9)
print(x[2:7]) # tensor([2, 3, 4, 5, 6])
graph LR
x0["0"] --> x1["1"] --> x2["2"] --> x3["3"] --> x4["4"] --> x5["5"] --> x6["6"] --> x7["7"] --> x8["8"] --> x9["9"]
class x2,x3,x4,x5,x6 slice;
classDef slice fill:#cdeafe,stroke:#2570d0,stroke-width:2px;
Mutation: slices share storage. Modify a slice and the original vector updates.
view = x[2:5]
view[:] = 42
print(x)
# tensor([ 0, 1, 42, 42, 42, 5, 6, 7, 8, 9])
Use .clone() if you need a detached copy.
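For instance, a minimal sketch contrasting the two (variable names are illustrative):
detached = x[2:5].clone() # independent storage, safe to modify
detached[:] = -1 # x is unchanged
shared = x[2:5] # plain slice, shares storage with x
shared[:] = -1 # x now holds -1 at positions 2-4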
2D Matrices: Rows, Columns, and Ranges
M = torch.arange(16).reshape(4, 4)
# tensor([[ 0, 1, 2, 3],
# [ 4, 5, 6, 7],
# [ 8, 9, 10, 11],
# [12, 13, 14, 15]])
print(M[1, 2]) # element at row 1, col 2 -> tensor(6)
print(M[1]) # entire row 1 -> tensor([4, 5, 6, 7])
print(M[:, 2]) # entire column 2 -> tensor([ 2, 6, 10, 14])
print(M[1:3, 1:3])
# tensor([[5, 6],
# [9, 10]])
graph TB
subgraph Rows
R0["Row 0"]
R1["Row 1"]
R2["Row 2"]
R3["Row 3"]
end
R0 --> C00["0"]
R0 --> C01["1"]
R0 --> C02["2"]
R0 --> C03["3"]
R1 --> C10["4"]
R1 --> C11["5"]
R1 --> C12["6"]
R1 --> C13["7"]
R2 --> C20["8"]
R2 --> C21["9"]
R2 --> C22["10"]
R2 --> C23["11"]
R3 --> C30["12"]
R3 --> C31["13"]
R3 --> C32["14"]
R3 --> C33["15"]
class C11,C12,C21,C22 sub;
classDef sub fill:#ffe9b5,stroke:#d49600,stroke-width:2px;
In-place writes: assignments land in the selected region and broadcast when shapes allow.
M[:, 2] = torch.tensor([100, 200, 300, 400])
The third column updates in one shot.
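A single scalar broadcasts the same way; here is a quick sketch on a copy so M itself stays intact:
demo = M.clone()
demo[1:3, 1:3] = 0 # the scalar 0 broadcasts across the 2x2 block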
3D Tensors: Batches + Channels + Spatial
Model inputs are often batches of lower-dimensional data: think (batch, channel, length) for sequences, or BCHW (batch, channel, height, width) once a spatial grid is involved.
T = torch.arange(2 * 3 * 4).reshape(2, 3, 4)
# shape: (batch=2, channels=3, length=4)
sample0 = T[0] # shape (3, 4)
channel2 = T[:, 2, :] # all batches, channel index 2 -> shape (2, 4)
inner = T[:, :, 1:3] # drop the first and last position -> shape (2, 3, 2)
graph LR
subgraph batch0["Batch 0"]
subgraph b0c0["Channel 0"]
A0("0") --> A1("1") --> A2("2") --> A3("3")
end
subgraph b0c1["Channel 1"]
B0("4") --> B1("5") --> B2("6") --> B3("7")
end
subgraph b0c2["Channel 2"]
C0("8") --> C1("9") --> C2("10") --> C3("11")
end
end
subgraph batch1["Batch 1"]
subgraph b1c0["Channel 0"]
D0("12") --> D1("13") --> D2("14") --> D3("15")
end
subgraph b1c1["Channel 1"]
E0("16") --> E1("17") --> E2("18") --> E3("19")
end
subgraph b1c2["Channel 2"]
F0("20") --> F1("21") --> F2("22") --> F3("23")
end
end
class C0,C1,C2,C3,F0,F1,F2,F3 highlight;
classDef highlight fill:#e4f7d2,stroke:#54a644,stroke-width:2px;
Above, channel index 2 across both batches is highlighted, matching the T[:, 2, :] selection.
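To keep the channel axis instead of dropping it, index with a length-1 slice; a small sketch:
channel2_kept = T[:, 2:3, :] # slicing keeps the axis -> shape (2, 1, 4)
print(channel2.shape, channel2_kept.shape) # torch.Size([2, 4]) torch.Size([2, 1, 4])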
Boolean Masks and Fancy Indexing
mask = M > 10 # assuming M is still the original torch.arange(16).reshape(4, 4)
print(M[mask]) # tensor([11, 12, 13, 14, 15])
Boolean masks flatten the result: the elements at positions satisfying the predicate come back as a 1D tensor.
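Masks also drive in-place writes; a minimal sketch that clamps values on a copy so M is untouched:
clipped = M.clone()
clipped[clipped > 10] = 10 # assignment happens only where the mask is True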
“Fancy” indexing with integer sequences lets you pick arbitrary rows/columns:
rows = torch.tensor([0, 3])
cols = torch.tensor([1, 2])
print(M[rows]) # rows 0 and 3
print(M[:, cols])
# columns 1 and 2 -> shape (4, 2)
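Passing one index tensor per dimension pairs them elementwise, selecting individual entries instead of whole rows or columns; a small sketch with the same rows and cols:
print(M[rows, cols]) # elements at (0, 1) and (3, 2) -> tensor([ 1, 14]) for the original M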
Ellipsis for High Dimensions
The ... placeholder fills in as many dimensions as needed:
N = torch.randn(4, 3, 5, 5) # e.g., batch, channel, height, width
# select channel 1 across all batches and spatial locations
channel1 = N[:, 1, ...] # shape (4, 5, 5)
# zero the bottom-right quadrant
N[..., 2:, 2:] = 0
This keeps code readable when tensors grow beyond three dimensions.
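As a sanity check, ... is just shorthand for the slices it replaces:
assert torch.equal(N[:, 1, ...], N[:, 1, :, :])
assert torch.equal(N[..., 0], N[:, :, :, 0])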
Mutating with index_add_ and scatter_
weights = torch.zeros(5)
indices = torch.tensor([0, 2, 2, 4])
updates = torch.tensor([1.0, 0.5, 1.5, 2.0])
weights.index_add_(0, indices, updates)
# tensor([1., 0., 2., 0., 2.])
index_add_ accumulates values at the specified indices (index 2 above collects 0.5 + 1.5). scatter_ performs targeted writes:
result = torch.zeros(3, 4)
src = torch.tensor([[1., 2.], [3., 4.], [5., 6.]]) # float, matching result's dtype
indices = torch.tensor([[0, 2], [1, 0], [2, 1]])
result.scatter_(1, indices, src)
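# tensor([[1., 0., 2., 0.],
#         [4., 3., 0., 0.],
#         [0., 6., 5., 0.]])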
Each row of indices names the destination columns for the matching row of src: row 0 writes into columns [0, 2], row 1 into columns [1, 0], and row 2 into columns [2, 1].
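On the read side, gather pulls values back out with the same index layout; a quick sketch:
recovered = result.gather(1, indices) # tensor([[1., 2.], [3., 4.], [5., 6.]])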
Beware of Non-Contiguous Views
Slicing followed by a transpose can produce non-contiguous tensors. Some ops require contiguous memory; fix this with .contiguous() or .clone():
view = M[:, ::2] # every other column -> stride of 2 along dim 1
transposed = view.t() # transposing a strided view stays non-contiguous
if not transposed.is_contiguous():
    transposed = transposed.contiguous() # copy into contiguous memory
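A related gotcha: .view() refuses non-contiguous layouts, while .reshape() falls back to a copy when needed; a small sketch on a throwaway tensor:
t = torch.arange(12).reshape(3, 4).t() # transpose -> non-contiguous
# t.view(-1) would raise a RuntimeError here
flat = t.reshape(-1) # works; reshape copies because it has to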
Quick Reference
Pattern | Result | Notes |
---|---|---|
tensor[i] | Scalar/vector | Integer index drops the first dimension |
tensor[:, j] | 1D column | Integer index drops that axis |
tensor[a:b, c:d] | 2D view | Shares storage with the original |
tensor[..., k] | Indexes the last axis | Handy for high-dimensional data |
tensor[mask] | 1D tensor | Mask shape must match the tensor |
tensor[index_tensor] | Fancy indexing | Returns a copy, not a view |
.index_add_, .scatter_ | In-place mutation | Index and source shapes must line up |
PyTorch indexing is flexible because it mirrors NumPy's semantics while returning lightweight views that behave the same on CPU and GPU tensors. With a mental picture of the axes, the kind these simple diagrams conjure, selecting, slicing, and mutating becomes routine, even for large neural network tensors.