
Long to wide format conversion in Pandas - Step-by-Step Execution


Concept Flow - Long to wide format conversion

Start with long format DataFrame

↓

Choose index columns

↓

Choose columns to spread

↓

Choose values to fill

↓

Apply pivot or pivot_table

↓

Get wide format DataFrame

↓

End

Convert a long table with repeated identifiers into a wide table by spreading values across new columns.

Execution Sample


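import pandas as pd

# Long format: one row per (Name, Year) observation
df = pd.DataFrame({
    'Name': ['Anna', 'Anna', 'Bob', 'Bob'],
    'Year': [2020, 2021, 2020, 2021],
    'Score': [85, 88, 90, 92]
})

# Wide format: one row per Name, one column per Year, Score in the cells
wide_df = df.pivot(index='Name', columns='Year', values='Score')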

This code converts a long DataFrame of scores by year into a wide format with years as columns.

Execution Table

Step	Action	DataFrame State	Result
1	Create long DataFrame	Name, Year, Score columns with 4 rows	DataFrame with repeated Names and Years
2	Select index='Name', columns='Year', values='Score'	Prepare to pivot	Ready to reshape
3	Apply pivot()	Reshape data	Wide DataFrame with Names as index, Years as columns, Scores as values
4	Resulting DataFrame	Index: Anna, Bob; Columns: 2020, 2021	Anna: 85, 88; Bob: 90, 92
5	End	Wide format achieved	Conversion complete

💡 pivot() succeeds here because every (Name, Year) pair appears exactly once in the long data, so each cell of the wide table receives a single Score.
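To check the result described in the table, the sketch below rebuilds the same DataFrame and reads individual cells from the wide table. The final `reset_index()` step is an optional addition, not part of the original example; it assumes you want `Name` back as an ordinary column.

```python
import pandas as pd

df = pd.DataFrame({
    'Name': ['Anna', 'Anna', 'Bob', 'Bob'],
    'Year': [2020, 2021, 2020, 2021],
    'Score': [85, 88, 90, 92]
})

wide_df = df.pivot(index='Name', columns='Year', values='Score')

# Rows are labelled by Name; columns are the Year values (integers, not strings).
anna_2020 = wide_df.loc['Anna', 2020]
anna_2021 = wide_df.loc['Anna', 2021]

# Optional: turn Name into a regular column and drop the 'Year' label
# that pivot() leaves on the column axis.
flat_df = wide_df.reset_index()
flat_df.columns.name = None

print(wide_df)
print(flat_df)
```

Note that the year columns are selected with `2020`, not `'2020'`, because the original `Year` column holds integers.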

Variable Tracker

Variable	Start	After pivot	Final
df	Long format DataFrame with 4 rows	Unchanged	Unchanged
wide_df	Not defined	Wide DataFrame with 2 rows and 2 columns	Wide DataFrame ready for use
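The tracker above can be confirmed with a quick shape check. This is a small sketch using the same data as the execution sample; it shows that `pivot()` returns a new DataFrame and leaves `df` as it was.

```python
import pandas as pd

df = pd.DataFrame({
    'Name': ['Anna', 'Anna', 'Bob', 'Bob'],
    'Year': [2020, 2021, 2020, 2021],
    'Score': [85, 88, 90, 92]
})

wide_df = df.pivot(index='Name', columns='Year', values='Score')

# The long DataFrame still has 4 rows and 3 columns after the pivot.
print(df.shape)       # (4, 3)

# The wide DataFrame has one row per Name and one column per Year.
print(wide_df.shape)  # (2, 2)
```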

Key Moments - 2 Insights

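Why do we need to specify index, columns, and values in pivot?
pivot() has to know which column supplies the row labels, which column supplies the new column headers, and which column supplies the cell values.

What happens if there are duplicate index-column pairs in the long data?
pivot() cannot place two values in one cell, so it raises an error. Use pivot_table() instead, which aggregates the duplicates.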
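The duplicate case can be reproduced with a small example. The data below is hypothetical (Anna has two scores recorded for 2020) and is only meant to illustrate the failure; `duplicated()` is used here as one possible way to locate the offending rows before pivoting.

```python
import pandas as pd

# Hypothetical data: Anna appears twice for the same Year.
dup_df = pd.DataFrame({
    'Name': ['Anna', 'Anna', 'Bob'],
    'Year': [2020, 2020, 2020],
    'Score': [85, 95, 90]
})

# Find every row that shares its (Name, Year) pair with another row.
dup_rows = dup_df[dup_df.duplicated(subset=['Name', 'Year'], keep=False)]
print(dup_rows)

try:
    dup_df.pivot(index='Name', columns='Year', values='Score')
    pivot_failed = False
except ValueError as err:
    # Two scores compete for the single (Anna, 2020) cell, so pivot() gives up.
    pivot_failed = True
    print(f"pivot() failed: {err}")
```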

Visual Quiz - 3 Questions

Test your understanding

Look at step 4 of the Execution Table. What value is at row 'Bob' and column 2021?

A. 90

B. 92

C. 88

D. 85

Concept Snapshot

Long to wide format conversion:
Use pandas pivot() to reshape data.
Specify index (rows), columns (new columns), and values (cell values).
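Each index-column pair must be unique, otherwise pivot() raises a ValueError.
Result is a wide DataFrame with one column per distinct value of the spread column.
Use pivot_table() with an aggregation function if duplicates exist.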
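As a sketch of the last point, the example below uses hypothetical data in which Anna has two scores for 2020. `pivot_table()` collapses them with the function passed as `aggfunc`; which aggregation is appropriate depends on the data, so `'mean'` and `'max'` are shown only as two possibilities.

```python
import pandas as pd

# Hypothetical data with a repeated (Name, Year) pair for Anna in 2020.
dup_df = pd.DataFrame({
    'Name': ['Anna', 'Anna', 'Anna', 'Bob', 'Bob'],
    'Year': [2020, 2020, 2021, 2020, 2021],
    'Score': [85, 95, 88, 90, 92]
})

# aggfunc decides how duplicates collapse into one cell ('mean' is also the default).
wide_mean = dup_df.pivot_table(index='Name', columns='Year',
                               values='Score', aggfunc='mean')

# Keep the highest score per (Name, Year) instead of the average.
wide_max = dup_df.pivot_table(index='Name', columns='Year',
                              values='Score', aggfunc='max')

print(wide_mean)
print(wide_max)
```

With this data, the (Anna, 2020) cell holds 90.0 in `wide_mean` (the average of 85 and 95) and 95 in `wide_max`.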

Full Transcript

This visual execution shows how to convert a long format DataFrame into a wide format using pandas pivot(). We start with a DataFrame where each row has a Name, Year, and Score. By choosing 'Name' as the index, 'Year' as the columns, and 'Score' as the values, pivot() reshapes the data so each Name has one row and each Year becomes a column with the corresponding Score. The execution table traces each step, showing the DataFrame state before and after pivoting. Key moments clarify why specifying index, columns, and values is necessary and what happens if duplicates exist. The quiz tests understanding of the resulting table and pivot behavior. This method is useful for reorganizing data so that it is easier to analyze or visualize.