
crosstab() for cross-tabulation in Pandas - Step-by-Step Execution


Concept Flow - crosstab() for cross-tabulation

Input Data

↓

Select two categorical variables

↓

Count occurrences of each combination

↓

Create frequency table (cross-tabulation)

↓

Display table showing counts per category pair

The crosstab function takes two categorical variables and counts how often each pair occurs, showing the result as a table.

Execution Sample


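import pandas as pd

# Five rows: one Gender value and one Preference value per row
data = {'Gender': ['Male', 'Female', 'Female', 'Male', 'Male'],
        'Preference': ['A', 'B', 'A', 'B', 'A']}
df = pd.DataFrame(data)

# Count how often each (Gender, Preference) pair occurs
result = pd.crosstab(df['Gender'], df['Preference'])
print(result)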

This code counts how many males and females prefer categories A or B and shows the counts in a table.
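To check the counts this sample produces, individual cells can be read back with .loc. The following is a minimal sketch reusing the same data; the names male_a and female_b are illustrative and not part of the original sample.

```python
import pandas as pd

data = {'Gender': ['Male', 'Female', 'Female', 'Male', 'Male'],
        'Preference': ['A', 'B', 'A', 'B', 'A']}
df = pd.DataFrame(data)

result = pd.crosstab(df['Gender'], df['Preference'])

# Row labels come from Gender, column labels from Preference
male_a = result.loc['Male', 'A']      # males who prefer A
female_b = result.loc['Female', 'B']  # females who prefer B
print(male_a, female_b)
```

Each cell is addressed by a (row label, column label) pair, which matches the way the table is read by eye.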

Execution Table

Step	Action	Gender	Preference	Running count for this pair	Cross-tab Table State
1	Read first row	Male	A	1	{'Male': {'A': 1}}
2	Read second row	Female	B	1	{'Male': {'A': 1}, 'Female': {'B': 1}}
3	Read third row	Female	A	1	{'Male': {'A': 1}, 'Female': {'B': 1, 'A': 1}}
4	Read fourth row	Male	B	1	{'Male': {'A': 1, 'B': 1}, 'Female': {'B': 1, 'A': 1}}
5	Read fifth row	Male	A	2	{'Male': {'A': 2, 'B': 1}, 'Female': {'B': 1, 'A': 1}}
6	Build final table	-	-	-	Rows (Gender) by columns (Preference): Female: A=1, B=1; Male: A=2, B=1

💡 All rows processed, cross-tabulation table completed
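The row-by-row tally in the execution table can be sketched in plain Python. This is a conceptual model of the counting only, not a description of how pandas implements crosstab internally; the names counts and states are assumptions made for illustration.

```python
from collections import defaultdict

genders = ['Male', 'Female', 'Female', 'Male', 'Male']
preferences = ['A', 'B', 'A', 'B', 'A']

# Nested tally: outer key is Gender, inner key is Preference
counts = defaultdict(lambda: defaultdict(int))
states = []  # snapshot of the tally after each row

for step, (g, p) in enumerate(zip(genders, preferences), start=1):
    counts[g][p] += 1  # add 1 to the cell for this pair
    snapshot = {k: dict(v) for k, v in counts.items()}
    states.append(snapshot)
    print(step, g, p, counts[g][p], snapshot)
```

Each printed line corresponds to one step of the table: the pair that was read, its running count, and the state of the tally so far.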

Variable Tracker

Variable	Start	After 1	After 2	After 3	After 4	After 5	Final
Cross-tab dict	{}	{'Male': {'A': 1}}	{'Male': {'A': 1}, 'Female': {'B': 1}}	{'Male': {'A': 1}, 'Female': {'B': 1, 'A': 1}}	{'Male': {'A': 1, 'B': 1}, 'Female': {'B': 1, 'A': 1}}	{'Male': {'A': 2, 'B': 1}, 'Female': {'B': 1, 'A': 1}}	Final table as shown

Key Moments - 2 Insights

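Why does the count for Male and Preference A increase on the last row?

The pair (Male, A) was already counted once in step 1, so reading it again in step 5 raises that cell from 1 to 2.

What happens if a category pair does not appear in the data?

If both labels occur somewhere in the data but never together, crosstab fills that cell with 0.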
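The case of a pair that never appears can be tried directly. The sketch below uses a small hypothetical dataset (not the one from the execution sample) in which Female and B both occur, but never in the same row.

```python
import pandas as pd

# Hypothetical data: no row combines 'Female' with 'B'
df_missing = pd.DataFrame({
    'Gender': ['Male', 'Female', 'Male'],
    'Preference': ['A', 'A', 'B'],
})

table = pd.crosstab(df_missing['Gender'], df_missing['Preference'])
print(table)

# The unseen combination is present in the table with a count of 0
female_b = table.loc['Female', 'B']
print(female_b)
```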

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table. What is the count for Female and Preference B after step 2?

D. Not counted yet

Concept Snapshot

pandas.crosstab(index, columns) creates a frequency table.
It counts occurrences of each pair of categories.
Input: two categorical series.
Output: DataFrame with counts.
Useful for quick category relationship summaries.
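The points in this snapshot can be checked on the sample data. This is a small sketch under the assumption that both inputs are named Series, as in the execution sample; in that case the axis names of the result are taken from the Series names.

```python
import pandas as pd

df = pd.DataFrame({
    'Gender': ['Male', 'Female', 'Female', 'Male', 'Male'],
    'Preference': ['A', 'B', 'A', 'B', 'A'],
})

result = pd.crosstab(df['Gender'], df['Preference'])

# The output is an ordinary DataFrame
print(type(result).__name__)

# Axis names are taken from the input Series names
print(result.index.name, result.columns.name)

# Every input row lands in exactly one cell, so the counts sum to len(df)
total = int(result.to_numpy().sum())
print(total, len(df))
```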

Full Transcript

The crosstab function in pandas takes two categorical variables and counts how many times each combination occurs. We start with input data containing categories like Gender and Preference. For each row, crosstab counts the pair and updates the frequency table. For example, when it reads a row with Male and Preference A, it adds 1 to that cell. If the same pair appears again, the count increases. After processing all rows, it shows a table with counts for each category pair. This helps us see relationships between categories quickly.