Categorical plots help us see how data values spread and differ across groups. Boxplots and violinplots show this clearly for categories.
0
0
Categorical plots (boxplot, violinplot) in Data Analysis Python
Introduction
Comparing test scores of students from different classes.
Checking how sales vary by product category.
Seeing the distribution of heights for men and women.
Understanding customer ratings across different stores.
Syntax
Data Analysis Python
import seaborn as sns sns.boxplot(x='category_column', y='value_column', data=dataframe) sns.violinplot(x='category_column', y='value_column', data=dataframe)
Use boxplot to see median, quartiles, and outliers.
Use violinplot to see data distribution shape and density.
Examples
Shows boxplot of sepal length for each iris species.
Data Analysis Python
sns.boxplot(x='species', y='sepal_length', data=iris_df)
Shows violinplot of total bill amounts for each day of the week.
Data Analysis Python
sns.violinplot(x='day', y='total_bill', data=tips_df)
Compares height distributions between genders using boxplots.
Data Analysis Python
sns.boxplot(x='gender', y='height', data=people_df)
Sample Program
This code loads penguin data and shows two plots: a boxplot and a violinplot of flipper length for each species. It helps compare the spread and distribution of flipper lengths.
Data Analysis Python
import seaborn as sns import matplotlib.pyplot as plt # Load example dataset penguins = sns.load_dataset('penguins') # Create boxplot plt.figure(figsize=(8,4)) sns.boxplot(x='species', y='flipper_length_mm', data=penguins) plt.title('Boxplot of Flipper Length by Penguin Species') plt.show() # Create violinplot plt.figure(figsize=(8,4)) sns.violinplot(x='species', y='flipper_length_mm', data=penguins) plt.title('Violinplot of Flipper Length by Penguin Species') plt.show()
OutputSuccess
Important Notes
Boxplots summarize data with median and quartiles, good for spotting outliers.
Violinplots add detail by showing data distribution shape, useful for seeing if data is skewed or has multiple peaks.
Both plots require categorical data on one axis and numeric data on the other.
Summary
Categorical plots help compare groups easily.
Boxplots show summary statistics and outliers.
Violinplots show data distribution shapes.