Overview - Categorical scatter with jitter
What is it?
Categorical scatter with jitter is a way to show data points that belong to different categories on a plot. Instead of placing points exactly on the category line, jitter adds a small random shift to each point horizontally. This helps to avoid points overlapping and makes it easier to see the distribution of data within each category.
Why it matters
Without jitter, many points in the same category stack on top of each other, hiding the true number of observations and their spread. Jitter reveals the density and variation of data points, making patterns clearer and helping to avoid misleading conclusions. It is especially useful when dealing with discrete categories and many data points.
Where it fits
Before learning this, you should understand basic scatter plots and categorical data representation. After mastering jitter, you can explore more advanced visualization techniques like violin plots, swarm plots, or interactive plots that show distributions more richly.