Overview - Multiple time series comparison

What is it?

Multiple time series comparison means looking at two or more sets of data points collected over time to see how they relate or differ. Each time series shows how something changes, like temperature or sales, over days, months, or years. Comparing them helps find patterns, similarities, or differences. This is often done using line charts or graphs.

Why it matters

Without comparing multiple time series, we might miss important insights like which product sells better over time or how weather changes affect energy use. It helps businesses, scientists, and decision-makers understand trends and relationships. Without this, decisions would be based on guesswork, not clear evidence from data.

Where it fits

Before this, you should understand what a single time series is and how to plot it. After learning this, you can explore advanced topics like correlation analysis, forecasting multiple series together, or anomaly detection across series.

Mental Model

Core Idea

Comparing multiple time series is like watching several movies side by side to see how their stories unfold differently or similarly over time.

Think of it like...

Imagine you have several friends jogging on different paths, and you watch their speeds every minute. Comparing their speeds over time helps you see who runs faster, who slows down, or if they keep pace together.

┌─────────────────────────────┐
│ Time Series Comparison Chart │
├─────────────┬───────────────┤
│ Time (X)    │ Values (Y)    │
├─────────────┼───────────────┤
│ 1           │ Series A: 5   │
│             │ Series B: 7   │
│ 2           │ Series A: 6   │
│             │ Series B: 6   │
│ 3           │ Series A: 7   │
│             │ Series B: 8   │
└─────────────┴───────────────┘

Lines for Series A and B plotted over time to compare trends.

Build-Up - 7 Steps

1

FoundationUnderstanding single time series

Concept: Learn what a time series is and how to plot it simply.

A time series is a sequence of data points recorded at regular time intervals. For example, daily temperature readings. Using matplotlib, you can plot these points on a line chart to see how values change over time. Example code: import matplotlib.pyplot as plt time = [1, 2, 3, 4, 5] temps = [22, 21, 23, 24, 22] plt.plot(time, temps) plt.xlabel('Time (days)') plt.ylabel('Temperature (°C)') plt.title('Daily Temperature') plt.show()

Result

A simple line chart showing temperature changes over 5 days.

Understanding a single time series plot is the base for comparing multiple series later.

2

FoundationPlotting multiple lines together

3

IntermediateUsing legends and colors effectively

4

IntermediateHandling different scales with dual axes

5

IntermediateAligning time points and handling missing data

6

AdvancedVisualizing with subplots for clarity

7

ExpertComparing series with statistical overlays

Under the Hood

Matplotlib creates plots by mapping data points to coordinates on a canvas. Each time series is drawn as a line connecting points in order of time. When multiple series are plotted, matplotlib manages layers and colors to keep them distinct. Dual axes are separate coordinate systems sharing the same x-axis but different y-axes. Internally, matplotlib uses objects like Figure and Axes to organize these elements.

Why designed this way?

Matplotlib was designed to be flexible and powerful for scientific plotting. The layered approach allows combining many plots in one figure. Dual axes solve the problem of comparing series with different units without distorting data. This design balances ease of use with customization.

┌───────────────┐
│   Figure     │
│ ┌─────────┐ │
│ │  Axes   │ │
│ │ ┌─────┐ │ │
│ │ │Line │ │ │
│ │ └─────┘ │ │
│ └─────────┘ │
└───────────────┘

Figure contains Axes; Axes contain Lines representing series.

Myth Busters - 4 Common Misconceptions

Quick: Do you think plotting multiple series on the same axis always makes comparison easier? Commit yes or no.

Common Belief:Plotting all series on the same axis is always best for comparison.

Tap to reveal reality

Quick: Do you think missing data points can be ignored safely when comparing series? Commit yes or no.

Common Belief:Missing data points don't affect comparison much and can be ignored.

Tap to reveal reality

Quick: Do you think adding too many series on one plot always improves insight? Commit yes or no.

Common Belief:More series on one plot always gives better insight.

Tap to reveal reality

Quick: Do you think raw data lines alone are enough to understand complex series behavior? Commit yes or no.

Common Belief:Raw time series lines show all needed information clearly.

Tap to reveal reality

Expert Zone

1

Choosing the right time alignment method (e.g., interpolation vs. truncation) affects comparison accuracy subtly but critically.

2

Colorblind-friendly palettes and line styles improve accessibility but are often overlooked in production charts.

3

Understanding matplotlib's layering and z-order helps avoid hidden lines when plotting many series.

When NOT to use

Avoid plotting too many series on one chart; instead, use subplots or summary statistics. For very large datasets, consider dimensionality reduction or interactive visualization tools like Plotly or Bokeh.

Production Patterns

Professionals often combine multiple time series plots with interactive features like zoom and hover. They use statistical summaries and anomaly detection overlays to highlight important events. Automated reports include consistent color schemes and legends for clarity.

Connections

Correlation analysis

Builds-on

Comparing multiple time series visually is the first step before calculating numerical relationships like correlation coefficients.

Dashboard design

Same pattern

Effective multiple time series comparison principles apply directly to designing dashboards that show many metrics over time clearly.

Music composition

Opposite pattern

Just as multiple time series show simultaneous data trends, music layers different melodies over time, but the goal is harmony rather than comparison.

Common Pitfalls

#1Plotting multiple series with the same color and no legend.

Wrong approach:plt.plot(time, series1) plt.plot(time, series2) plt.show()

Correct approach:plt.plot(time, series1, label='Series 1') plt.plot(time, series2, label='Series 2') plt.legend() plt.show()

Root cause:Not labeling or coloring lines differently makes it impossible to tell which line is which.

#2Plotting series with very different scales on the same y-axis without adjustment.

Wrong approach:plt.plot(time, series1) plt.plot(time, series2) plt.show()

Correct approach:fig, ax1 = plt.subplots() ax1.plot(time, series1, 'b-') ax2 = ax1.twinx() ax2.plot(time, series2, 'r-') plt.show()

Root cause:Ignoring scale differences hides smaller series and misleads interpretation.

#3Ignoring missing time points and plotting series directly.

Wrong approach:plt.plot(time1, series1) plt.plot(time2, series2) plt.show()

Correct approach:s2_aligned = s2.reindex(s1.index).interpolate() plt.plot(s1.index, s1) plt.plot(s2_aligned.index, s2_aligned) plt.show()

Root cause:Not aligning time points causes mismatched comparisons and confusion.

Key Takeaways

Multiple time series comparison helps reveal how different data sets change over time relative to each other.

Plotting multiple lines on the same chart requires clear colors, labels, and sometimes dual axes to handle scale differences.

Aligning time points and handling missing data are essential to avoid misleading comparisons.

Using subplots or statistical overlays can improve clarity and insight beyond raw line plots.

Understanding these principles prevents common mistakes and leads to better data-driven decisions.