Bird
Raised Fist0
ML Pythonml~5 mins

Time series components (trend, seasonality) in ML Python

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Introduction

Time series components help us understand patterns in data collected over time. They show us the main directions and repeating cycles in the data.

When you want to predict sales that change over months or years.
When analyzing temperature changes that repeat every year.
When studying website visits that rise and fall daily or weekly.
When checking stock prices for long-term growth or seasonal effects.
When planning inventory based on regular demand cycles.
Syntax
ML Python
Trend: The long-term increase or decrease in data.
Seasonality: Regular repeating patterns over fixed periods.

Trend shows the overall direction, like a steady rise or fall.

Seasonality repeats at regular intervals, like daily, weekly, or yearly cycles.

Examples
Here, trend is the steady monthly increase, and seasonality is the yearly December spike.
ML Python
Trend: Sales slowly increasing every month.
Seasonality: More sales every December.
Trend shows long-term warming, seasonality shows yearly summer highs.
ML Python
Trend: Temperature rising over decades.
Seasonality: Temperature peaks every summer.
Sample Model

This code creates a fake time series with a clear trend and seasonality. It then breaks the data into parts and shows them with plots and prints.

ML Python
import numpy as np
import matplotlib.pyplot as plt
from statsmodels.tsa.seasonal import seasonal_decompose

# Create time series data with trend and seasonality
np.random.seed(0)
time = np.arange(100)
trend = time * 0.1  # slowly increasing trend
seasonal = 10 * np.sin(2 * np.pi * time / 20)  # repeating pattern every 20 steps
noise = np.random.normal(scale=1, size=100)
data = trend + seasonal + noise

# Decompose the time series
result = seasonal_decompose(data, model='additive', period=20)

# Plot components
plt.figure(figsize=(10,8))
plt.subplot(411)
plt.plot(data)
plt.title('Original Data')
plt.subplot(412)
plt.plot(result.trend)
plt.title('Trend Component')
plt.subplot(413)
plt.plot(result.seasonal)
plt.title('Seasonal Component')
plt.subplot(414)
plt.plot(result.resid)
plt.title('Residual (Noise)')
plt.tight_layout()
plt.show()

# Print first 5 values of each component
print('First 5 trend values:', result.trend[:5])
print('First 5 seasonal values:', result.seasonal[:5])
print('First 5 residual values:', result.resid[:5])
OutputSuccess
Important Notes

Trend values at the edges may be NaN because of how the method calculates moving averages.

Seasonality repeats exactly every set period (20 here), showing the repeating pattern.

Residual is what is left after removing trend and seasonality, often noise.

Summary

Time series data can be split into trend, seasonality, and residual parts.

Trend shows the overall direction over time.

Seasonality shows repeating cycles at fixed intervals.

Practice

(1/5)
1. Which component of a time series shows the long-term upward or downward movement over time?
easy
A. Trend
B. Seasonality
C. Noise
D. Residual

Solution

  1. Step 1: Understand the meaning of trend

    The trend component represents the overall direction or pattern in the data over a long period, such as increasing sales over years.
  2. Step 2: Differentiate from seasonality and noise

    Seasonality repeats in fixed cycles (like monthly), and noise is random variation. Trend is the smooth long-term movement.
  3. Final Answer:

    Trend -> Option A
  4. Quick Check:

    Long-term direction = Trend [OK]
Hint: Trend = overall direction over time, not repeating cycles [OK]
Common Mistakes:
  • Confusing seasonality with trend
  • Thinking noise is trend
  • Mixing residual with trend
2. Which of the following is the correct Python code to plot seasonality in a time series using pandas?
easy
A. df['value'].plot()
B. df['value'].rolling(window=12).mean().plot()
C. df['value'].groupby(df.index.month).mean().plot()
D. df['value'].diff().plot()

Solution

  1. Step 1: Identify how to extract seasonality

    Seasonality repeats in fixed intervals like months, so grouping by month and averaging shows seasonal pattern.
  2. Step 2: Check code options

    df['value'].groupby(df.index.month).mean().plot() groups by month and plots mean, revealing seasonality. Others plot raw data, trend (rolling mean), or differences.
  3. Final Answer:

    df['value'].groupby(df.index.month).mean().plot() -> Option C
  4. Quick Check:

    Group by time period for seasonality plot [OK]
Hint: Group data by time unit (month) to see seasonality [OK]
Common Mistakes:
  • Plotting raw data only
  • Using rolling mean for seasonality
  • Plotting differences instead of seasonal groups
3. Given this Python code snippet, what will be the output type of seasonal?
import pandas as pd
import numpy as np
index = pd.date_range('2023-01-01', periods=12, freq='M')
data = np.sin(np.linspace(0, 2 * np.pi, 12))
df = pd.Series(data, index=index)
seasonal = df.groupby(df.index.month).transform('mean')
medium
A. A numpy array of length 12
B. A pandas Series with same length as df
C. A pandas DataFrame with 12 rows and 1 column
D. A single float value representing mean

Solution

  1. Step 1: Understand groupby with transform

    Using groupby with transform('mean') returns a Series aligned with original index, same length as df.
  2. Step 2: Check output type

    Since df is a Series, seasonal is also a Series with same length, each value replaced by group mean.
  3. Final Answer:

    A pandas Series with same length as df -> Option B
  4. Quick Check:

    groupby + transform returns Series matching original length [OK]
Hint: groupby + transform keeps original length Series [OK]
Common Mistakes:
  • Thinking transform returns single value
  • Confusing transform with aggregate
  • Expecting DataFrame instead of Series
4. You have this code to extract trend using rolling mean:
trend = df['value'].rolling(window=3).mean()
But the output has many NaN values at the start. How can you fix this?
medium
A. Use diff() instead of rolling mean
B. Change window to 1
C. Drop NaN values after rolling mean
D. Use min_periods=1 in rolling to reduce NaNs

Solution

  1. Step 1: Understand rolling mean NaNs

    Rolling mean with window=3 needs 3 values to compute, so first 2 are NaN by default.
  2. Step 2: Use min_periods to allow fewer values

    Setting min_periods=1 lets rolling mean compute with fewer points, reducing NaNs at start.
  3. Final Answer:

    Use min_periods=1 in rolling to reduce NaNs -> Option D
  4. Quick Check:

    min_periods controls minimum data points for rolling [OK]
Hint: Set min_periods=1 in rolling to avoid initial NaNs [OK]
Common Mistakes:
  • Changing window to 1 loses smoothing
  • Dropping NaNs loses early data
  • Using diff() does not fix NaNs
5. You have monthly sales data with a strong yearly seasonality and an upward trend. Which method best separates trend and seasonality components?
hard
A. Use moving average with window=12 for trend, then subtract to get seasonality
B. Use differencing with lag=1 to remove seasonality
C. Apply Fourier transform to remove trend
D. Use rolling mean with window=3 to capture seasonality

Solution

  1. Step 1: Understand yearly seasonality and trend

    Yearly seasonality repeats every 12 months; trend is slow upward movement.
  2. Step 2: Choose method to separate components

    Moving average with window=12 smooths out seasonality, capturing trend. Subtracting trend leaves seasonality.
  3. Step 3: Evaluate other options

    Differencing with lag=1 removes short-term changes, not yearly seasonality. Fourier transform is complex. Rolling mean with window=3 is too short for yearly seasonality.
  4. Final Answer:

    Use moving average with window=12 for trend, then subtract to get seasonality -> Option A
  5. Quick Check:

    Window matches season length to isolate trend [OK]
Hint: Match moving average window to season length to isolate trend [OK]
Common Mistakes:
  • Using too short window for moving average
  • Confusing differencing lag with season length
  • Ignoring trend when extracting seasonality