Trend lines on scatter plots in Matplotlib - Time & Space Complexity
We want to understand how the time to draw a trend line on a scatter plot grows as the number of points increases.
Analyze the time complexity of the following code snippet.
```python
import matplotlib.pyplot as plt
import numpy as np

n = 100  # example value for n
x = np.random.rand(n)
y = np.random.rand(n)

plt.scatter(x, y)                       # draw the n points
coeffs = np.polyfit(x, y, 1)            # fit a degree-1 (straight) trend line
plt.plot(x, coeffs[0] * x + coeffs[1])  # draw the fitted line
plt.show()
```
This code creates a scatter plot of n points and fits a straight line (trend line) through them.
Identify the loops, recursion, and array traversals that repeat.
- Primary operation: calculating the best-fit line with np.polyfit, which processes all n points.
- How many times: each of the n points is examined once to compute the line coefficients.
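To make the single pass over the data explicit, here is a minimal hand-rolled sketch of the least-squares arithmetic that a degree-1 fit performs. (np.polyfit uses a more numerically robust method internally, but it touches the same n points.)

```python
import numpy as np

def fit_line(x, y):
    """Least-squares slope and intercept in one pass over the data.

    A simplified version of what np.polyfit(x, y, 1) computes,
    written to make the O(n) traversal visible.
    """
    n = len(x)
    sx = sy = sxx = sxy = 0.0
    for xi, yi in zip(x, y):  # each point is visited exactly once
        sx += xi
        sy += yi
        sxx += xi * xi
        sxy += xi * yi
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return slope, intercept

x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2 * x + 1          # exactly linear data
print(fit_line(x, y))  # -> (2.0, 1.0)
```

The four running sums are each updated once per point, which is why the total work scales with n.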
As the number of points n increases, the time to compute the trend line grows roughly in direct proportion.
| Input Size (n) | Approx. Operations |
|---|---|
| 10 | About 10 operations |
| 100 | About 100 operations |
| 1000 | About 1000 operations |
Pattern observation: Doubling the points roughly doubles the work needed to find the trend line.
Time Complexity: O(n)
This means the time to compute the trend line grows linearly with the number of points.
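One way to check the linear pattern empirically is to time np.polyfit at increasing sizes. Absolute timings vary by machine, so treat the ratios as rough; a tenfold increase in n should raise the time by roughly tenfold, not a hundredfold.

```python
import time
import numpy as np

rng = np.random.default_rng(0)

for n in (10_000, 100_000, 1_000_000):
    x = rng.random(n)
    y = 3 * x + rng.normal(0, 0.1, n)
    start = time.perf_counter()
    np.polyfit(x, y, 1)
    elapsed = time.perf_counter() - start
    print(f"n={n:>9}: {elapsed:.4f} s")
```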
[X] Wrong: "Adding more points won't affect the time much because the line is just one line."
[OK] Correct: Even though the line is simple, the calculation must consider every point to find the best fit, so more points mean more work.
Understanding how data size affects plotting and calculations helps you explain performance clearly and shows you think about efficiency in real tasks.
What if we changed the trend line to a polynomial of degree 3? How would the time complexity change?
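As a hint, here is a sketch of the cubic case (the data below is made up for illustration). For a fixed degree d, the fit still makes one pass over all n points to build the design matrix, so the time is still O(n); the extra cost of the larger (d+1)-column matrix and the (d+1)-by-(d+1) solve is constant with respect to n.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
x = rng.random(n)
y = 2 * x**3 - x + rng.normal(0, 0.01, n)  # noisy cubic data

# Degree 3 instead of degree 1: still one pass over the n points,
# so the growth in n is unchanged.
coeffs = np.polyfit(x, y, 3)
print(coeffs)  # four coefficients, highest degree first
```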