Why window functions are powerful in PostgreSQL - Performance Analysis
We want to understand how the time it takes to run window functions changes as the data grows.
How does using window functions affect the work the database does?
Analyze the time complexity of the following code snippet.
SELECT employee_id, department_id, salary,
RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS salary_rank
FROM employees;
This query ranks employees by salary within each department using a window function.
Look for repeated work done by the query.
- Primary operation: Sorting employees within each department to assign ranks.
- How many times: Once per department group, but overall it processes all rows.
As the number of employees grows, the database must sort more data within each department.
| Input Size (n) | Approx. Operations |
|---|---|
| 10 | About 10 sorting steps |
| 100 | About 100 sorting steps |
| 1000 | About 1000 sorting steps |
Pattern observation: The work grows roughly in proportion to the number of rows, with some extra cost for sorting within groups.
Time Complexity: O(n log n)
This means the time grows a bit faster than the number of rows because sorting is involved.
[X] Wrong: "Window functions just scan the data once, so they are always very fast."
[OK] Correct: Window functions often require sorting or partitioning, which can take more time as data grows.
Understanding how window functions scale helps you explain query performance clearly and shows you know how databases handle grouped calculations efficiently.
"What if we removed the PARTITION BY clause? How would the time complexity change?"