DBMS Theoryknowledge~5 mins

SELECT with WHERE, ORDER BY, GROUP BY in DBMS Theory - Time & Space Complexity

Choose your learning style9 modes available

Learn Why Deep Visual Practice Challenge Project Recall Time

Time Complexity: SELECT with WHERE, ORDER BY, GROUP BY

O(n + k log k)

Understanding Time Complexity

When we use SELECT queries with WHERE, ORDER BY, and GROUP BY, the database does several steps to get the result.

We want to understand how the time it takes grows as the data gets bigger.

Scenario Under Consideration

Analyze the time complexity of the following code snippet.


SELECT department, COUNT(*)
FROM employees
WHERE salary > 50000
GROUP BY department
ORDER BY COUNT(*) DESC;

This query filters employees by salary, groups them by department, counts each group, and sorts the groups by count.

Identify Repeating Operations

Identify the loops, recursion, array traversals that repeat.

Primary operation: Scanning all employee rows to check the salary condition.
How many times: Once for each row in the employees table.
Secondary operations: Grouping rows by department and counting them, then sorting the groups.

How Execution Grows With Input

As the number of employees grows, the database must check each one for the salary condition.

Input Size (n)	Approx. Operations
10	About 10 checks and grouping steps
100	About 100 checks and grouping steps
1000	About 1000 checks and grouping steps

Pattern observation: The work grows roughly in direct proportion to the number of rows.

Final Time Complexity

Time Complexity: O(n + k log k)

This means the time grows linearly with the number of rows scanned (n), plus the time to sort the grouped results (k groups), which is typically less than n.

Common Mistake

[X] Wrong: "The query only takes time proportional to the number of rows, so it is O(n)."

[OK] Correct: Sorting the grouped results adds extra work that grows faster than just scanning rows, depending on the number of groups.

Interview Connect

Understanding how filtering, grouping, and sorting affect query time helps you explain database performance clearly and confidently.

Self-Check

"What if we removed the ORDER BY clause? How would the time complexity change?"