Challenge - 5 Problems

🎖️

Window Function Mastery

Get all challenges correct to earn this badge!

Test your skills under time pressure!

❓ query_result

intermediate

2:00remaining

Calculate running total of sales per store

Given a table sales with columns store_id, sale_date, and amount, what is the running total of sales per store ordered by sale_date?

PostgreSQL

SELECT store_id, sale_date, amount,
       SUM(amount) OVER (PARTITION BY store_id ORDER BY sale_date) AS running_total
FROM sales
ORDER BY store_id, sale_date;

A[{"store_id":1,"sale_date":"2023-01-01","amount":100,"running_total":100},{"store_id":1,"sale_date":"2023-01-02","amount":150,"running_total":250},{"store_id":2,"sale_date":"2023-01-01","amount":200,"running_total":200},{"store_id":2,"sale_date":"2023-01-03","amount":100,"running_total":300}]

B[{"store_id":1,"sale_date":"2023-01-01","amount":100,"running_total":100},{"store_id":1,"sale_date":"2023-01-02","amount":150,"running_total":150},{"store_id":2,"sale_date":"2023-01-01","amount":200,"running_total":200},{"store_id":2,"sale_date":"2023-01-03","amount":100,"running_total":100}]

C]}003:"latot_gninnur",001:"tnuoma","30-10-3202":"etad_elas",2:"di_erots"{,}002:"latot_gninnur",002:"tnuoma","10-10-3202":"etad_elas",2:"di_erots"{,}052:"latot_gninnur",051:"tnuoma","20-10-3202":"etad_elas",1:"di_erots"{,}001:"latot_gninnur",001:"tnuoma","10-10-3202":"etad_elas",1:"di_erots"{[

D[{"store_id":1,"sale_date":"2023-01-01","amount":100,"running_total":100},{"store_id":1,"sale_date":"2023-01-02","amount":150,"running_total":250},{"store_id":2,"sale_date":"2023-01-01","amount":200,"running_total":200},{"store_id":2,"sale_date":"2023-01-03","amount":100,"running_total":100}]

Attempts:

2 left

❓ query_result

intermediate

2:00remaining

Find the rank of employees by salary within each department

Given a table employees with columns department_id, employee_name, and salary, which query returns the rank of each employee's salary within their department, with the highest salary ranked 1?

PostgreSQL

SELECT department_id, employee_name, salary,
       RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS salary_rank
FROM employees
ORDER BY department_id, salary_rank;

A[{"department_id":10,"employee_name":"Alice","salary":90000,"salary_rank":1},{"department_id":10,"employee_name":"Bob","salary":85000,"salary_rank":1},{"department_id":20,"employee_name":"Carol","salary":95000,"salary_rank":1},{"department_id":20,"employee_name":"Dave","salary":90000,"salary_rank":1}]

B[{"department_id":10,"employee_name":"Alice","salary":90000,"salary_rank":1},{"department_id":10,"employee_name":"Bob","salary":85000,"salary_rank":2},{"department_id":20,"employee_name":"Carol","salary":95000,"salary_rank":1},{"department_id":20,"employee_name":"Dave","salary":90000,"salary_rank":2}]

C[{"department_id":10,"employee_name":"Alice","salary":90000,"salary_rank":2},{"department_id":10,"employee_name":"Bob","salary":85000,"salary_rank":1},{"department_id":20,"employee_name":"Carol","salary":95000,"salary_rank":2},{"department_id":20,"employee_name":"Dave","salary":90000,"salary_rank":1}]

D[{"department_id":10,"employee_name":"Alice","salary":90000,"salary_rank":1},{"department_id":10,"employee_name":"Bob","salary":85000,"salary_rank":3},{"department_id":20,"employee_name":"Carol","salary":95000,"salary_rank":1},{"department_id":20,"employee_name":"Dave","salary":90000,"salary_rank":2}]

Attempts:

2 left

📝 Syntax

advanced

2:00remaining

Identify the syntax error in this window function query

Which option contains a syntax error when trying to calculate the moving average of sales over the last 3 days per store?

PostgreSQL

SELECT store_id, sale_date, amount,
       AVG(amount) OVER (PARTITION BY store_id ORDER BY sale_date ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS moving_avg
FROM sales;

SELECT store_id, sale_date, amount,
       AVG(amount) OVER (PARTITION BY store_id ORDER BY sale_date ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) AS moving_avg
FROM sales;

SELECT store_id, sale_date, amount,
       AVG(amount) OVER (PARTITION BY store_id ORDER BY sale_date RANGE BETWEEN 2 PRECEDING AND CURRENT ROW) AS moving_avg
FROM sales;

SELECT store_id, sale_date, amount,
       AVG(amount) OVER (PARTITION BY store_id ORDER BY sale_date ROWS BETWEEN CURRENT ROW AND 2 PRECEDING) AS moving_avg
FROM sales;

SELECT store_id, sale_date, amount,
       AVG(amount) OVER (PARTITION BY store_id ORDER BY sale_date ROWS 2 PRECEDING AND CURRENT ROW) AS moving_avg
FROM sales;

Attempts:

2 left

❓ optimization

advanced

2:00remaining

Optimize query to find first and last sale date per customer

Which query is the most efficient to get the first and last sale date per customer from a sales table with many rows?

SELECT DISTINCT customer_id,
       FIRST_VALUE(sale_date) OVER (PARTITION BY customer_id ORDER BY sale_date) AS first_sale,
       LAST_VALUE(sale_date) OVER (PARTITION BY customer_id ORDER BY sale_date ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS last_sale
FROM sales;

SELECT customer_id, sale_date AS first_sale, sale_date AS last_sale
FROM sales
WHERE sale_date IN (SELECT MIN(sale_date) FROM sales WHERE customer_id = sales.customer_id
                    UNION
                    SELECT MAX(sale_date) FROM sales WHERE customer_id = sales.customer_id);

SELECT customer_id, MIN(sale_date) AS first_sale, MAX(sale_date) AS last_sale
FROM sales
GROUP BY customer_id;

SELECT customer_id, sale_date AS first_sale, sale_date AS last_sale
FROM sales
WHERE sale_date = (SELECT MIN(sale_date) FROM sales WHERE customer_id = sales.customer_id);

Attempts:

2 left

🧠 Conceptual

expert

2:00remaining

Understanding difference between ROWS and RANGE in window frames

What is the key difference between using ROWS BETWEEN 1 PRECEDING AND CURRENT ROW and RANGE BETWEEN 1 PRECEDING AND CURRENT ROW in a window function ordered by a numeric column?

AROWS counts physical rows before the current row, while RANGE includes all rows with values within the numeric range, possibly including multiple rows with the same value.

BROWS includes all rows with values within the numeric range, while RANGE counts physical rows before the current row.

CROWS and RANGE behave identically when ordering by a numeric column.

DROWS excludes the current row, while RANGE includes the current row in the frame.

Attempts:

2 left