SQL Window Functions: The Ultimate Guide for Data Analysts

Every data professional knows the power of GROUP BY. It’s the trusty tool we all learn first, allowing us to aggregate data and calculate metrics like total sales per category or the number of users per city. But what happens when the questions get more complex?

What are the top 3 best-selling products within each category?
How does this month’s revenue compare to last month’s for each department?
What is the running total of sales day-by-day?

Trying to answer these questions with GROUP BY alone can lead to complex, inefficient, and often unreadable queries. This is where SQL window functions come in. They are the superpower you need to perform complex analysis while keeping your queries clean and performant.

What Are Window Functions, Really?

A window function performs a calculation across a set of table rows that are somehow related to the current row. Unlike a GROUP BY which collapses rows into a single output row, a window function returns a value for every single row.

Think of it like this: a GROUP BY looks at the whole room and gives you one summary. A window function gives each person in the room a piece of information based on looking at a specific “window” of people around them (e.g., “the 3 tallest people in your group”).

The magic happens with the OVER() clause, which defines the “window” of rows the function should consider.

The Core Syntax

The basic syntax for a window function looks like this:

SQL

SELECT
  column_a,
  column_b,
  AGGREGATE_FUNCTION() OVER (PARTITION BY ... ORDER BY ...) AS new_column
FROM your_table;

AGGREGATE_FUNCTION(): Can be an aggregate function like SUM(), AVG(), COUNT(), or a specialized window function like RANK().
OVER(): This is the mandatory clause that tells SQL you’re using a window function.
PARTITION BY column_name: This is like a GROUP BY within the window. It divides the rows into partitions (groups), and the function is calculated independently for each partition.
ORDER BY column_name: This sorts the rows within each partition. This is essential for functions that depend on order, like RANK() or running totals.

Practical Examples: From Theory to Insight

Let’s use a sample sales table to see window functions in action.

order_id	sale_date	category	product	amount
101	2025-09-01	Electronics	Laptop	1200
102	2025-09-01	Books	SQL Guide	45
103	2025-09-02	Electronics	Mouse	25
104	2025-09-02	Electronics	Keyboard	75
105	2025-09-03	Books	Data Viz	55

1. Calculating a Running Total

Goal: Find the cumulative sales total for each day.

SQL

SELECT
  sale_date,
  amount,
  SUM(amount) OVER (ORDER BY sale_date) AS running_total_sales
FROM sales;

Result:

sale_date	amount	running_total_sales
2025-09-01	1200	1200
2025-09-01	45	1245
2025-09-02	25	1270
2025-09-02	75	1345
2025-09-03	55	1400

2. Ranking Rows within a Group (`RANK`, `DENSE_RANK`, `ROW_NUMBER`)

Goal: Rank products by sales amount within each category.

This is where PARTITION BY becomes essential.

SQL

SELECT
  category,
  product,
  amount,
  RANK() OVER (PARTITION BY category ORDER BY amount DESC) AS rank_num,
  DENSE_RANK() OVER (PARTITION BY category ORDER BY amount DESC) AS dense_rank_num,
  ROW_NUMBER() OVER (PARTITION BY category ORDER BY amount DESC) AS row_num
FROM sales;

RANK(): Gives the same rank for ties, but skips the next rank. (1, 2, 2, 4)
DENSE_RANK(): Gives the same rank for ties, but does not skip. (1, 2, 2, 3)
ROW_NUMBER(): Assigns a unique number to every row, regardless of ties. (1, 2, 3, 4)

3. Comparing to Previous/Next Rows (`LAG` and `LEAD`)

Goal: Find the sales amount from the previous day for each category.

LAG() looks “behind” in the partition, while LEAD() looks “ahead”.

SQL

SELECT
  sale_date,
  category,
  amount,
  LAG(amount, 1, 0) OVER (PARTITION BY category ORDER BY sale_date) AS previous_day_sales
FROM sales;

The 1 means look back one row, and the 0 is the default value if no previous row exists.

Result:

sale_date	category	amount	previous_day_sales
2025-09-01	Books	45	0
2025-09-03	Books	55	45
2025-09-01	Electronics	1200	0
2025-09-02	Electronics	25	1200
2025-09-02	Electronics	75	25

Conclusion: Go Beyond `GROUP BY`

While GROUP BY is essential for aggregation, SQL window functions are the key to unlocking a deeper level of analytical insights. They allow you to perform calculations on a specific subset of rows without losing the detail of the individual rows.

By mastering functions like RANK(), SUM() OVER (...), LAG(), and LEAD(), you can write cleaner, more efficient queries and solve complex business problems that would be a nightmare to tackle with traditional aggregation alone.

SQL Window Functions: The Ultimate Guide for Data Analysts

What Are Window Functions, Really?

The Core Syntax

Practical Examples: From Theory to Insight

1. Calculating a Running Total

2. Ranking Rows within a Group (`RANK`, `DENSE_RANK`, `ROW_NUMBER`)

3. Comparing to Previous/Next Rows (`LAG` and `LEAD`)

Conclusion: Go Beyond `GROUP BY`

Comments

Leave a Reply Cancel reply

More posts

Snowflake Openflow Tutorial Guide 2025

A Data Engineer’s Handbook to Snowflake Performance and SQL Improvements 2025

Snowflake Native dbt Integration: Complete 2025 Guide

Your First Salesforce Copilot Action : A 5-Step Guide

SQL Window Functions: The Ultimate Guide for Data Analysts

What Are Window Functions, Really?

The Core Syntax

Practical Examples: From Theory to Insight

1. Calculating a Running Total

2. Ranking Rows within a Group (RANK, DENSE_RANK, ROW_NUMBER)

3. Comparing to Previous/Next Rows (LAG and LEAD)

Conclusion: Go Beyond GROUP BY

Comments

Leave a Reply Cancel reply

More posts

Snowflake Openflow Tutorial Guide 2025

A Data Engineer’s Handbook to Snowflake Performance and SQL Improvements 2025

Snowflake Native dbt Integration: Complete 2025 Guide

Your First Salesforce Copilot Action : A 5-Step Guide

2. Ranking Rows within a Group (`RANK`, `DENSE_RANK`, `ROW_NUMBER`)

3. Comparing to Previous/Next Rows (`LAG` and `LEAD`)

Conclusion: Go Beyond `GROUP BY`