Bash Scriptingscripting~10 mins

sort and uniq in pipelines in Bash Scripting - Step-by-Step Execution

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Concept Flow - sort and uniq in pipelines

Input lines

↓

sort command

↓

uniq command

↓

Output unique sorted lines

The pipeline takes input lines, sorts them alphabetically, then filters out duplicates to output unique sorted lines.

Execution Sample

Bash Scripting

printf "apple\nbanana\napple\ncherry\nbanana\n" | sort | uniq

This command sorts the list of fruits and removes duplicates, showing unique sorted fruits.

Execution Table

Step	Command	Input	Output	Explanation
1	printf	none	apple banana apple cherry banana	Prints the list of fruits with duplicates
2	sort	apple banana apple cherry banana	apple apple banana banana cherry	Sorts lines alphabetically, duplicates stay
3	uniq	apple apple banana banana cherry	apple banana cherry	Removes adjacent duplicate lines
4	end	N/A	apple banana cherry	Pipeline ends with unique sorted output

💡 uniq stops after processing all sorted lines, outputting unique lines only

Variable Tracker

Variable	Start	After printf	After sort	After uniq	Final
lines	empty	apple banana apple cherry banana	apple apple banana banana cherry	apple banana cherry	apple banana cherry

Key Moments - 3 Insights

Why does uniq only remove duplicates after sort, not before?

What happens if we use uniq before sort?

Why do we use a pipeline with | between commands?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table, what is the output after the sort command?

Aapple banana apple cherry banana

Bapple apple banana banana cherry

Capple banana cherry

Dbanana apple cherry

Concept Snapshot

Use 'sort | uniq' in a pipeline to get unique sorted lines.
'sort' arranges lines alphabetically.
'uniq' removes only adjacent duplicates.
Pipe output of sort into uniq for correct unique filtering.
Without sort, uniq misses non-adjacent duplicates.

Full Transcript

This visual execution shows how the bash pipeline 'sort | uniq' works. First, input lines are printed with duplicates. Then 'sort' arranges these lines alphabetically but keeps duplicates. Next, 'uniq' removes duplicates only if they are next to each other. Sorting first groups duplicates together so uniq can remove them. The final output is unique sorted lines. Key points include that uniq alone only removes adjacent duplicates, so sorting first is essential. The pipeline uses the pipe symbol to send output from one command to the next. This step-by-step trace helps beginners see how each command changes the data and why the order matters.