Overview - Process substitution (<() and >())

What is it?

Process substitution is a feature in bash scripting that lets you use the output or input of a command as if it were a file. It uses special syntax like <() for input and >() for output. This allows commands that expect file names to work with dynamic data streams without creating temporary files. It makes scripts cleaner and more efficient by connecting commands directly.

Why it matters

Without process substitution, you would often need to create temporary files to hold command outputs before passing them to other commands. This adds clutter, slows down scripts, and risks leftover files. Process substitution solves this by letting commands communicate directly through invisible pipes, making automation faster, safer, and easier to manage.

Where it fits

Before learning process substitution, you should understand basic bash commands, pipes, and redirection. After mastering it, you can explore advanced bash scripting techniques like command substitution, coprocesses, and complex data processing pipelines.

Mental Model

Core Idea

Process substitution lets you treat the output or input of a command as a temporary file, enabling seamless data flow between commands that expect file names.

Think of it like...

It's like having a magic folder where you can drop a live video feed or a live microphone input, and any program can open that folder to watch or listen as if it were a normal file.

Command A producing data ──> <() ──> Command B reading as a file

Command B writing to >() ──> Command A consuming data

┌─────────────┐       ┌─────────────┐
│ Command A   │       │ Command B   │
│ (producer)  │       │ (consumer)  │
└─────┬───────┘       └─────┬───────┘
      │                     │
      │ <() or >()          │
      └─────> Temporary pipe or FIFO
            (looks like a file)

Build-Up - 7 Steps

1

FoundationBasics of Command Output Redirection

Concept: Learn how to redirect command output to files and pipes.

In bash, you can send the output of a command to a file using >, like `ls > files.txt`. You can also send output to another command using a pipe `|`, like `ls | grep txt`. These are the basic ways to move data around in scripts.

Result

The output of the first command is saved in a file or passed to the next command.

Understanding basic redirection is essential because process substitution builds on the idea of redirecting command outputs and inputs.

2

FoundationUnderstanding File Descriptors and Pipes

3

IntermediateUsing <() for Input Process Substitution

4

IntermediateUsing >() for Output Process Substitution

5

IntermediateCombining Multiple Process Substitutions

6

AdvancedLimitations and Compatibility of Process Substitution

7

ExpertInternal Mechanics of Process Substitution

Under the Hood

Bash creates a temporary named pipe (FIFO) or uses /dev/fd entries to connect the output or input of a command to another command expecting a file. The shell runs the inner command asynchronously and provides the outer command a file name that points to the pipe. Data flows through this pipe in real time, without disk storage.

Why designed this way?

Process substitution was designed to overcome the limitation that many Unix commands require file names as arguments, not data streams. Using pipes alone can't always replace files because some commands open files multiple times or seek inside them. Named pipes or /dev/fd provide a file-like interface to streams, enabling compatibility and efficiency.

┌───────────────┐          ┌───────────────┐
│ Outer Command │          │ Inner Command │
│ (expects file)│          │ (produces or  │
│               │          │ consumes data)│
└──────┬────────┘          └──────┬────────┘
       │ File name (e.g. /dev/fd/63)   │
       │──────────────────────────────▶│
       │                               │
       │◀──────────────────────────────│
       │          Pipe/FIFO             │
       └──────────────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does <(command) create a real file on disk? Commit to yes or no.

Common Belief:Process substitution creates temporary files on disk that you can find and edit.

Tap to reveal reality

Quick: Can you use >() to redirect output to any command? Commit to yes or no.

Common Belief:>() works exactly like > and can redirect output to any command seamlessly.

Tap to reveal reality

Quick: Does process substitution work in all shells? Commit to yes or no.

Common Belief:Process substitution is a universal shell feature available everywhere.

Tap to reveal reality

Quick: Does process substitution always improve script performance? Commit to yes or no.

Common Belief:Using process substitution always makes scripts faster and better.

Tap to reveal reality

Expert Zone

1

Process substitution uses file descriptors that are unique per shell instance, so scripts relying on fixed file descriptor numbers can break.

2

When stacking multiple process substitutions, the order of execution and resource cleanup can cause subtle bugs or race conditions.

3

Some commands open files multiple times or seek inside them, which process substitution pipes cannot support, requiring fallback to real temporary files.

When NOT to use

Avoid process substitution when writing highly portable scripts for minimal shells like dash or sh. Also, if the command requires random access to the file or multiple reads, use temporary files instead. For very large data, temporary files may be more efficient than pipes.

Production Patterns

In production, process substitution is used to compare outputs of commands (e.g., diff <(cmd1) <(cmd2)), to feed multiple inputs to commands expecting files, and to split output streams with tee >(cmd). It enables concise one-liners and complex pipelines without cluttering the filesystem.

Connections

Named Pipes (FIFOs)

Process substitution uses named pipes internally to simulate files for data streams.

Understanding named pipes helps grasp how process substitution creates file-like interfaces for commands.

Unix File Descriptors

Process substitution relies on file descriptors to connect commands through pipes.

Knowing file descriptors clarifies how bash manages input/output streams behind the scenes.

Data Streaming in Networking

Both process substitution and network streaming handle continuous data flow without storing it permanently.

Recognizing this similarity helps understand the efficiency and real-time nature of process substitution.

Common Pitfalls

#1Trying to edit or open the file created by <() directly.

Wrong approach:vim <(ls)

Correct approach:ls > temp.txt; vim temp.txt

Root cause:Misunderstanding that <() creates a pipe, not a real file, so editors expecting seekable files cannot open it.

#2Using process substitution in a shell that does not support it.

Wrong approach:diff <(ls dir1) <(ls dir2) # run in sh or dash

Correct approach:diff <(ls dir1) <(ls dir2) # run in bash or zsh

Root cause:Assuming all shells support process substitution leads to script failures on minimal shells.

#3Using >() with commands that do not read from standard input properly.

Wrong approach:echo 'data' >() somecommand

Correct approach:echo 'data' | somecommand

Root cause:Not all commands accept input from pipes created by >(), causing unexpected behavior.

Key Takeaways

Process substitution lets you use command outputs or inputs as if they were files, enabling flexible data flow in bash scripts.

It creates temporary pipes or file descriptors, not real files, which is why some commands behave differently with it.

Understanding process substitution helps write cleaner, faster scripts without cluttering the filesystem with temporary files.

It is supported mainly in bash and zsh, so portability requires caution.

Knowing its internal mechanism and limits prevents common bugs and helps choose the right tool for the task.