Overview - capfd for file descriptors

What is it?

capfd is a feature in pytest that captures output sent to file descriptors like stdout and stderr during test execution. It allows you to check what your code prints or logs without changing the code itself. This helps verify that your program outputs the expected messages or errors. It works by temporarily redirecting these outputs so tests can inspect them.

Why it matters

Without capfd, testing printed output or error messages would require manual observation or complex workarounds, making tests fragile and hard to automate. capfd solves this by capturing output automatically, enabling reliable and repeatable tests. This improves software quality by ensuring programs communicate correctly with users or other systems.

Where it fits

Before learning capfd, you should understand basic pytest test functions and assertions. After mastering capfd, you can explore other pytest capturing tools like capsys and caplog, and learn advanced testing techniques involving subprocesses or external commands.

Mental Model

Core Idea

capfd captures everything sent to the system's output streams so tests can check printed or error messages without changing the program.

Think of it like...

It's like putting a temporary recording device on your phone call line to listen later without interrupting the conversation.

┌───────────────┐
│ Your Program  │
│  (prints)    │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ capfd Capture │
│ (intercepts)  │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ Test Checks   │
│ (assertions)  │
└───────────────┘

Build-Up - 7 Steps

1

FoundationUnderstanding stdout and stderr

Concept: Learn what stdout and stderr are and why programs use them.

Programs send normal messages to stdout (standard output) and error messages to stderr (standard error). These are like two separate pipes where your program talks to the outside world. For example, print() sends text to stdout, while error messages go to stderr.

Result

You know the difference between normal output and error output streams.

Understanding these streams is key because capfd captures both separately, letting you test normal and error messages independently.

2

FoundationBasic pytest test and assertion

3

IntermediateUsing capfd fixture in tests

4

IntermediateDistinguishing stdout and stderr captures

5

IntermediateResetting capture between reads

6

AdvancedCapturing output from subprocesses

7

ExpertHow capfd hooks into low-level file descriptors

Under the Hood

capfd temporarily replaces the OS file descriptors 1 (stdout) and 2 (stderr) with pipes created by pytest. When the program writes output, it goes into these pipes. pytest reads from the pipes to capture the output. After the test, capfd restores the original file descriptors to avoid side effects.

Why designed this way?

This design allows capturing output at the lowest level, ensuring even output from native code or C extensions is caught. Alternatives like patching print() only capture Python-level output and miss lower-level writes. Using OS file descriptor redirection is more robust and transparent.

┌───────────────┐
│ Original FD 1 │───┐
└───────────────┘   │
                    │ replaced by pipe
┌───────────────┐   ▼
│ Pipe for FD 1 │◄──┤
└───────────────┘   │
                    │
┌───────────────┐   │
│ Your Program  │───┤ writes to FD 1
└───────────────┘   │
                    │
┌───────────────┐   │
│ pytest reads  │───┘
│ from pipe     │
└───────────────┘

Myth Busters - 3 Common Misconceptions

Quick: Does capfd capture output from subprocesses spawned by your code? Commit to yes or no.

Common Belief:capfd captures all output including from subprocesses.

Tap to reveal reality

Quick: Does calling capfd.readouterr() accumulate output or clear it? Commit to your answer.

Common Belief:Calling readouterr() accumulates output so you get all output so far every time.

Tap to reveal reality

Quick: Does capfd capture output only from print() calls? Commit to yes or no.

Common Belief:capfd only captures output from Python print() function.

Tap to reveal reality

Expert Zone

1

capfd captures output by redirecting OS file descriptors, so it works even if code bypasses Python's sys.stdout or print functions.

2

Repeated calls to readouterr() clear the buffer, so tests can check output in stages, but forgetting this can cause missing output in assertions.

3

capfd does not capture output from subprocesses; for those, you need to capture output differently, such as by mocking subprocess calls or using capsys.

When NOT to use

Do not use capfd when testing output from subprocesses or external commands; instead, use capsys or mock subprocess.Popen. Also, if you only need to capture Python-level output, capsys might be simpler.

Production Patterns

In real-world tests, capfd is used to verify CLI tools print correct messages, to check error handling outputs, and to test libraries that write directly to stdout/stderr, including C extensions.

Connections

capsys fixture in pytest

Related capturing tool with different implementation

Understanding capfd helps grasp capsys, which captures Python-level output streams, while capfd captures OS-level file descriptors.

Unix file descriptors

Underlying system concept capfd manipulates

Knowing how Unix file descriptors work clarifies how capfd redirects output streams at the OS level.

Network packet sniffing

Similar pattern of intercepting data streams

Both capfd and packet sniffing intercept data transparently for inspection without altering the source, showing a common pattern in debugging and testing.

Common Pitfalls

#1Expecting capfd to capture output from subprocesses.

Wrong approach:def test_subprocess_output(capfd): import subprocess subprocess.run(['echo', 'hello']) captured = capfd.readouterr() assert 'hello' in captured.out

Correct approach:def test_subprocess_output(): import subprocess result = subprocess.run(['echo', 'hello'], capture_output=True, text=True) assert 'hello' in result.stdout

Root cause:Misunderstanding capfd scope; it only captures current process output, not subprocesses.

#2Calling readouterr() multiple times expecting cumulative output.

Wrong approach:def test_multiple_reads(capfd): print('first') out1 = capfd.readouterr().out print('second') out2 = capfd.readouterr().out assert 'first' in out2 # This fails

Correct approach:def test_multiple_reads(capfd): print('first') out1 = capfd.readouterr().out print('second') out2 = capfd.readouterr().out assert 'first' in out1 assert 'second' in out2

Root cause:Not knowing readouterr() clears the buffer after each call.

#3Assuming capfd only captures print() output.

Wrong approach:def test_logging_output(capfd): import logging logging.error('error message') captured = capfd.readouterr() assert 'error message' not in captured.err # This fails

Correct approach:def test_logging_output(capfd): import logging logging.error('error message') captured = capfd.readouterr() assert 'error message' in captured.err

Root cause:Believing capfd only captures print(), not realizing it captures all writes to file descriptors.

Key Takeaways

capfd captures output by redirecting the system's stdout and stderr file descriptors, allowing tests to inspect printed and error messages.

It captures output at a low level, so it works even for native code or C extensions writing directly to these streams.

capfd separates stdout and stderr, enabling precise testing of normal and error outputs independently.

Each call to capfd.readouterr() returns and clears the captured output, so repeated calls only get new output.

capfd does not capture output from subprocesses; other methods are needed for that.