Bash Scripting · ~15 mins

Lock files for single instance in Bash Scripting - Deep Dive

Overview - Lock files for single instance
What is it?
Lock files are special files used in scripting to ensure that only one instance of a script or program runs at a time. They act like a 'reserved seat' sign, preventing other copies from starting while one is already running. This avoids conflicts or errors caused by multiple instances working on the same resources simultaneously. Lock files are simple but powerful tools for managing script execution safely.
Why it matters
Without lock files, multiple copies of a script could run at the same time, causing problems like data corruption, duplicated work, or system overload. For example, if two scripts try to update the same file simultaneously, the file could become broken or inconsistent. Lock files prevent these issues by making sure only one script runs at once, keeping systems stable and reliable.
Where it fits
Before learning lock files, you should understand basic shell scripting and how scripts run on a system. After mastering lock files, you can explore more advanced process control techniques like semaphores, job scheduling, or systemd services for managing script execution.
Mental Model
Core Idea
A lock file is a simple marker that signals 'this script is running' so no other instance starts until it finishes.
Think of it like...
It's like putting a 'Do Not Disturb' sign on a hotel room door; it tells others to wait until you're done before entering.
┌───────────────┐
│ Script Start  │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ Check Lock    │───No───► Start Script
│ File Exists?  │
└──────┬────────┘
       │Yes
       ▼
┌───────────────┐
│ Exit or Wait  │
└───────────────┘

After script finishes:
┌───────────────┐
│ Remove Lock   │
│ File          │
└───────────────┘
Build-Up - 7 Steps
1
Foundation: What is a Lock File
🤔
Concept: Introduce the idea of a lock file as a simple file that marks a script is running.
A lock file is just a normal file created by a script when it starts. Its presence means 'I'm running now.' When the script finishes, it deletes this file. Other scripts check for this file before starting. If the file exists, they know another instance is running and will not start.
Result
Scripts can detect if another instance is running by checking the lock file's existence.
Understanding that a lock file is just a simple file helps grasp how scripts communicate their running state without complex tools.
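The idea above fits in a few lines of bash. This is a minimal sketch; the path /tmp/myscript.lock is an arbitrary example, and the check alone does not yet create or remove anything:

```shell
#!/bin/bash
# Hypothetical lock path; any writable location works.
LOCKFILE=/tmp/myscript.lock

# The file's mere existence is the signal "I'm running now."
if [ -e "$LOCKFILE" ]; then
    echo "Lock file found: another instance may be running." >&2
    exit 1
fi
echo "No lock file: safe to start."
```

This only detects the lock; creating and removing it is the next step.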
2
Foundation: Creating and Removing Lock Files
🤔
Concept: Learn how to create and remove lock files safely in a script.
In bash, you create a lock file using commands like 'touch /tmp/myscript.lock'. Before starting work, the script checks if this file exists. If not, it creates it and continues. At the end, it removes the file with 'rm /tmp/myscript.lock'. This ensures the lock only exists while the script runs.
Result
A script that creates a lock file at start and removes it at end, preventing multiple runs.
Knowing how to create and remove lock files is the foundation for controlling script concurrency.
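A minimal create-and-remove sketch (the lock path is again a hypothetical example). Note that the existence check and the touch are two separate steps here, which step 3 revisits:

```shell
#!/bin/bash
LOCKFILE=/tmp/myscript.lock   # hypothetical lock path

# Check for an existing lock before starting.
if [ -e "$LOCKFILE" ]; then
    echo "Script already running." >&2
    exit 1
fi

touch "$LOCKFILE"             # claim the lock
echo "Doing the real work..."
rm -f "$LOCKFILE"             # release the lock when done
```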
3
Intermediate: Avoiding Race Conditions with Atomic Operations
🤔 Before reading on: do you think simply checking if a lock file exists then creating it is always safe? Commit to yes or no.
Concept: Introduce atomic file creation to avoid two scripts creating the lock file simultaneously.
If two scripts check for the lock file at the same time and both see it missing, they might both create it, causing conflicts. To avoid this, use atomic operations like 'ln' (link) or 'mkdir', which succeed or fail as a single indivisible step. For example, 'mkdir /tmp/myscript.lockdir' will succeed for only one script. This prevents the race condition.
Result
Only one script can create the lock directory; others fail immediately and know the script is running.
Understanding atomic operations prevents subtle bugs where multiple scripts think they have the lock.
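A sketch of atomic locking with mkdir (the directory name is an arbitrary choice). The single mkdir call replaces the separate check-then-create steps:

```shell
#!/bin/bash
LOCKDIR=/tmp/myscript.lockdir   # hypothetical lock directory

# mkdir either creates the directory or fails -- one atomic step,
# so at most one instance can win the race.
if mkdir "$LOCKDIR" 2>/dev/null; then
    echo "Lock acquired."
    # ... real work ...
    rmdir "$LOCKDIR"            # release the lock
else
    echo "Another instance holds the lock." >&2
    exit 1
fi
```

Using a directory rather than a file is a common trick precisely because mkdir's create-or-fail behavior is atomic on local filesystems.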
4
Intermediate: Handling Stale Lock Files
🤔 Before reading on: do you think a lock file always means a script is currently running? Commit to yes or no.
Concept: Learn how to detect and handle lock files left behind by crashed or killed scripts.
Sometimes a script crashes and never removes the lock file. This 'stale' lock blocks new runs forever. To handle this, scripts can store their process ID (PID) inside the lock file. Before starting, the script reads the PID and checks if that process is still running. If not, it removes the stale lock and proceeds.
Result
Scripts avoid being blocked by stale locks and can recover from crashes safely.
Knowing how to detect stale locks makes scripts more robust and reliable in real-world use.
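A sketch of the PID-in-lock-file pattern (path is hypothetical). 'kill -0' sends no signal; it only tests whether a process with that PID exists:

```shell
#!/bin/bash
LOCKFILE=/tmp/myscript.lock   # hypothetical lock path

if [ -e "$LOCKFILE" ]; then
    oldpid=$(cat "$LOCKFILE")
    # kill -0 sends no signal; it only checks that the PID is alive.
    if kill -0 "$oldpid" 2>/dev/null; then
        echo "Instance with PID $oldpid is still running; exiting." >&2
        exit 1
    fi
    echo "Removing stale lock left by PID $oldpid."
    rm -f "$LOCKFILE"
fi

echo $$ > "$LOCKFILE"         # record our own PID inside the lock
# ... real work ...
rm -f "$LOCKFILE"
```

One caveat: PIDs are recycled by the OS, so a stale lock can occasionally point at an unrelated live process and look valid. Step 7 discusses the remaining race conditions in this pattern.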
5
Intermediate: Using flock for Simpler Locking
🤔
Concept: Introduce the 'flock' command as a simpler way to manage locks without manual files.
'flock' is a Linux command that manages locks on files automatically. You run your script with 'flock /tmp/myscript.lock -c "your_command"'. It blocks if another instance holds the lock and runs your command only when safe. This avoids manual lock file handling and race conditions.
Result
Scripts can use 'flock' to ensure single instance execution with less code and fewer errors.
Knowing about 'flock' helps write cleaner scripts and avoid reinventing locking logic.
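A sketch using flock, which ships with util-linux on most Linux systems. The -n flag makes it fail immediately instead of waiting for the lock:

```shell
#!/bin/bash
# flock opens the lock file itself and holds an exclusive kernel lock
# while the -c command runs; -n means "fail at once if already locked".
flock -n /tmp/myscript.lock -c 'echo "Got the lock; doing work."' \
    || echo "Another instance holds the lock." >&2
```

Because the kernel releases the lock when the process exits, even a crash cannot leave a stale lock behind, which is flock's main advantage over hand-rolled lock files.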
6
Advanced: Lock Files in Distributed Systems
🤔 Before reading on: do you think local lock files work the same on multiple machines? Commit to yes or no.
Concept: Explore the challenges of using lock files when scripts run on different machines sharing storage.
In distributed systems, scripts on different machines may share a network file system. Lock files here can cause problems due to delays or caching. Special distributed locking tools or protocols like 'etcd' or 'Zookeeper' are used instead. Simple lock files may not guarantee single instance across machines.
Result
Learners understand the limits of lock files and when to use advanced distributed locking.
Recognizing the limits of lock files prevents false confidence in multi-machine environments.
7
Expert: Race Conditions in Lock Removal and Recovery
🤔 Before reading on: do you think removing stale locks is always safe without causing conflicts? Commit to yes or no.
Concept: Understand subtle race conditions when multiple scripts try to remove or recreate locks simultaneously.
When detecting stale locks, two scripts might both decide to remove the lock and start running. This can cause multiple instances despite locking. To avoid this, scripts must re-check the lock after removal or use atomic operations for lock creation. Also, signals and traps can help clean locks on script exit.
Result
Scripts handle edge cases safely, preventing rare but critical concurrency bugs.
Knowing these subtle race conditions is key to building bulletproof locking in production.
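The advice above can be combined into one pattern: acquire the lock atomically (so there is no remove-then-recreate window to race on) and install traps so the lock is released on any exit path. Paths are hypothetical:

```shell
#!/bin/bash
LOCKDIR=/tmp/myscript.lockdir   # hypothetical lock directory

# Atomic acquisition: losing the mkdir race means another instance
# won, with no separate check-remove-create window to race on.
mkdir "$LOCKDIR" 2>/dev/null || { echo "Already running." >&2; exit 1; }

# The EXIT trap releases the lock on normal exit; INT and TERM are
# converted into an exit so the EXIT trap also fires on Ctrl-C or kill.
trap 'rmdir "$LOCKDIR"' EXIT
trap 'exit 1' INT TERM

echo "Working under lock..."
# ... real work ...
```

Installing the trap only after mkdir succeeds matters: otherwise a losing instance would remove the winner's lock on its way out.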
Under the Hood
Lock files work by creating a file that signals a script is running. The operating system manages file creation and deletion. Atomic operations like 'mkdir' or 'ln' ensure only one script can create the lock at a time. Scripts check for the lock file's existence before running. If the lock exists, they wait or exit. When the script finishes, it deletes the lock file, releasing the lock. The OS file system guarantees atomicity of these operations, preventing simultaneous creation.
Why designed this way?
Lock files were designed as a simple, universal way to coordinate scripts without complex inter-process communication. Early systems lacked advanced locking tools, so using files was a practical solution. Atomic file system operations provide a reliable way to avoid race conditions. Alternatives like semaphores or message queues are more complex and not always available in shell scripting environments.
┌───────────────┐
│ Script Start  │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ Atomic Lock   │
│ Creation      │
└──────┬────────┘
       │Success
       ▼
┌───────────────┐
│ Script Runs   │
│ with Lock     │
└──────┬────────┘
       │
       ▼
┌───────────────┐
│ Lock Removed  │
│ on Exit       │
└───────────────┘
Myth Busters - 4 Common Misconceptions
Quick: Does the presence of a lock file always mean the script is running? Commit to yes or no.
Common Belief: If a lock file exists, the script is definitely running.
Reality: A lock file can be left behind if the script crashes or is killed, causing a stale lock.
Why it matters: Believing this causes scripts to never run again, blocking important tasks indefinitely.
Quick: Is checking for a lock file then creating it separately always safe? Commit to yes or no.
Common Belief: Checking if a lock file exists before creating it is enough to prevent multiple instances.
Reality: This can cause race conditions where two scripts create the lock simultaneously.
Why it matters: Ignoring this leads to multiple script instances running, causing conflicts and errors.
Quick: Can local lock files guarantee single instance across multiple machines? Commit to yes or no.
Common Belief: Lock files on shared network storage work the same as local locks for multiple machines.
Reality: Network delays and caching can cause lock files to be unreliable across machines.
Why it matters: Relying on local lock files in distributed systems can cause multiple instances and data corruption.
Quick: Is removing a stale lock file always safe without extra checks? Commit to yes or no.
Common Belief: Any script can safely remove a stale lock file and start running.
Reality: Multiple scripts might remove the lock simultaneously, causing multiple instances.
Why it matters: This subtle race condition can break the single instance guarantee and cause hard-to-debug errors.
Expert Zone
1
Lock files should be created using atomic operations like 'mkdir' or 'ln' to avoid race conditions, not just 'touch'.
2
Storing the process ID inside the lock file allows scripts to detect if the locking process is still alive, preventing stale locks.
3
Using shell traps to remove lock files on script exit or interruption prevents stale locks caused by unexpected termination.
When NOT to use
Lock files are not suitable for distributed systems where scripts run on multiple machines sharing storage. In such cases, use distributed locking services like etcd, Zookeeper, or Redis locks that handle network delays and consistency.
Production Patterns
In production, scripts often use 'flock' for simple locking or create lock directories atomically. They store PIDs in lock files and use traps to clean up. Monitoring tools watch for stale locks and alert operators. For complex systems, distributed locks or job schedulers ensure single instance execution.
Connections
Mutex in Programming
Lock files are a filesystem-based form of mutex (mutual exclusion) used to prevent concurrent access.
Understanding lock files helps grasp the general concept of mutexes used in programming to avoid conflicts.
Database Transactions
Both lock files and database transactions manage access to shared resources to keep data consistent.
Knowing how lock files work clarifies how databases use locks to prevent conflicting changes.
Traffic Lights in Road Systems
Lock files act like traffic lights controlling when scripts can proceed, preventing crashes like cars colliding.
Seeing lock files as traffic control helps understand the importance of coordination in concurrent systems.
Common Pitfalls
#1 Creating a lock file by checking existence then creating it separately causes race conditions.
Wrong approach:
if [ ! -f /tmp/myscript.lock ]; then
  touch /tmp/myscript.lock
fi
# proceed with script
Correct approach:
if mkdir /tmp/myscript.lockdir 2>/dev/null; then
  # proceed with script
else
  echo "Script already running"
  exit 1
fi
Root cause: The separate check and create steps are not atomic, allowing two scripts to create the lock simultaneously.
#2 Not removing lock files on script exit causes stale locks blocking future runs.
Wrong approach:
touch /tmp/myscript.lock
# script work
# script ends without removing lock
Correct approach:
mkdir /tmp/myscript.lockdir || exit 1
trap 'rmdir /tmp/myscript.lockdir' EXIT
# script work
# lock removed automatically on exit
Root cause: Ignoring cleanup on exit leads to leftover lock files that block new script instances. Note the trap is installed only after the lock is acquired; setting it first would let a losing instance delete the winner's lock on exit.
#3 Assuming lock files work the same on multiple machines with shared storage.
Wrong approach: Scripts on different servers create /shared/myscript.lock without extra coordination.
Correct approach: Use distributed locking tools like etcd or Redis to coordinate locks across machines.
Root cause: Network file systems have caching and delay issues that break simple lock file assumptions.
Key Takeaways
Lock files are simple files that signal a script is running to prevent multiple instances.
Atomic creation of lock files or directories is essential to avoid race conditions.
Scripts must handle stale lock files by checking if the locking process is still alive.
The 'flock' command offers a simpler and safer way to manage locks in bash scripts.
Lock files have limits in distributed systems where specialized tools are needed.