Overview - Recursive function in assembly

What is it?

A recursive function in assembly is a function that calls itself to solve smaller parts of a problem until it reaches a simple case. In ARM assembly, this means writing instructions that manage the function calls, parameters, and return addresses manually. Recursion helps break down complex problems into simpler ones by repeating the same process. It requires careful handling of the stack to keep track of each call's state.

Why it matters

Recursion allows programmers to solve problems that naturally fit a divide-and-conquer approach, like calculating factorials or traversing trees. Without recursion, these problems would require complex loops and manual state management, making code harder to write and understand. Understanding recursion in assembly reveals how high-level languages manage function calls and memory, deepening your grasp of computer operation.

Where it fits

Before learning recursion in assembly, you should understand basic ARM assembly instructions, function calls, and stack operations. After mastering recursion, you can explore advanced topics like optimizing recursive calls with tail call optimization or implementing complex algorithms directly in assembly.

Mental Model

Core Idea

A recursive function solves a problem by calling itself with simpler inputs until reaching a base case, using the stack to remember each call's context.

Think of it like...

It's like a set of nested Russian dolls where each doll contains a smaller one inside; you open each doll until you reach the smallest, then close them back in reverse order.

┌───────────────┐
│ Recursive Call│
│  (n)          │
│   ┌─────────┐ │
│   │ Call    │ │
│   │ (n-1)   │ │
│   └─────────┘ │
│ Base Case     │
└──────┬────────┘
       │
       ▼
  Return results up the chain

Build-Up - 7 Steps

1

FoundationUnderstanding ARM Function Calls

Concept: Learn how ARM assembly handles function calls, including saving return addresses and passing parameters.

In ARM, the 'BL' instruction calls a function by branching and linking, saving the return address in the link register (LR). Parameters are passed in registers R0 to R3. The called function must save any registers it will modify on the stack and restore them before returning with 'BX LR'.

Result

You can write simple functions that accept inputs and return outputs using ARM calling conventions.

Understanding how ARM manages function calls is essential before adding recursion, as recursion relies on repeated calls and returns.

2

FoundationStack Basics in ARM Assembly

3

IntermediateImplementing Base Case in Assembly

4

IntermediateSaving State for Recursive Calls

5

IntermediatePassing Updated Parameters Recursively

6

AdvancedCombining Results After Recursive Return

7

ExpertOptimizing Recursion with Tail Calls

Under the Hood

Each recursive call in ARM assembly creates a new stack frame by pushing the return address and parameters onto the stack. The CPU uses the link register (LR) to store the return address temporarily, but since recursion involves multiple calls, LR must be saved on the stack to avoid overwriting. The stack pointer (SP) moves down to allocate space for each call. When a base case is reached, the function returns, popping the stack frame and restoring the previous state. This process repeats until all calls complete, unwinding the recursion.

Why designed this way?

ARM architecture uses a link register for efficient function returns, but recursion requires saving LR on the stack to handle multiple calls. The stack-based approach allows flexible, nested function calls without fixed memory allocation. This design balances speed and memory use, enabling both simple and complex call patterns. Alternatives like static allocation would limit recursion depth and flexibility.

┌───────────────┐
│ Function Call │
│  Save LR, R0 │
│  Push Stack  │
│     ↓        │
│ New Stack    │
│ Frame Created│
│     ↓        │
│ Recursive   │
│ Call Again  │
│     ↓        │
│ Base Case   │
│ Return      │
│ Pop Stack   │
│ Restore LR  │
└───────────────┘

Myth Busters - 3 Common Misconceptions

Quick: Does the link register (LR) automatically save itself during recursion? Commit yes or no.

Common Belief:The link register (LR) always keeps the return address safe during recursive calls without manual saving.

Tap to reveal reality

Quick: Is it safe to modify input registers after the recursive call returns? Commit yes or no.

Common Belief:You can freely modify input registers after the recursive call returns without saving them.

Tap to reveal reality

Quick: Does recursion always use more stack space than iteration? Commit yes or no.

Common Belief:Recursion always consumes more stack space than iterative solutions.

Tap to reveal reality

Expert Zone

1

Recursive functions in ARM must carefully manage not only LR but also callee-saved registers if used, to avoid subtle bugs.

2

Tail call optimization requires the recursive call to be the final action; any computation after the call disables this optimization.

3

Stack alignment is critical on ARM; failing to keep the stack 8-byte aligned can cause crashes on some processors.

When NOT to use

Avoid recursion in assembly when the problem depth is very large and tail call optimization is not possible, as this can cause stack overflow. Instead, use iterative loops or convert recursion to iteration with explicit stacks.

Production Patterns

In embedded systems, recursive assembly functions are rare but used for simple algorithms like factorial or Fibonacci. Professionals often write recursive logic in C and compile to assembly, but understanding manual recursion helps optimize critical sections and debug low-level issues.

Connections

Stack Data Structure

Recursion relies on the stack data structure to save function states and return addresses.

Understanding the stack as a last-in-first-out structure clarifies how recursive calls are managed and unwound.

Mathematical Induction

Recursion in programming mirrors mathematical induction by solving a base case and building up solutions.

Knowing induction helps grasp why recursion works logically and how base cases ensure correctness.

Russian Doll Nesting

Recursion's nested calls resemble nested objects in real life, where each contains a smaller version inside.

This cross-domain pattern shows how complex problems can be broken down into simpler, self-similar parts.

Common Pitfalls

#1Not saving the link register (LR) before recursive call.

Wrong approach:BL recursive_function ; no PUSH of LR before call

Correct approach:PUSH {LR} BL recursive_function POP {LR}

Root cause:Assuming LR is preserved automatically, leading to corrupted return addresses.

#2Failing to implement a base case, causing infinite recursion.

Wrong approach:recursive_function: ; no base case check SUB R0, R0, #1 BL recursive_function BX LR

Correct approach:recursive_function: CMP R0, #0 BEQ base_case SUB R0, R0, #1 BL recursive_function base_case: MOV R0, #1 BX LR

Root cause:Missing termination condition causes endless calls and stack overflow.

#3Modifying input registers after recursive call without restoring them.

Wrong approach:PUSH {LR} SUB R0, R0, #1 BL recursive_function MUL R0, R0, R1 POP {LR} BX LR

Correct approach:PUSH {R1, LR} SUB R0, R0, #1 BL recursive_function MUL R0, R0, R1 POP {R1, LR} BX LR

Root cause:Not saving registers that hold important data leads to corrupted results.

Key Takeaways

Recursive functions in ARM assembly require manual management of the stack and registers to preserve each call's context.

A base case is essential to stop recursion and prevent infinite loops that crash the program.

Saving the link register (LR) on the stack before recursive calls is critical to ensure correct return addresses.

Tail call optimization can improve recursion efficiency by reusing stack frames when the recursive call is the last action.

Understanding recursion in assembly deepens your knowledge of how high-level languages implement function calls and memory management.