Kubernetes · DevOps · ~15 mins

Node affinity and anti-affinity in Kubernetes - Deep Dive

Overview - Node affinity and anti-affinity
What is it?
Node affinity and anti-affinity are rules in Kubernetes that help decide which nodes a pod should or should not run on. Node affinity lets you specify preferred or required conditions for nodes where pods can be scheduled. Anti-affinity is the opposite; it tells Kubernetes to avoid placing pods on certain nodes based on labels or other criteria. These rules help control pod placement to improve performance, availability, and resource use.
Why it matters
Without node affinity and anti-affinity, Kubernetes might place pods randomly or inefficiently, causing resource waste or failures. For example, critical pods might end up on the same node, risking downtime if that node fails. These rules let you guide Kubernetes to spread pods out or group them, improving reliability and making sure your app runs smoothly.
Where it fits
Before learning node affinity, you should understand basic Kubernetes concepts like pods, nodes, and labels. After mastering node affinity and anti-affinity, you can explore more advanced scheduling features like pod affinity, taints and tolerations, and custom schedulers.
Mental Model
Core Idea
Node affinity and anti-affinity are like setting preferences and restrictions for where your app pieces (pods) can live inside a cluster of computers (nodes).
Think of it like...
Imagine you are organizing guests at a party. Node affinity is like saying 'I want my friends to sit at tables with windows' (preferred spots), while anti-affinity is like saying 'I don’t want my noisy friends sitting at the same table' (avoid certain neighbors).
┌───────────────┐       ┌────────────────┐
│   Kubernetes  │       │     Nodes      │
│   Scheduler   │──────▶│ Node A (label: │
│               │       │ zone=1)        │
│               │       └────────────────┘
│               │       ┌────────────────┐
│               │       │ Node B (label: │
│               │       │ zone=2)        │
└───────────────┘       └────────────────┘

Pod with node affinity: zone=1
Pod with node anti-affinity: avoid zone=2
Build-Up - 7 Steps
Step 1 (Foundation): Understanding Kubernetes Nodes and Pods
Concept: Learn what nodes and pods are in Kubernetes and how pods run on nodes.
In Kubernetes, a node is a worker machine where your applications run. A pod is the smallest unit that holds one or more containers. Pods need to be scheduled onto nodes to run. By default, Kubernetes schedules pods on any available node without special rules.
Result
You understand that pods run on nodes and that scheduling decides which node a pod uses.
Knowing the basic relationship between pods and nodes is essential before controlling where pods run.
Step 2 (Foundation): Using Labels to Identify Nodes
Concept: Nodes can have labels, which are key-value pairs used to describe their attributes.
You can add labels to nodes, like 'zone=us-east' or 'ssd=true'. These labels help Kubernetes and users identify nodes with certain features. Labels are the foundation for affinity rules because they let you select nodes based on their characteristics.
Result
Nodes have labels that can be used to select or avoid them when scheduling pods.
Labels turn nodes into searchable items, enabling targeted scheduling decisions.
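To make label-based selection concrete, here is a small Python sketch (illustrative only, not Kubernetes code; the node names and labels are made up) of picking nodes by a key-value label, the same idea a label selector expresses:

```python
# Hypothetical node inventory: each node carries key-value labels,
# just as `kubectl label nodes <name> zone=us-east` would attach.
nodes = {
    "node-a": {"zone": "us-east", "ssd": "true"},
    "node-b": {"zone": "us-west"},
    "node-c": {"zone": "us-east"},
}

def nodes_with_label(nodes, key, value):
    """Return node names whose labels contain key=value."""
    return sorted(name for name, labels in nodes.items()
                  if labels.get(key) == value)

print(nodes_with_label(nodes, "zone", "us-east"))  # ['node-a', 'node-c']
```

Affinity rules build on exactly this lookup: they describe which label matches a node must (or must not) satisfy before a pod lands on it.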
Step 3 (Intermediate): Defining Node Affinity Rules
🤔 Before reading on: do you think node affinity can only require nodes, or can it also prefer nodes? Commit to your answer.
Concept: Node affinity lets you specify required or preferred conditions for nodes where pods should be scheduled.
Node affinity has two types: requiredDuringSchedulingIgnoredDuringExecution (a hard requirement) and preferredDuringSchedulingIgnoredDuringExecution (a soft preference). Required means the pod will only run on nodes matching the rule. Preferred means Kubernetes tries to schedule on matching nodes but can choose others if needed. Example YAML snippet:

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: zone
            operator: In
            values:
            - us-east
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 1
        preference:
          matchExpressions:
          - key: ssd
            operator: In
            values:
            - "true"
Result
Pods are scheduled only on nodes in 'us-east' zone and preferably on nodes with SSDs.
Understanding the difference between required and preferred affinity helps you balance strictness and flexibility in scheduling.
Step 4 (Intermediate): Using Node Anti-Affinity to Avoid Nodes
🤔 Before reading on: do you think anti-affinity only applies to pods or can it also apply to nodes? Commit to your answer.
Concept: Node anti-affinity tells Kubernetes to avoid scheduling pods on nodes with certain labels or conditions.
Kubernetes has no separate nodeAntiAffinity field; node anti-affinity is expressed inside nodeAffinity using negative operators such as NotIn and DoesNotExist. This prevents pods from running on nodes whose labels match the excluded values, which helps spread pods out or keep them off problematic nodes. Example YAML snippet:

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: disktype
            operator: NotIn
            values:
            - hdd
Result
Pods will not be scheduled on nodes labeled with 'disktype=hdd'.
Knowing how to exclude nodes helps prevent resource conflicts and improve reliability.
Step 5 (Intermediate): Combining Affinity and Anti-Affinity Rules
Concept: You can combine node affinity and anti-affinity rules to precisely control pod placement.
Kubernetes allows you to specify multiple affinity and anti-affinity expressions together. For example, you can require pods to run in a certain zone while avoiding nodes labeled as low-memory. Be careful with the semantics: expressions within a single matchExpressions list are ANDed, while separate entries under nodeSelectorTerms are ORed, so both conditions must sit in the same term. Example YAML snippet:

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: zone
            operator: In
            values:
            - us-west
          - key: memory
            operator: NotIn
            values:
            - low
Result
Pods run only on nodes in 'us-west' zone that do not have 'memory=low' label.
Combining rules lets you tailor scheduling to complex real-world needs.
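The ANDed-within-a-term, ORed-across-terms evaluation can be sketched in a few lines of Python (illustrative only, not scheduler code; only the In and NotIn operators are modeled here):

```python
def expr_matches(labels, expr):
    """Evaluate one match expression against a node's label map."""
    value = labels.get(expr["key"])
    if expr["operator"] == "In":
        return value in expr["values"]
    if expr["operator"] == "NotIn":
        return value not in expr["values"]
    raise ValueError(f"unsupported operator: {expr['operator']}")

def node_matches(terms, labels):
    """Terms are ORed; expressions within one term are ANDed."""
    return any(all(expr_matches(labels, e) for e in term["matchExpressions"])
               for term in terms)

# One term with two ANDed expressions:
# zone In [us-west] AND memory NotIn [low]
terms = [{"matchExpressions": [
    {"key": "zone", "operator": "In", "values": ["us-west"]},
    {"key": "memory", "operator": "NotIn", "values": ["low"]},
]}]

print(node_matches(terms, {"zone": "us-west"}))                   # True
print(node_matches(terms, {"zone": "us-west", "memory": "low"}))  # False
```

Note that a node with no memory label at all still passes the NotIn check: in Kubernetes, a missing key satisfies NotIn.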
Step 6 (Advanced): How the Kubernetes Scheduler Uses Affinity Rules
🤔 Before reading on: do you think the scheduler applies affinity rules before or after checking node resources? Commit to your answer.
Concept: The Kubernetes scheduler filters and scores nodes based on affinity rules along with other factors to decide pod placement.
When scheduling a pod, Kubernetes first filters nodes that do not meet required affinity rules. Then it scores the remaining nodes based on preferred affinity and other criteria like resource availability. The highest scoring node is chosen. This process balances affinity preferences with cluster health and resource use.
Result
Pods are placed on nodes that satisfy hard rules and best match soft preferences while considering resources.
Understanding the scheduler’s decision process helps you write effective affinity rules that work well in real clusters.
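The filter-then-score flow can be modeled as a toy Python function (a sketch of the idea only, far simpler than the real kube-scheduler; rules are reduced to plain key-value pairs for illustration):

```python
def schedule(pod, nodes):
    """Toy two-phase scheduler: filter on required rules, then score
    feasible nodes by preferred-rule weights and pick the best."""
    required = pod["required"]    # list of (key, value) a node MUST have
    preferred = pod["preferred"]  # list of (weight, key, value) bonuses

    # Phase 1: filter out nodes that fail any required rule.
    feasible = [n for n, labels in nodes.items()
                if all(labels.get(k) == v for k, v in required)]
    if not feasible:
        return None  # no feasible node -> the pod stays Pending

    # Phase 2: score the remaining nodes by preferred-rule weights.
    def score(name):
        labels = nodes[name]
        return sum(w for w, k, v in preferred if labels.get(k) == v)

    # Break score ties by name so the result is deterministic.
    return max(feasible, key=lambda n: (score(n), n))

nodes = {
    "node-a": {"zone": "us-east", "ssd": "true"},
    "node-b": {"zone": "us-east"},
    "node-c": {"zone": "us-west", "ssd": "true"},
}
pod = {"required": [("zone", "us-east")],
       "preferred": [(1, "ssd", "true")]}

print(schedule(pod, nodes))  # node-a: in us-east and earns the SSD bonus
```

The None branch previews the step 7 failure mode: when required rules eliminate every node, the pod simply never leaves Pending.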
Step 7 (Expert): Surprising Effects of Affinity in Large Clusters
🤔 Before reading on: do you think strict affinity rules can cause pods to remain unscheduled? Commit to your answer.
Concept: Strict node affinity or anti-affinity can cause pods to stay pending if no nodes match, affecting availability.
In large clusters, overly strict affinity rules can reduce scheduling flexibility. If no nodes meet the required conditions, pods remain unscheduled, causing downtime. Also, combining multiple strict rules can fragment the cluster, reducing resource utilization. Experts monitor scheduling events and adjust rules to balance control and availability.
Result
Pods may remain pending if affinity rules are too restrictive, leading to resource waste and outages.
Knowing the risks of strict affinity prevents common production issues and guides better rule design.
Under the Hood
The Kubernetes scheduler uses a multi-step process: it first filters nodes that do not meet required affinity or anti-affinity rules. Then it scores the remaining nodes based on preferred affinity weights and other factors like resource availability. The scheduler picks the node with the highest score. Affinity rules are evaluated by matching pod selectors against node labels using logical operators like In, NotIn, Exists, and DoesNotExist.
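The approximate semantics of those four operators, evaluated against a node's label map, can be written out in Python (an illustrative sketch, not the scheduler's actual matching code):

```python
# Each operator maps (labels, key, values) -> bool. For Exists and
# DoesNotExist, Kubernetes ignores the values list, so it is unused here.
OPERATORS = {
    "In":           lambda labels, key, values: labels.get(key) in values,
    "NotIn":        lambda labels, key, values: labels.get(key) not in values,
    "Exists":       lambda labels, key, values: key in labels,
    "DoesNotExist": lambda labels, key, values: key not in labels,
}

labels = {"zone": "us-east", "ssd": "true"}
print(OPERATORS["In"](labels, "zone", ["us-east"]))     # True
print(OPERATORS["NotIn"](labels, "disktype", ["hdd"]))  # True (label absent)
print(OPERATORS["Exists"](labels, "ssd", None))         # True
print(OPERATORS["DoesNotExist"](labels, "gpu", None))   # True
```

Note the NotIn line: a node that lacks the key entirely still matches, which is why NotIn-based anti-affinity also admits unlabeled nodes.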
Why designed this way?
Affinity and anti-affinity were designed to give users flexible control over pod placement without hardcoding node names. The separation into required and preferred rules balances strictness and flexibility. This design allows Kubernetes to optimize cluster utilization while respecting user constraints. Alternatives like fixed node assignment were too rigid and did not scale well.
┌────────────────┐
│ Pod to schedule│
└───────┬────────┘
        │
        ▼
┌────────────────┐
│ Filter nodes   │
│ (required      │
│ affinity/anti- │
│ affinity)      │
└───────┬────────┘
        │
        ▼
┌────────────────┐
│ Score nodes    │
│ (preferred     │
│ affinity)      │
└───────┬────────┘
        │
        ▼
┌────────────────┐
│ Select node    │
│ with highest   │
│ score          │
└────────────────┘
Myth Busters - 4 Common Misconceptions
Quick: Does node affinity guarantee pods will always run on preferred nodes? Commit yes or no.
Common Belief: Node affinity always guarantees pods run on preferred nodes.
Reality: Preferred node affinity is a soft rule; pods may run on other nodes if preferred ones are unavailable.
Why it matters: Assuming preferred affinity is strict can lead to confusion when pods run on unexpected nodes.
Quick: Can node anti-affinity be used to avoid nodes based on pod labels? Commit yes or no.
Common Belief: Node anti-affinity can also avoid nodes based on pod labels.
Reality: Node anti-affinity only works with node labels, not pod labels; pod anti-affinity handles pod labels.
Why it matters: Mixing up node and pod anti-affinity causes scheduling failures or unexpected pod placement.
Quick: Does combining multiple required affinity rules always increase scheduling success? Commit yes or no.
Common Belief: More required affinity rules always help schedule pods better.
Reality: Adding multiple required affinity rules can shrink the set of eligible nodes, causing pods to remain unscheduled.
Why it matters: Overly strict rules reduce cluster flexibility and can cause downtime.
Quick: Is node affinity evaluated continuously after pod scheduling? Commit yes or no.
Common Belief: Node affinity is enforced continuously, so pods move if nodes change labels.
Reality: Node affinity is checked only at scheduling time; pods do not move if node labels change later.
Why it matters: Expecting pods to move automatically can cause surprises in cluster behavior.
Expert Zone
1. Preferred affinity weights can be tuned to influence scheduler decisions subtly without blocking scheduling.
2. Node affinity rules are ignored during pod execution, so node label changes after scheduling do not affect running pods.
3. Combining node affinity with taints and tolerations provides powerful control over pod placement and node usage.
When NOT to use
Avoid strict required node affinity in highly dynamic clusters where nodes frequently join or leave. Instead, use preferred affinity or pod affinity to maintain flexibility. For workload spreading, consider pod anti-affinity or topology spread constraints as alternatives.
Production Patterns
In production, node affinity is used to place workloads on nodes with special hardware like GPUs or SSDs. Anti-affinity helps spread replicas across failure zones to improve availability. Teams combine affinity with taints and tolerations to isolate workloads and optimize resource usage.
Connections
Pod affinity and anti-affinity
Builds-on
Understanding node affinity helps grasp pod affinity, which controls pod placement relative to other pods, adding another layer of scheduling control.
Taints and tolerations
Complementary
Node affinity controls where pods prefer or avoid nodes, while taints and tolerations enforce strict node acceptance policies; together they provide full scheduling control.
Constraint satisfaction problems (CSP) in computer science
Same pattern
Node affinity and anti-affinity are practical examples of CSP, where the scheduler solves constraints to find valid pod placements, similar to puzzles or resource allocation problems.
Common Pitfalls
#1 Using required node affinity with labels that no node has.
Wrong approach:

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: nonexistent-label
            operator: In
            values:
            - value1

Correct approach:

spec:
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 1
        preference:
          matchExpressions:
          - key: nonexistent-label
            operator: In
            values:
            - value1

Root cause: Confusing required and preferred affinity causes pods to remain unscheduled if no nodes match.
#2 Mixing node anti-affinity with pod labels instead of node labels.
Wrong approach:

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: app
            operator: In
            values:
            - frontend

Correct approach:

spec:
  affinity:
    podAntiAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchExpressions:
          - key: app
            operator: In
            values:
            - frontend
        topologyKey: kubernetes.io/hostname

Root cause: Confusing node labels with pod labels leads to the wrong affinity type and scheduling errors.
#3 Assuming pods will move if node labels change after scheduling.
Wrong approach: Changing node labels and expecting pods to reschedule automatically.
Correct approach: Manually evict or restart pods to reschedule them after node label changes.
Root cause: Affinity is only checked at scheduling time, not continuously, as the "IgnoredDuringExecution" suffix in the field names indicates.
Key Takeaways
Node affinity and anti-affinity let you control where pods run by specifying node label rules.
Required affinity rules are strict and must be met; preferred rules guide scheduling but allow flexibility.
Anti-affinity helps avoid nodes with certain labels, improving workload distribution and reliability.
Overly strict affinity rules can cause pods to remain unscheduled, so balance control with flexibility.
Affinity works with other Kubernetes features like taints and pod affinity to optimize cluster scheduling.