Recall & Review

beginner

What is a Horizontal Pod Autoscaler (HPA) in Kubernetes?

HPA automatically adjusts the number of pods in a deployment based on observed CPU utilization or other select metrics to maintain application performance.

Click to reveal answer

intermediate

Which metrics can Horizontal Pod Autoscaler use to scale pods?

HPA can use CPU utilization, memory usage, or custom metrics like request rate to decide when to scale pods up or down.

Click to reveal answer

intermediate

How does HPA decide when to increase or decrease pod count?

HPA compares current metric values against target thresholds. If usage is above the target, it adds pods; if below, it removes pods to optimize resource use.

Click to reveal answer

advanced

What is the difference between Horizontal Pod Autoscaler and Vertical Pod Autoscaler?

Horizontal Pod Autoscaler changes the number of pods, while Vertical Pod Autoscaler changes the resource requests and limits (CPU/memory) of existing pods.

Click to reveal answer

intermediate

Why is it important to set minimum and maximum pod limits in HPA?

Setting min and max pod limits prevents scaling too low (causing performance issues) or too high (wasting resources), ensuring stable and efficient operation.

Click to reveal answer

What does Horizontal Pod Autoscaler primarily adjust in a Kubernetes cluster?

ANetwork bandwidth

BCPU limits of pods

CMemory limits of pods

DNumber of pods

Which metric is commonly used by HPA to trigger scaling?

ADisk space usage

BCPU utilization

CNumber of nodes

DPod restart count

What happens if the current CPU usage is below the target in HPA?

APods are removed

BPods remain the same

CPods are added

DNodes are added

Which Kubernetes component typically manages the Horizontal Pod Autoscaler?

Akube-controller-manager

Bkube-scheduler

Ckube-proxy

Detcd

Why should you avoid setting the maximum pod count too high in HPA?

AIt slows down pod startup

BIt can cause pod starvation

CIt wastes cluster resources

DIt causes network congestion

Explain how Horizontal Pod Autoscaler works to maintain application performance.

Describe the key differences between Horizontal Pod Autoscaler and Vertical Pod Autoscaler.

Practice

(1/5)

1. What is the primary purpose of a Horizontal Pod Autoscaler in a Kubernetes microservices environment?

easy

A. Store persistent data for pods

B. Manually restart pods when they fail

C. Balance network traffic between pods

D. Automatically adjust the number of pods based on CPU or custom metrics

2. Which of the following is the correct YAML snippet to define a Horizontal Pod Autoscaler targeting CPU utilization at 50% for a deployment named web-app?

easy

A. apiVersion: autoscaling/v2\nkind: HorizontalPodAutoscaler\nmetadata:\n name: web-app-hpa\nspec:\n scaleTargetRef:\n apiVersion: apps/v1\n kind: Deployment\n name: web-app\n minReplicas: 1\n maxReplicas: 5\n metrics:\n - type: Resource\n resource:\n name: cpu\n target:\n type: Utilization\n averageUtilization: 70

B. apiVersion: v1\nkind: Pod\nmetadata:\n name: web-app\nspec:\n containers:\n - name: web-app\n image: web-app:latest

C. apiVersion: autoscaling/v1\nkind: HorizontalPodAutoscaler\nmetadata:\n name: web-app-hpa\nspec:\n scaleTargetRef:\n apiVersion: apps/v1\n kind: Deployment\n name: web-app\n minReplicas: 2\n maxReplicas: 10\n targetCPUUtilizationPercentage: 50

D. apiVersion: autoscaling/v2beta2\nkind: HorizontalPodAutoscaler\nmetadata:\n name: web-app-hpa\nspec:\n scaleTargetRef:\n apiVersion: apps/v1\n kind: Deployment\n name: web-app\n minReplicas: 1\n maxReplicas: 5\n metrics:\n - type: Resource\n resource:\n name: memory\n target:\n type: Utilization\n averageUtilization: 50

Horizontal Pod Autoscaler in Microservices - Cheat Sheet & Quick Revision

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of Horizontal Pod Autoscaler

Step 2: Compare options with this role

Final Answer:

Quick Check:

Solution

Step 1: Identify correct API version and fields for CPU target

Step 2: Check min/max replicas and target CPU utilization

Final Answer:

Quick Check:

Solution

Step 1: Understand scaling formula based on CPU utilization

Step 2: Round up and check min/max limits

Final Answer:

Quick Check:

Solution

Step 1: Check autoscaler dependency on metrics

Step 2: Understand effect of missing metrics

Final Answer:

Quick Check:

Solution

Step 1: Understand HPA multi-metric support

Step 2: Evaluate options for best practice

Final Answer:

Quick Check: