Horizontal Pod Autoscaler in Microservices - Scalability & System Analysis

| Users / Load | Pods | CPU / Memory Usage | Response Time | Autoscaler Behavior |
|---|---|---|---|---|
| 100 users | 1-2 pods | Low (10-30%) | Fast (low latency) | Minimal scaling, stable pod count |
| 10,000 users | 5-10 pods | Moderate (50-70%) | Good (slight increase) | Pods scale up automatically to handle load |
| 1,000,000 users | 1,000-2,000 pods | High (70-90%) | Acceptable (some latency) | Frequent scaling events, possible cooldown delays |
| 100,000,000 users | 100,000+ pods (cluster limits) | Very high (near max) | Degraded (high latency) | Autoscaler hits cluster or resource limits; scaling bottlenecks |
As load grows, the earliest slowdowns come from the autoscaler's reaction time and API server rate limits: metrics are sampled periodically, so existing pods can be temporarily overloaded before new replicas come up. The harder bottleneck is cluster resource limits such as CPU, memory, and the maximum pod count per node or cluster; when the autoscaler tries to add pods beyond those limits, scheduling fails due to insufficient resources.
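The core scaling rule the HPA applies can be sketched in a few lines. This is a simplified model of the documented formula, desiredReplicas = ceil(currentReplicas * currentMetricValue / targetMetricValue); the real controller also applies stabilization windows, scale-rate policies, and min/max replica bounds, which are omitted here:

```python
import math

def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     tolerance: float = 0.1) -> int:
    """Simplified HPA rule: scale pod count in proportion to how far
    the observed metric is from the target, rounding up."""
    ratio = current_metric / target_metric
    # Within the tolerance band, the HPA leaves the replica count alone
    # to avoid flapping on small metric fluctuations.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)

# 8 pods at 90% average CPU against a 60% target -> scale up to 12 pods.
print(desired_replicas(8, 0.90, 0.60))
```

Note the ceiling: the controller prefers slightly over-provisioning to running hot, which is also why scale-downs are damped by cooldown/stabilization periods in practice.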
- Horizontal scaling: Add more nodes to the cluster to increase capacity for pods.
- Vertical scaling: Increase node sizes (CPU, memory) to host more pods per node.
- Autoscaler tuning: Adjust thresholds and cooldown periods for faster, stable scaling.
- Pod resource requests/limits: Optimize pod resource definitions to improve packing efficiency.
- Use multiple clusters: Split load across clusters to avoid single cluster limits.
- Implement caching and queueing: Reduce load spikes and smooth traffic to pods.
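The queueing idea in the last bullet can be sketched as a bounded buffer in front of the pods: bursts are absorbed up to a depth limit, excess load is shed, and the backend drains at a fixed rate. This is an illustrative model, not a production queue; the class and parameter names are invented for this sketch:

```python
from collections import deque

class RequestBuffer:
    """Toy model of burst smoothing: pods see at most `drain_rate`
    requests per tick instead of the raw traffic spike."""

    def __init__(self, max_depth: int, drain_rate: int):
        self.queue = deque()
        self.max_depth = max_depth
        self.drain_rate = drain_rate
        self.dropped = 0

    def enqueue(self, request):
        if len(self.queue) >= self.max_depth:
            self.dropped += 1  # shed load instead of overloading pods
        else:
            self.queue.append(request)

    def drain(self):
        """Hand the backend one smoothed batch of work."""
        batch = []
        for _ in range(min(self.drain_rate, len(self.queue))):
            batch.append(self.queue.popleft())
        return batch

# A 150-request burst against a depth-100 buffer: 100 queued, 50 shed.
buf = RequestBuffer(max_depth=100, drain_rate=10)
for i in range(150):
    buf.enqueue(i)
print(len(buf.queue), buf.dropped)  # 100 50
print(len(buf.drain()))             # 10
```

The point of the sketch is the trade-off: a queue converts a latency spike into bounded extra wait time, buying the autoscaler time to add pods before requests are dropped.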
Assuming each pod handles ~1000 concurrent requests:
- At 10,000 users: ~10 pods needed.
- At 1,000,000 users: ~1000 pods needed.
- Each pod requires ~0.5 CPU and 1GB RAM.
- Cluster bandwidth depends on request size; e.g., 1MB per request at 1000 QPS = ~1GB/s network.
- Autoscaler API calls increase with pod count; API server must handle scaling requests efficiently.
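The back-of-envelope numbers above can be turned into a small sizing helper. It uses only the assumptions already stated (~1,000 concurrent requests per pod, 0.5 CPU and 1 GB RAM per pod); adjust the constants for a real service:

```python
import math

# Assumptions from the estimates above.
REQUESTS_PER_POD = 1000   # concurrent requests one pod can handle
CPU_PER_POD = 0.5         # cores
RAM_PER_POD_GB = 1.0      # gigabytes

def estimate(users: int) -> dict:
    """Rough cluster sizing for a given number of concurrent users."""
    pods = math.ceil(users / REQUESTS_PER_POD)
    return {
        "pods": pods,
        "cpu_cores": pods * CPU_PER_POD,
        "ram_gb": pods * RAM_PER_POD_GB,
    }

for users in (10_000, 1_000_000):
    print(users, estimate(users))
```

At 1,000,000 users this yields 1,000 pods, 500 cores, and 1,000 GB of RAM, which makes the node-count and cluster-limit discussion above concrete.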
When discussing Horizontal Pod Autoscaler scaling, start by explaining how it monitors pod metrics (CPU, memory) and adjusts pod count automatically.
Then, describe what happens as load grows: resource limits, API server rate limits, and scaling delays.
Finally, propose concrete solutions like adding nodes, tuning autoscaler settings, and splitting clusters.
This shows understanding of both the autoscaler mechanism and real-world constraints.
Question: Your service handles 1000 QPS. Traffic grows 10x. What do you do first?
Answer: Scale horizontally first: let the Horizontal Pod Autoscaler add pods, after verifying the cluster has enough node capacity for them. Then tune autoscaler thresholds and cooldown periods so it reacts quickly at the new baseline. If cluster limits are reached, add nodes or split the workload across clusters.