
ECS service auto scaling in AWS - Step-by-Step Execution

Process Flow - ECS service auto scaling
Start: ECS Service Running
Monitor Metrics (CPU, Memory)
Check if Metric > Scale Out Threshold?
  Yes: Increase Desired Task Count
  No: Check if Metric < Scale In Threshold?
    Yes: Decrease Desired Task Count
    No: Keep Desired Task Count
Update ECS Service Desired Count
Wait for Stabilization
Repeat Monitoring
ECS service auto scaling monitors resource metrics and adjusts the number of running tasks, raising the desired count when a metric exceeds the scale-out threshold and lowering it when the metric falls below the scale-in threshold.
Execution Sample
1. Monitor CPU usage
2. If CPU > 70%, increase tasks by 1
3. If CPU < 30%, decrease tasks by 1
4. Update ECS service desired count
5. Wait and repeat
This process automatically adjusts ECS tasks based on CPU usage thresholds.
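The decision in steps 2 and 3 can be sketched as a small Python function. This is only an illustration of the threshold logic, not how AWS implements it: the function name `scaling_adjustment` and its parameter names are hypothetical, and the 70% and 30% defaults are taken from the sample above.

```python
def scaling_adjustment(cpu_percent, scale_out_above=70.0, scale_in_below=30.0):
    """Return the change to apply to the desired task count for one check."""
    if cpu_percent > scale_out_above:
        return 1   # step 2: CPU is high, add one task
    if cpu_percent < scale_in_below:
        return -1  # step 3: CPU is low, remove one task
    return 0       # between the thresholds: leave the count alone


desired_count = 2
desired_count += scaling_adjustment(75)  # step 4: apply the new desired count
print(desired_count)  # 3
```

Both comparisons are strict, so a reading of exactly 70% or 30% triggers no change.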
Process Table
| Step | CPU Usage (%) | Condition | Action | Desired Task Count | Notes |
|------|---------------|-----------|--------|--------------------|-------|
| 1 | 50 | 50 > 70? No; 50 < 30? No | No scaling | 2 | Initial state, no change |
| 2 | 75 | 75 > 70? Yes | Scale out +1 | 3 | Increased tasks due to high CPU |
| 3 | 80 | 80 > 70? Yes | Scale out +1 | 4 | Further increase tasks |
| 4 | 65 | 65 > 70? No; 65 < 30? No | No scaling | 4 | CPU normal, no change |
| 5 | 25 | 25 < 30? Yes | Scale in -1 | 3 | Decreased tasks due to low CPU |
| 6 | 20 | 20 < 30? Yes | Scale in -1 | 2 | Further decrease tasks |
| 7 | 35 | 35 > 70? No; 35 < 30? No | No scaling | 2 | CPU normal, no change |
| 8 | 50 | 50 > 70? No; 50 < 30? No | No scaling | 2 | Stable state |
| 9 | Exit | Reached stable desired count | Stop scaling | 2 | Scaling stabilized |
💡 No further scaling actions are taken while CPU usage stays between the two thresholds, because the desired task count already matches the load. Monitoring itself continues.
Status Tracker
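| Variable | Start | After Step 1 | After Step 2 | After Step 3 | After Step 4 | After Step 5 | After Step 6 | After Step 7 | After Step 8 | Final |
|----------|-------|--------------|--------------|--------------|--------------|--------------|--------------|--------------|--------------|-------|
| CPU Usage (%) | 50 | 50 | 75 | 80 | 65 | 25 | 20 | 35 | 50 | 50 |
| Desired Task Count | 2 | 2 | 3 | 4 | 4 | 3 | 2 | 2 | 2 | 2 |
| Scaling Action | None | None | Scale Out +1 | Scale Out +1 | None | Scale In -1 | Scale In -1 | None | None | None |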
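The Status Tracker values can be reproduced by replaying the eight CPU readings through the threshold rule. The sketch below is a simulation only and makes no AWS calls; the function name `replay` and the `min_count` floor of 1 task are assumptions added for illustration and do not come from the walkthrough.

```python
def replay(cpu_readings, start_count=2, scale_out_above=70,
           scale_in_below=30, min_count=1):
    """Replay CPU readings; return one (cpu, action, desired_count) per step."""
    rows = []
    count = start_count
    for cpu in cpu_readings:
        if cpu > scale_out_above:
            count += 1
            action = "Scale Out +1"
        elif cpu < scale_in_below and count > min_count:
            # min_count is an assumed floor so the service never drops to zero
            count -= 1
            action = "Scale In -1"
        else:
            action = "None"
        rows.append((cpu, action, count))
    return rows


readings = [50, 75, 80, 65, 25, 20, 35, 50]  # CPU Usage row, steps 1 to 8
for step, (cpu, action, count) in enumerate(replay(readings), start=1):
    print(f"After Step {step}: CPU={cpu}% action={action} desired={count}")
```

Changing `scale_out_above` or `scale_in_below` and rerunning the loop is a quick way to see how a different threshold would alter the sequence of actions.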
Key Moments - 3 Insights
Why does the desired task count not change when CPU usage is 50%?
A CPU usage of 50% lies between the scale-out threshold (70%) and the scale-in threshold (30%), so no scaling action is taken, as shown in step 1 of the process table.
What happens when CPU usage is above 70% multiple times in a row?
The desired task count increases by 1 each time CPU is above 70%, scaling out the service, as seen in steps 2 and 3.
Why does scaling stop at the end?
Scaling stops when CPU usage stabilizes within thresholds and desired task count matches the load, indicated by 'Exit' in step 9.
Visual Quiz - 3 Questions
Test your understanding
In the process table, what is the desired task count after step 3?
A. 2
B. 4
C. 3
D. 5
💡 Hint
Check the 'Desired Task Count' column at step 3 in the process table.
At which step does the CPU usage first trigger a scale-in action?
A. Step 6
B. Step 4
C. Step 5
D. Step 7
💡 Hint
Look for the first 'Scale in -1' entry in the 'Action' column of the process table.
If the scale-out threshold were lowered to 60%, how would the action at step 4 change?
A. Scale out +1
B. Scale in -1
C. No change, still no scaling
D. Stop scaling
💡 Hint
At step 4 CPU is 65%, which is above 60%, so scaling out would occur.
Concept Snapshot
ECS Service Auto Scaling:
- Monitors resource metrics (CPU, Memory)
- If metric > scale out threshold, increase desired tasks
- If metric < scale in threshold, decrease desired tasks
- Update ECS service desired count accordingly
- Wait for stabilization before next check
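On AWS, rules like these are usually configured through Application Auto Scaling rather than coded by hand: the ECS service is registered as a scalable target, and step scaling policies supply the +1 and -1 adjustments when CloudWatch alarms on the thresholds fire. The sketch below only builds the request parameters as plain dictionaries and sends nothing. The cluster name, service name, policy names, capacity bounds, and 60-second cooldown are all hypothetical values chosen for illustration.

```python
# Hypothetical cluster and service names: replace with your own.
RESOURCE_ID = "service/demo-cluster/demo-service"
DIMENSION = "ecs:service:DesiredCount"

# Parameters for register_scalable_target (capacity bounds are assumed).
scalable_target = {
    "ServiceNamespace": "ecs",
    "ResourceId": RESOURCE_ID,
    "ScalableDimension": DIMENSION,
    "MinCapacity": 1,
    "MaxCapacity": 10,
}


def step_policy(name, adjustment):
    """Build put_scaling_policy parameters for a single-step policy."""
    # Scale-out steps apply above the alarm threshold, scale-in steps below it.
    bound = "MetricIntervalLowerBound" if adjustment > 0 else "MetricIntervalUpperBound"
    return {
        "PolicyName": name,
        "ServiceNamespace": "ecs",
        "ResourceId": RESOURCE_ID,
        "ScalableDimension": DIMENSION,
        "PolicyType": "StepScaling",
        "StepScalingPolicyConfiguration": {
            "AdjustmentType": "ChangeInCapacity",
            "StepAdjustments": [{bound: 0, "ScalingAdjustment": adjustment}],
            "Cooldown": 60,  # assumed wait for stabilization, in seconds
            "MetricAggregationType": "Average",
        },
    }


scale_out = step_policy("cpu-scale-out", 1)   # attach to an alarm on CPU > 70%
scale_in = step_policy("cpu-scale-in", -1)    # attach to an alarm on CPU < 30%

# With credentials configured, these could be sent with boto3:
#   client = boto3.client("application-autoscaling")
#   client.register_scalable_target(**scalable_target)
#   client.put_scaling_policy(**scale_out)
#   client.put_scaling_policy(**scale_in)
```

The thresholds themselves do not appear in the policies. They would live in the CloudWatch alarms that trigger each policy.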
Full Transcript
ECS service auto scaling works by monitoring resource usage such as CPU. When CPU rises above the scale-out threshold, the service increases the number of running tasks to handle more load. When CPU drops below the scale-in threshold, it decreases the number of tasks to save resources. This process repeats continuously to keep the service responsive and efficient. The process table shows CPU values and scaling actions step by step, with the desired task count adjusting up or down. Scaling actions stop once CPU stays between the thresholds and the task count matches the load.