Overview - Minimum, maximum, and desired capacity
What is it?
Minimum, maximum, and desired capacity are the three settings of an AWS Auto Scaling group that control how many instances it runs. Minimum capacity is the lower bound: the group does not scale in below this number of instances. Maximum capacity is the upper bound: the group does not scale out above it. Desired capacity is the number of instances the group tries to maintain at any given moment, and it must fall between the minimum and the maximum.
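The relationship between the three values can be sketched in a few lines of Python. This is only an illustrative model and does not call AWS: the names CapacitySettings and clamp_desired are hypothetical, chosen for this example, and the numbers are made up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CapacitySettings:
    """Hypothetical container for the three capacity settings."""
    minimum: int
    maximum: int
    desired: int

    def __post_init__(self) -> None:
        # The three values must satisfy 0 <= minimum <= desired <= maximum.
        if self.minimum < 0:
            raise ValueError("minimum must be >= 0")
        if self.minimum > self.maximum:
            raise ValueError("minimum must be <= maximum")
        if not (self.minimum <= self.desired <= self.maximum):
            raise ValueError("desired must lie between minimum and maximum")


def clamp_desired(requested: int, minimum: int, maximum: int) -> int:
    """Pull a requested instance count back inside [minimum, maximum]."""
    return max(minimum, min(requested, maximum))


# Example values (assumed for illustration only).
settings = CapacitySettings(minimum=2, maximum=10, desired=4)
print(clamp_desired(15, settings.minimum, settings.maximum))  # 10
print(clamp_desired(0, settings.minimum, settings.maximum))   # 2
```

Clamping here simply illustrates that the minimum and maximum act as hard bounds on the instance count; how a real Auto Scaling group handles an out-of-range request is not modeled.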
Why it matters
These settings bound how far automatic scaling can adjust resources as demand changes, which helps balance cost against performance. Without sensible limits, you might pay for more servers than you need, or have too few to handle traffic, causing slow or failed requests. Set well, they help your application stay responsive without overspending.
Where it fits
Before learning this, you should understand what cloud servers and Auto Scaling groups are. After this, you can learn about scaling policies and alarms that trigger capacity changes automatically.