Introduction
Scaling rules help your container apps automatically adjust the number of running instances based on demand. This keeps your app responsive during busy times and saves money when traffic is low.
When your app needs to handle more users during certain hours without manual intervention
When you want to save costs by reducing resources during low traffic periods
When you want your app to respond quickly to sudden spikes in requests
When you want to maintain performance by adding more instances as CPU or memory usage grows
When you want to scale based on custom metrics like queue length or HTTP request count