Introduction
When you run an app on Kubernetes, you tell it how many copies (replicas) you want. Kubernetes then works to make sure the actual number of running copies matches what you asked for. This helps keep your app available and balanced.
When you want to make sure your app always has a certain number of copies running for reliability.
When you update your app and want to increase or decrease the number of running copies smoothly.
When you want to check if Kubernetes is keeping your app healthy by matching the desired number of copies.
When you notice your app is slow or down and want to see if the number of running copies is less than expected.
When you want to scale your app up or down based on user demand.