Chaos engineering is a practice used in microservices to improve system reliability. What is its main goal?
Think about why we would want to cause failures on purpose in a system.
Chaos engineering intentionally causes failures to find weaknesses early, so teams can fix them before users are affected.
In a microservices architecture, chaos experiments must be controlled to avoid major outages. Which component helps manage this safely?
Consider which part controls when and how failures happen during chaos testing.
The chaos experiment orchestrator schedules and controls failure injections, ensuring experiments run safely and can be stopped if needed.
When applying chaos engineering to a large system with many microservices, what is the best approach to scaling experiments?
Think about minimizing risk while learning about system weaknesses.
Starting small and isolated reduces risk and helps understand system behavior before expanding experiments.
Running chaos experiments in a live production environment can improve reliability but also has risks. What is a key tradeoff?
Consider what happens when you cause failures in a system users rely on.
Chaos experiments can reveal hidden issues but may cause temporary disruptions, so teams must balance learning with user experience.
You have 10 microservices and want to test 3 failure types (e.g., latency, error, crash) on each independently. What is the minimum number of chaos experiments to cover all cases?
Multiply the number of services by the number of failure types for independent tests.
Each microservice needs 3 separate failure tests, so 10 services × 3 failure types = 30 experiments.