0
0
Prompt Engineering / GenAIml~5 mins

Load balancing for AI services in Prompt Engineering / GenAI - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is load balancing in AI services?
Load balancing is the process of distributing incoming AI service requests evenly across multiple servers or resources to ensure no single server is overwhelmed, improving speed and reliability.
Click to reveal answer
beginner
Why is load balancing important for AI services?
It helps keep AI services fast and available by preventing any one server from getting too busy, which can cause delays or crashes.
Click to reveal answer
intermediate
Name two common load balancing methods used in AI services.
Round Robin (requests go to servers in order) and Least Connections (requests go to the server with the fewest active connections).
Click to reveal answer
intermediate
How does load balancing improve fault tolerance in AI services?
If one server fails, load balancers redirect requests to other healthy servers, keeping the AI service running smoothly.
Click to reveal answer
intermediate
What role does health checking play in load balancing for AI services?
Health checks monitor servers to ensure they are working well; load balancers use this info to avoid sending requests to servers that are down or slow.
Click to reveal answer
What does load balancing do in AI services?
ADeletes old AI data
BIncreases the size of AI models
CReduces the number of AI service users
DDistributes requests evenly across servers
Which load balancing method sends requests to the server with the fewest active connections?
ALeast Connections
BWeighted Distribution
CRandom Selection
DRound Robin
How does load balancing help if one AI server crashes?
AIt redirects requests to other servers
BIt shuts down all servers
CIt deletes user data
DIt slows down the service
What is a health check in load balancing?
AA method to speed up AI predictions
BA way to train AI models
CA test to see if servers are working well
DA tool to increase server storage
Which of these is NOT a benefit of load balancing for AI services?
ABetter reliability
BIncreased server crashes
CImproved speed
DFault tolerance
Explain in your own words why load balancing is important for AI services.
Think about what happens if one server gets too many requests.
You got /4 concepts.
    Describe two common methods of load balancing and how they decide where to send requests.
    One method cycles through servers, the other checks how busy servers are.
    You got /3 concepts.