0
0
Agentic AIml~20 mins

Defining success criteria for agents in Agentic AI - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
Agent Success Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
2:00remaining
What best defines a success criterion for an AI agent?

Imagine you have a robot that cleans your house. What is the best way to describe its success criterion?

AThe robot makes a sound when it starts cleaning.
BThe robot moves around without bumping into walls.
CThe robot cleans every room in the house within 30 minutes.
DThe robot has a battery that lasts 2 hours.
Attempts:
2 left
💡 Hint

Think about what shows the robot did its main job well.

Metrics
intermediate
2:00remaining
Which metric best measures success for a delivery drone agent?

A delivery drone must drop packages at correct locations on time. Which metric best shows if it succeeded?

APercentage of packages delivered to the correct location within the scheduled time.
BAverage battery usage during flights.
CNumber of flights completed per day.
DTotal distance flown by the drone.
Attempts:
2 left
💡 Hint

Focus on the main goal: delivering packages correctly and on time.

Model Choice
advanced
2:00remaining
Choosing a success criterion for a reinforcement learning agent

You train a reinforcement learning agent to play a game. Which success criterion is best to decide if training worked?

AThe agent's average score over 100 games exceeds a set threshold.
BThe agent's training loss decreases steadily during training.
CThe agent's model size is smaller than 10MB.
DThe agent's action distribution matches the training data.
Attempts:
2 left
💡 Hint

Think about what shows the agent plays the game well, not just training behavior.

🔧 Debug
advanced
2:00remaining
Why does this success criterion fail to measure agent success?

An agent's success criterion is: 'Agent completes task if total steps taken is less than 50'. The agent finishes tasks but often takes 60 steps. Why is this criterion problematic?

AIt requires the agent to take more than 50 steps to succeed.
BIt measures the agent's speed, which is always irrelevant.
CIt uses total steps, which is not a measurable quantity.
DIt ignores whether the task was actually completed correctly.
Attempts:
2 left
💡 Hint

Think about what the criterion checks versus what the agent actually does.

Hyperparameter
expert
2:00remaining
How does changing the success threshold affect agent training?

An agent's success criterion is reaching a score above a threshold. What happens if you set this threshold too high during training?

AThe agent will learn faster because the goal is more challenging.
BThe agent may never reach success, causing training to stall or fail.
CThe agent ignores the threshold and trains normally.
DThe agent's model size will automatically increase.
Attempts:
2 left
💡 Hint

Consider what happens if the goal is impossible or too hard to reach.