Bird
Raised Fist0
Agentic AIml~20 mins

Self-improving agents in Agentic AI - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
Self-Improver Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
2:00remaining
What is a key characteristic of a self-improving agent?

Which of the following best describes a self-improving agent in AI?

AAn agent that randomly changes its behavior without evaluation.
BAn agent that requires manual updates from developers to improve.
CAn agent that can modify its own code or parameters to improve performance over time.
DAn agent that only follows fixed rules without any learning capability.
Attempts:
2 left
💡 Hint

Think about what makes an agent improve itself without external help.

Model Choice
intermediate
2:00remaining
Choosing a model architecture for a self-improving agent

You want to build a self-improving agent that learns from its past decisions and adapts its strategy. Which model architecture is most suitable?

AA reinforcement learning model with policy gradient methods.
BA simple linear regression model without feedback.
CA clustering algorithm that groups data points.
DA static decision tree with fixed rules.
Attempts:
2 left
💡 Hint

Consider models that learn from interaction and improve policies over time.

Metrics
advanced
2:00remaining
Evaluating improvement in a self-improving agent

Which metric is most appropriate to measure the improvement of a self-improving agent over multiple training episodes?

AAverage cumulative reward per episode.
BMean squared error on a fixed test set.
CNumber of parameters in the model.
DTraining time per epoch.
Attempts:
2 left
💡 Hint

Think about a metric that reflects how well the agent performs its task over time.

🔧 Debug
advanced
2:00remaining
Debugging a self-improving agent stuck in local optimum

An agent using reinforcement learning is stuck with poor performance and does not improve despite training. What is the most likely cause?

AThe agent's model architecture is too complex.
BThe agent's learning rate is too high, causing unstable updates.
CThe reward function is too sparse, providing no feedback.
DThe agent's exploration rate is too low, causing it to exploit suboptimal actions.
Attempts:
2 left
💡 Hint

Consider why the agent might not try new actions to find better solutions.

Predict Output
expert
2:00remaining
Output of a self-modifying agent code snippet

Consider this Python code simulating a simple self-improving agent that updates its parameter to maximize reward. What is the output after running it?

Agentic AI
class SelfImprovingAgent:
    def __init__(self):
        self.param = 0
    def reward(self):
        return -(self.param - 5) ** 2 + 10
    def improve(self):
        best_param = self.param
        best_reward = self.reward()
        for delta in [-1, 1]:
            candidate = self.param + delta
            candidate_reward = -(candidate - 5) ** 2 + 10
            if candidate_reward > best_reward:
                best_param = candidate
                best_reward = candidate_reward
        self.param = best_param
agent = SelfImprovingAgent()
for _ in range(3):
    agent.improve()
print(agent.param)
A4
B3
C5
D2
Attempts:
2 left
💡 Hint

Trace the parameter updates step by step to see where it converges after 3 improvements.