
Self-improving agents in Agentic AI - Model Pipeline Trace

Model Pipeline - Self-improving agents

This pipeline shows how a self-improving agent learns from its environment by updating its own decision-making model, becoming more effective over time.

Data Flow - 5 Stages
Stage 1: Environment Interaction
  Agent observes the current environment state.
  Output: 1 state vector
  Example: Agent sees {'position': 5, 'goal_distance': 10}

Stage 2: Decision Making
  Input: 1 state vector
  Agent uses its current policy model to choose an action.
  Output: 1 action
  Example: Agent decides to move right

Stage 3: Action Execution
  Input: 1 action
  Agent performs the action in the environment.
  Output: 1 new state vector, 1 reward value
  Example: Agent moves right; new state: {'position': 6, 'goal_distance': 9}, reward: +1

Stage 4: Experience Storage
  Input: 1 state, 1 action, 1 reward, 1 new state
  Agent stores the experience tuple for learning; experience memory grows by 1.
  Example: Memory stores (state, action, reward, new state)

Stage 5: Self-Improvement Update
  Input: experience memory
  Agent updates its policy model using stored experiences.
  Output: updated policy model
  Example: Model parameters adjusted to improve future decisions
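The five stages above form a single loop that can be sketched in code. This is a minimal, illustrative sketch, not the page's actual implementation: the toy 1-D environment (`GridWorld1D`), the epsilon-greedy value-learning agent (`SelfImprovingAgent`), and all parameters are assumptions made for the example.

```python
import random

class GridWorld1D:
    """Toy environment: the agent walks along a line toward a goal."""
    def __init__(self, start=5, goal=15):
        self.position = start
        self.goal = goal

    def observe(self):
        # Stage 1: Environment Interaction -> 1 state vector
        return {"position": self.position,
                "goal_distance": self.goal - self.position}

    def step(self, action):
        # Stage 3: Action Execution -> 1 new state vector, 1 reward value
        self.position += 1 if action == "right" else -1
        reward = 1 if action == "right" else -1  # moving toward the goal pays off
        return self.observe(), reward

class SelfImprovingAgent:
    def __init__(self, actions=("left", "right"), lr=0.5, epsilon=0.2):
        self.values = {a: 0.0 for a in actions}  # the "policy model"
        self.memory = []                         # Stage 4: experience storage
        self.lr = lr
        self.epsilon = epsilon

    def choose(self, state):
        # Stage 2: Decision Making (epsilon-greedy over learned action values)
        if random.random() < self.epsilon:
            return random.choice(list(self.values))
        return max(self.values, key=self.values.get)

    def store(self, state, action, reward, new_state):
        # Stage 4: store the experience tuple
        self.memory.append((state, action, reward, new_state))

    def update(self):
        # Stage 5: Self-Improvement Update -- nudge action values toward
        # the rewards actually observed, then clear the memory
        for _, action, reward, _ in self.memory:
            self.values[action] += self.lr * (reward - self.values[action])
        self.memory.clear()

random.seed(0)
env, agent = GridWorld1D(), SelfImprovingAgent()
for _ in range(50):
    state = env.observe()                 # Stage 1
    action = agent.choose(state)          # Stage 2
    new_state, reward = env.step(action)  # Stage 3
    agent.store(state, action, reward, new_state)  # Stage 4
    agent.update()                        # Stage 5
print(agent.values["right"] > agent.values["left"])  # the agent learned "right"
```

After 50 loop iterations the value estimate for "right" exceeds the one for "left", i.e. the agent's own updates have improved its decisions without any external supervision.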
Training Trace - Epoch by Epoch
Loss:
0.8 |********
0.6 |******  
0.4 |****    
0.25|**      
0.15|*       
Epochs ->
Epoch | Loss ↓ | Accuracy ↑ | Observation
  1   | 0.80   | 0.30       | Agent starts with random decisions, low accuracy
  2   | 0.60   | 0.45       | Agent begins learning from experience, accuracy improves
  3   | 0.40   | 0.65       | Agent refines policy, loss decreases steadily
  4   | 0.25   | 0.80       | Agent shows strong improvement in decision making
  5   | 0.15   | 0.90       | Agent converges to effective policy, high accuracy
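The falling-loss pattern in the trace is what any gradient-based update produces. A minimal sketch, assuming a one-parameter model with a squared-error loss (the loss function, learning rate, and numbers are illustrative, not the table's actual values):

```python
# One parameter w fit toward a target by gradient descent:
# the loss shrinks every epoch, echoing the trace above.
target, w, lr = 1.0, 0.0, 0.3   # illustrative numbers
losses = []
for epoch in range(1, 6):
    loss = (w - target) ** 2     # squared-error loss
    losses.append(loss)
    w -= lr * 2 * (w - target)   # gradient step on the single parameter
    print(f"epoch {epoch}: loss {loss:.3f}")
```

Each epoch's loss is strictly smaller than the previous one, which is the qualitative behavior the epoch table records.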
Prediction Trace - 5 Layers
Layer 1: Input State
Layer 2: Policy Model
Layer 3: Action Selection
Layer 4: Environment Response
Layer 5: Experience Storage & Update
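The five layers above amount to a chain of functions, each consuming the previous layer's output. A hedged sketch of one prediction pass (the function names and the linear scoring rule are illustrative assumptions, not the page's model):

```python
def input_state(position, goal):
    # Layer 1: build the input state vector
    return {"position": position, "goal_distance": goal - position}

def policy_model(state):
    # Layer 2: score each action (toy linear rule: prefer closing the distance)
    return {"right": state["goal_distance"], "left": -state["goal_distance"]}

def select_action(scores):
    # Layer 3: pick the highest-scoring action
    return max(scores, key=scores.get)

def environment_response(state, action):
    # Layer 4: apply the action; return the new state and a reward
    step = 1 if action == "right" else -1
    goal = state["position"] + state["goal_distance"]
    new_state = input_state(state["position"] + step, goal)
    reward = 1 if new_state["goal_distance"] < state["goal_distance"] else -1
    return new_state, reward

memory = []
def store_experience(state, action, reward, new_state):
    # Layer 5: append the experience tuple so a later update can use it
    memory.append((state, action, reward, new_state))

state = input_state(5, 15)                         # {'position': 5, 'goal_distance': 10}
action = select_action(policy_model(state))
new_state, reward = environment_response(state, action)
store_experience(state, action, reward, new_state)
print(action, reward, new_state)  # right 1 {'position': 6, 'goal_distance': 9}
```

Starting from the document's example state {'position': 5, 'goal_distance': 10}, this pass chooses "right" and lands in {'position': 6, 'goal_distance': 9} with reward +1, matching the execution example in the pipeline.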
Model Quiz - 3 Questions
Test your understanding
Q1. What does the agent do after receiving a new state from the environment?
A) Stores the experience and updates its policy model
B) Ignores the new state and repeats the last action
C) Randomly chooses an action without using the model
D) Stops learning and waits for user input
Key Insight
Self-improving agents learn by interacting with their environment, storing experiences, and updating their own decision-making models. This continuous loop helps them get better at tasks without external help.