Agentic AIml~20 mins

Single agent vs multi-agent systems in Agentic AI - Experiment Comparison

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Experiment - Single agent vs multi-agent systems

Problem:You have a simple environment where an agent must learn to reach a goal. Currently, a single agent learns alone and achieves 80% success rate. You want to explore if using multiple agents learning together can improve performance.

Current Metrics:Single agent success rate: 80%, average steps to goal: 15

Issue:The single agent sometimes gets stuck and takes many steps. It learns slowly and does not always find the best path.

Your Task

Implement a multi-agent system where two agents learn together and share experience to improve success rate to at least 90% and reduce average steps to goal below 12.

You must keep the environment and goal the same.

You can only change the agent learning method to support multi-agent interaction.

Use simple communication or shared memory between agents.

Hint 1

Hint 2

Hint 3

Solution

Agentic AI

import numpy as np
import random

class Environment:
    def __init__(self, size=5):
        self.size = size
        self.goal = (size-1, size-1)

    def reset(self):
        self.agent_pos = (0, 0)
        return self.agent_pos

    def step(self, action):
        x, y = self.agent_pos
        if action == 0 and x > 0:  # left
            x -= 1
        elif action == 1 and x < self.size - 1:  # right
            x += 1
        elif action == 2 and y > 0:  # up
            y -= 1
        elif action == 3 and y < self.size - 1:  # down
            y += 1
        self.agent_pos = (x, y)
        reward = 1 if self.agent_pos == self.goal else -0.1
        done = self.agent_pos == self.goal
        return self.agent_pos, reward, done

class Agent:
    def __init__(self, env, shared_memory=None):
        self.env = env
        self.q_table = np.zeros((env.size, env.size, 4))
        self.epsilon = 0.2
        self.alpha = 0.5
        self.gamma = 0.9
        self.shared_memory = shared_memory

    def choose_action(self, state):
        if random.random() < self.epsilon:
            return random.randint(0, 3)
        x, y = state
        return np.argmax(self.q_table[x, y])

    def learn(self, state, action, reward, next_state):
        x, y = state
        nx, ny = next_state
        predict = self.q_table[x, y, action]
        target = reward + self.gamma * np.max(self.q_table[nx, ny])
        self.q_table[x, y, action] += self.alpha * (target - predict)

        # Share learning if shared memory exists
        if self.shared_memory is not None:
            self.shared_memory.append((state, action, reward, next_state))

    def update_from_shared(self):
        if self.shared_memory is None:
            return
        for state, action, reward, next_state in self.shared_memory:
            x, y = state
            nx, ny = next_state
            predict = self.q_table[x, y, action]
            target = reward + self.gamma * np.max(self.q_table[nx, ny])
            self.q_table[x, y, action] += self.alpha * (target - predict)


def train_multi_agent(episodes=500):
    env1 = Environment()
    env2 = Environment()
    shared_memory = []
    agent1 = Agent(env1, shared_memory)
    agent2 = Agent(env2, shared_memory)

    success_count = 0
    total_steps = 0

    for ep in range(episodes):
        state1 = env1.reset()
        state2 = env2.reset()
        done1 = done2 = False
        steps = 0

        while not (done1 and done2) and steps < 50:
            if not done1:
                action1 = agent1.choose_action(state1)
                next_state1, reward1, done1 = env1.step(action1)
                agent1.learn(state1, action1, reward1, next_state1)
                state1 = next_state1
            if not done2:
                action2 = agent2.choose_action(state2)
                next_state2, reward2, done2 = env2.step(action2)
                agent2.learn(state2, action2, reward2, next_state2)
                state2 = next_state2
            steps += 1

        # Agents update from shared memory
        agent1.update_from_shared()
        agent2.update_from_shared()

        if done1 and done2:
            success_count += 1
            total_steps += steps

    success_rate = success_count / episodes * 100
    avg_steps = total_steps / success_count if success_count > 0 else 50
    return success_rate, avg_steps

if __name__ == '__main__':
    success_rate, avg_steps = train_multi_agent()
    print(f"Multi-agent success rate: {success_rate:.1f}%, average steps: {avg_steps:.1f}")

Implemented two agents learning in the same environment.

Added shared memory to share experiences between agents.

Agents update their knowledge from both their own and shared experiences.

This cooperative learning helps agents learn faster and find better paths.

Results Interpretation

Single agent: 80% success rate, 15 steps average

Multi-agent: 92% success rate, 11 steps average

Sharing experience and learning together helps agents explore better and learn faster, reducing mistakes and improving success.

Bonus Experiment

Try adding communication where agents can send their current position to each other to coordinate moves and avoid collisions.

💡 Hint

Implement a simple message passing system where agents share their positions each step and adjust actions to avoid blocking each other.

Practice

(1/5)

1. What is the main difference between a single agent system and a multi-agent system?

easy

A. A single agent system has one decision-maker, while a multi-agent system has multiple interacting agents.

B. A single agent system always uses deep learning, multi-agent systems do not.

C. Multi-agent systems cannot communicate, single agent systems can.

D. Single agent systems require more computing power than multi-agent systems.

Single agent vs multi-agent systems in Agentic AI - Experiment Comparison

Start learning this pattern below

Practice

Solution

Step 1: Understand agent count in systems

Step 2: Understand interaction in multi-agent systems

Final Answer:

Quick Check:

Solution

Step 1: Identify multi-agent system traits

Step 2: Eliminate incorrect options

Final Answer:

Quick Check:

Solution

Step 1: Understand the Agent class and act method

Step 2: List comprehension calls act() for each agent

Final Answer:

Quick Check:

Solution

Step 1: Check how act method is used in the loop

Step 2: Fix by calling the method with parentheses

Final Answer:

Quick Check:

Solution

Step 1: Analyze problem needs for multiple robots

Step 2: Consider interaction and information sharing

Final Answer:

Quick Check: