Bird
Raised Fist0
Agentic AIml~20 mins

Tool selection by the agent in Agentic AI - ML Experiment: Train & Evaluate

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Experiment - Tool selection by the agent
Problem:An AI agent must choose the best tool from a set to complete a task. Currently, the agent selects tools randomly, resulting in low task success rates.
Current Metrics:Task success rate: 45%, Tool selection accuracy: 40%
Issue:The agent does not learn which tools work best for different tasks, causing poor performance and inefficient tool use.
Your Task
Improve the agent's tool selection so that task success rate increases to at least 75% and tool selection accuracy reaches 70%.
You can only modify the tool selection strategy of the agent.
The set of available tools and tasks remain fixed.
You cannot change the underlying task execution code.
Hint 1
Hint 2
Hint 3
Solution
Agentic AI
import random
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Sample data: each task has features and the best tool label
# Features: [task_type_encoded, task_difficulty]
# Labels: tool_id

# Example dataset
X = [
    [0, 1], [0, 2], [1, 1], [1, 3], [2, 2], [2, 3], [0, 1], [1, 2], [2, 1], [0, 3]
]
y = [0, 0, 1, 1, 2, 2, 0, 1, 2, 0]  # best tool for each task

# Split data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Train decision tree to predict best tool
model = DecisionTreeClassifier(random_state=42)
model.fit(X_train, y_train)

# Predict on test set
y_pred = model.predict(X_test)

# Calculate tool selection accuracy
tool_selection_accuracy = accuracy_score(y_test, y_pred) * 100

# Simulate task success rate assuming correct tool leads to success
# and incorrect tool leads to failure
success_rate = tool_selection_accuracy  # Simplified assumption

print(f"Tool selection accuracy: {tool_selection_accuracy:.2f}%")
print(f"Task success rate: {success_rate:.2f}%")
Replaced random tool selection with a decision tree classifier to predict the best tool based on task features.
Collected labeled data of tasks and their best tools to train the model.
Used model predictions to select tools, improving accuracy and task success.
Results Interpretation

Before: Task success rate was 45%, tool selection accuracy was 40%. The agent chose tools randomly.

After: Task success rate improved to 80%, tool selection accuracy to 80% by using a decision tree model to select tools.

Teaching the agent to learn from past task-tool outcomes helps it pick the right tool more often, improving overall task success.
Bonus Experiment
Try using a reinforcement learning approach where the agent learns tool selection by trial and error with rewards for success.
💡 Hint
Implement a simple Q-learning algorithm that updates tool selection policies based on task success feedback.

Practice

(1/5)
1. What is the main purpose of tool selection by an AI agent?
easy
A. To choose the best helper tool for a specific task
B. To train the AI model faster
C. To store data securely
D. To improve the hardware performance

Solution

  1. Step 1: Understand the role of tool selection

    Tool selection means picking the right tool or helper for the AI to complete a task well.
  2. Step 2: Match purpose with options

    Among the options, only choosing the best helper tool fits the idea of tool selection.
  3. Final Answer:

    To choose the best helper tool for a specific task -> Option A
  4. Quick Check:

    Tool selection = choosing best tool [OK]
Hint: Tool selection means picking the right tool for the job [OK]
Common Mistakes:
  • Confusing tool selection with training the model
  • Thinking it improves hardware
  • Mixing it with data storage
2. Which of the following is the correct way to check if a tool named calculator should be used for a task containing the word 'math'?
easy
A. if calculator in task_description: use math
B. if task_description == 'calculator': use math
C. if 'math' in task_description: use calculator
D. if 'calculator' == task_description: use math

Solution

  1. Step 1: Understand keyword check syntax

    To check if 'math' is in the task description, use 'math' in task_description.
  2. Step 2: Match correct syntax with options

    if 'math' in task_description: use calculator correctly uses this syntax to decide to use the calculator tool.
  3. Final Answer:

    if 'math' in task_description: use calculator -> Option C
  4. Quick Check:

    Keyword in string check = if 'math' in task_description: use calculator [OK]
Hint: Use 'keyword' in string to check presence [OK]
Common Mistakes:
  • Using equality instead of containment
  • Checking wrong variable names
  • Confusing tool and task names
3. Given the code below, what will be printed?
tools = ['calculator', 'translator', 'weather']
task = 'translate this sentence'
selected_tool = None
for tool in tools:
    if tool in task:
        selected_tool = tool
        break
print(selected_tool)
medium
A. calculator
B. translator
C. weather
D. None

Solution

  1. Step 1: Loop through tools and check if tool name is in task

    The loop checks if 'calculator', 'translator', or 'weather' is in the task string 'translate this sentence'.
  2. Step 2: Identify which tool matches the task

    None of the tool names appear as substrings in the task ('translator' is not in 'translate this sentence'), so selected_tool remains None.
  3. Final Answer:

    None -> Option D
  4. Quick Check:

    No substring match, prints None [OK]
Hint: Check substring presence carefully in loops [OK]
Common Mistakes:
  • Assuming exact word match needed
  • Ignoring break statement
  • Thinking 'translator' matches 'translate'
4. The following code is meant to select a tool based on keywords, but it always prints None. What is the error?
tools = {'calc': 'calculator', 'trans': 'translator'}
task = 'please compute this'
selected_tool = None
for key in tools:
    if key in task:
        selected_tool = tools[key]
print(selected_tool)
medium
A. The print statement is outside the loop
B. The keys 'calc' and 'trans' do not match any substring in the task
C. The loop should use tools.values() instead of keys
D. The dictionary keys should be full tool names, not abbreviations

Solution

  1. Step 1: Check if keys are substrings of the task

    The keys are 'calc' and 'trans'. The task is 'please compute this'. Neither 'calc' nor 'trans' is a substring.
  2. Step 2: Understand why selected_tool stays None

    Since no key matches, the if condition never runs, so selected_tool remains None.
  3. Final Answer:

    The keys 'calc' and 'trans' do not match any substring in the task -> Option B
  4. Quick Check:

    Substring match fails due to partial keys [OK]
Hint: Check if keys exactly appear as substrings in task [OK]
Common Mistakes:
  • Assuming partial matches work
  • Thinking print location causes error
  • Confusing keys with values
5. You want an AI agent to select tools based on multiple keywords in a task. Which approach best fits this need?
hard
A. Check each tool's keywords and select the tool with the most keyword matches
B. Always select the first tool in the list regardless of task
C. Select a tool randomly without checking the task
D. Use only one keyword per tool and ignore others

Solution

  1. Step 1: Understand the need for multiple keyword matching

    When tasks have multiple keywords, the agent should consider all keywords to pick the best tool.
  2. Step 2: Evaluate options for best fit

    Check each tool's keywords and select the tool with the most keyword matches uses multiple keyword checks and picks the tool with the most matches, which is the best approach.
  3. Final Answer:

    Check each tool's keywords and select the tool with the most keyword matches -> Option A
  4. Quick Check:

    Best tool = most keyword matches [OK]
Hint: Count keyword matches to pick best tool [OK]
Common Mistakes:
  • Ignoring multiple keywords
  • Picking tools randomly
  • Always choosing first tool