The ReAct pattern combines reasoning and acting steps in AI models to improve decision-making. To evaluate it, we focus on accuracy and task success rate. Accuracy shows how often the model's final answers are correct. Task success rate measures if the model completes the intended task using its reasoning and actions. These metrics matter because ReAct aims to improve both understanding and execution, so we want to see if the model reasons well and acts correctly.
ReAct pattern in Prompt Engineering / GenAI - Model Metrics & Evaluation
Start learning this pattern below
Jump into concepts and practice - no test required
Confusion Matrix for ReAct model task completion:
Predicted Success Predicted Failure
Actual Success 85 (TP) 15 (FN)
Actual Failure 10 (FP) 90 (TN)
Total samples = 200
Precision = TP / (TP + FP) = 85 / (85 + 10) = 0.8947
Recall = TP / (TP + FN) = 85 / (85 + 15) = 0.85
F1 Score = 2 * (Precision * Recall) / (Precision + Recall) = 0.871
This matrix shows how well the ReAct model predicts successful task completion. High precision means most predicted successes are true. High recall means most actual successes are caught.
In ReAct models, precision and recall balance is key:
- High Precision: The model rarely claims success unless very sure. Good when false success is costly, like medical advice generation.
- High Recall: The model tries to catch all successes, even if some are wrong. Useful when missing a success is worse, like emergency response planning.
Choosing which to prioritize depends on the task. For example, a ReAct model helping with legal advice should have high precision to avoid wrong guidance. A ReAct model for search and rescue should have high recall to not miss any possible success.
Good metrics:
- Accuracy above 85%
- Precision and recall both above 80%
- F1 score close to or above 85%
- Consistent task success rate across different inputs
Bad metrics:
- Accuracy below 70%
- Precision or recall below 50%
- Large gap between precision and recall (e.g., precision 90% but recall 30%)
- Unstable task success rate, failing often on new inputs
Good metrics mean the ReAct model reasons and acts reliably. Bad metrics show it struggles to balance reasoning and action, leading to wrong or missed results.
- Accuracy paradox: High accuracy can be misleading if data is imbalanced (e.g., mostly failures). Always check precision and recall.
- Data leakage: If the model sees answers during training, metrics will be unrealistically high.
- Overfitting: Model performs well on training but poorly on new tasks, hiding in high training accuracy.
- Ignoring task complexity: Metrics alone don't show if reasoning steps are meaningful or just memorized.
- Not measuring intermediate reasoning quality: Only final output metrics miss how well the model reasons before acting.
Your ReAct model has 98% accuracy but only 12% recall on successful task completions. Is it good for production? Why or why not?
Answer: No, it is not good. The very low recall means the model misses most actual successes, even if it rarely makes false success claims. This means many tasks that should succeed are not recognized, which can be critical depending on the application. High accuracy alone is misleading here.
Practice
ReAct pattern in AI?Solution
Step 1: Understand the ReAct pattern concept
The ReAct pattern mixes reasoning (thinking) and actions (doing) to solve problems step-by-step.Step 2: Identify the main goal
This approach helps AI be more transparent and effective by breaking down tasks into Thought, Action, Observation, and Final Answer.Final Answer:
To combine thinking and acting steps for better problem solving -> Option BQuick Check:
ReAct = Reason + Act [OK]
- Thinking AI skips actions
- ReAct stores data only
- ReAct replaces humans fully
Solution
Step 1: Recall the ReAct step order
The ReAct pattern follows a clear order: first the AI thinks (Thought), then acts (Action), then sees results (Observation), and finally gives the answer.Step 2: Match the correct sequence
Thought -> Action -> Observation -> Final Answer correctly lists this order as Thought -> Action -> Observation -> Final Answer.Final Answer:
Thought -> Action -> Observation -> Final Answer -> Option AQuick Check:
Order = T -> A -> O -> FA [OK]
- Mixing up Observation and Action order
- Putting Final Answer before Observation
- Skipping Thought step
thought = 'Check weather'
action = 'Query weather API'
observation = 'It is sunny'
final_answer = f"Weather is {observation}"
print(final_answer)What will be the printed output?
Solution
Step 1: Understand variable assignments
The variableobservationholds the string 'It is sunny'. Thefinal_answeruses this to create 'Weather is It is sunny'.Step 2: Evaluate the print statement
The print outputs thefinal_answerstring, which is 'Weather is It is sunny' because the f-string inserts the full observation string.Final Answer:
Weather is It is sunny -> Option DQuick Check:
Output includes 'Weather is' + observation [OK]
- Ignoring f-string variable insertion
- Printing wrong variable
- Confusing observation with action
thought = 'Calculate sum' action = 'Add 2 and 3' observation = 2 + 3 final_answer = 'Sum is ' + observation print(final_answer)
Solution
Step 1: Analyze the final_answer concatenation
The code tries to add a string 'Sum is ' and an integer observation (5) directly, which causes a TypeError in Python.Step 2: Identify the fix
To fix, convert observation to string usingstr(observation)before concatenation.Final Answer:
Cannot concatenate string and integer directly -> Option AQuick Check:
String + int causes error [OK]
- Ignoring type mismatch in concatenation
- Thinking observation must be string always
- Confusing action with observation
Solution
Step 1: Understand ReAct for stepwise problem solving
The ReAct pattern requires the AI to think (reason), act (calculate), observe (check result), and then answer.Step 2: Match the approach to ReAct steps
AI thinks about the problem, performs a calculation action, observes the result, then states the final answer describes this exact process, making the AI transparent and effective in solving math problems step-by-step.Final Answer:
AI thinks about the problem, performs a calculation action, observes the result, then states the final answer -> Option CQuick Check:
ReAct = Thought + Action + Observation + Answer [OK]
- Skipping reasoning steps
- Guessing without observation
- Ignoring stepwise transparency
