When evaluating tool permission boundaries in agentic AI, the key metric is precision: of all the tool calls the AI makes, the fraction that fall within its permissions. It matters most because the goal is to prevent unauthorized actions. High precision means the AI rarely calls tools outside its permissions, which keeps its actions safe and controlled.
Recall is also important but secondary. It measures the fraction of the permitted tool uses a task calls for that the AI actually makes. Low recall (skipping tools it was allowed to use) reduces effectiveness, but it is less risky than calling a forbidden tool.
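As a rough illustration, both metrics can be computed from a log of an agent's tool calls. The sketch below is a minimal example under stated assumptions: the ToolCall record, the tool names, and the expected_tools set (the permitted tools a task is assumed to need) are all hypothetical and not taken from any particular framework.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolCall:
    """Hypothetical log record for one tool invocation by the agent."""
    tool: str        # name of the tool the agent invoked
    permitted: bool  # whether the permission policy allows this call


def precision(calls: list[ToolCall]) -> float:
    """Fraction of the agent's tool calls that were permitted."""
    if not calls:
        return 1.0  # assumed convention: no calls means no violations
    allowed = sum(1 for c in calls if c.permitted)
    return allowed / len(calls)


def recall(calls: list[ToolCall], expected_tools: set[str]) -> float:
    """Fraction of the expected (permitted) tools that the agent actually used."""
    if not expected_tools:
        return 1.0  # assumed convention: nothing expected, nothing missed
    used = {c.tool for c in calls if c.permitted}
    return len(used & expected_tools) / len(expected_tools)


# Illustrative data only: four calls, one of which is outside the permissions.
calls = [
    ToolCall("search", True),
    ToolCall("read_file", True),
    ToolCall("delete_file", False),
    ToolCall("search", True),
]
expected = {"search", "read_file", "send_email"}

p = precision(calls)         # 3 of 4 calls were permitted
r = recall(calls, expected)  # 2 of 3 expected tools were used
print(f"precision={p:.2f} recall={r:.2f}")
```

In this made-up log the single forbidden call lowers precision, while the unused send_email tool lowers recall. Under the reasoning above, the first would be treated as the more serious finding.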
