When sandboxing dangerous operations, the key metrics are False Positive Rate and False Negative Rate. This is because sandboxing aims to block harmful actions (like running unsafe code) without stopping safe ones.
A False Positive means safe operations are blocked, causing inconvenience or loss of functionality.
A False Negative means dangerous operations slip through, risking security or damage.
Therefore, metrics like Precision (how many blocked operations are truly dangerous) and Recall (how many dangerous operations are caught) are critical to balance safety and usability.
