Dec 28th 2025
16 min read
Beyond Tool Correctness: Building Custom Efficiency Metrics for DeepEval
Your LLM agent just passed all its tests. It selected a tool, completed the task, and returned the right answer. Success, right? Not quite. What the tests didn't catch is...