Evaluate agent performance on custom metrics
Metric | Description | Scale |
---|---|---|
Goal Completion | Does the agent achieve its purpose? | 0-1 |
Step Efficiency | How close the agent's path is to the optimal path to a solution | 1-5 |
Context Retention | How well the agent retains earlier conversation context | 1-5 |
Error Rate | Share of steps that are unsuccessful | 0-100% |
User Satisfaction | Predicted user experience | 1-5 |
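
The table fixes the scales but not how each score is produced, so the following is only a minimal sketch of how a scorecard for these metrics might be assembled. Every name in it (`error_rate`, `step_efficiency`, `Scorecard`) is hypothetical, and the scoring rules are assumptions: higher is treated as better on the 1-5 scales, Step Efficiency is a linear mapping of the optimal-to-actual step ratio, and an empty trace counts as error-free. Goal Completion, Context Retention, and User Satisfaction are assumed to come from an external grader (a human or a model) and are only range-checked here.

```python
from dataclasses import dataclass


def error_rate(step_ok: list[bool]) -> float:
    """Error Rate: percentage of steps that were unsuccessful."""
    if not step_ok:
        return 0.0  # assumption: an empty trace counts as error-free
    failed = sum(1 for ok in step_ok if not ok)
    return 100.0 * failed / len(step_ok)


def step_efficiency(steps_taken: int, optimal_steps: int) -> float:
    """Step Efficiency on the 1-5 scale.

    Assumption: the score is a linear mapping of optimal/actual steps,
    so a run that matches the optimal path scores 5.
    """
    if steps_taken <= 0 or optimal_steps <= 0:
        raise ValueError("step counts must be positive")
    ratio = min(1.0, optimal_steps / steps_taken)
    return 1.0 + 4.0 * ratio


@dataclass
class Scorecard:
    """One evaluated run, with every metric checked against its scale."""

    goal_completion: float    # 0-1, from an external grader (assumed)
    step_efficiency: float    # 1-5
    context_retention: float  # 1-5, from an external grader (assumed)
    error_rate: float         # 0-100, percent
    user_satisfaction: float  # 1-5, from an external grader (assumed)

    def __post_init__(self) -> None:
        bounds = {
            "goal_completion": (0.0, 1.0),
            "step_efficiency": (1.0, 5.0),
            "context_retention": (1.0, 5.0),
            "error_rate": (0.0, 100.0),
            "user_satisfaction": (1.0, 5.0),
        }
        for name, (low, high) in bounds.items():
            value = getattr(self, name)
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside {low}-{high}")


# Illustrative run: four steps, one of which failed, against an
# assumed optimal path of three steps. Grader scores are made up.
steps = [True, True, False, True]
card = Scorecard(
    goal_completion=1.0,
    step_efficiency=step_efficiency(steps_taken=len(steps), optimal_steps=3),
    context_retention=4.0,
    error_rate=error_rate(steps),
    user_satisfaction=4.0,
)
```

Validating the ranges in `__post_init__` means a score reported on the wrong scale (for example, Goal Completion given as a percentage) fails loudly instead of silently skewing later aggregates.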