VerifierResult.
VerifierResult
The `result` field on rollout responses contains the serialized verifier output:
Rubric Judge
For rubric-based evaluation, the `RubricJudgeResult` provides structured scoring:
| Field | Type | Description |
|---|---|---|
| `score` | float | Score from 0.0 to 1.0 |
| `verdict` | string | "PASS" or "FAIL" |
| `confidence` | float | Confidence in the verdict (0.0–1.0) |
| `evidence` | list[string] | Bullet points with concrete evidence |
| `failed_criteria` | list[string] | Unmet rubric criteria |
| `dimension_scores` | list[object] | Per-dimension breakdowns: `{ "dimension": str, "score": float, "reason": str }` |
| `error` | string | Error message if evaluation failed |
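
The fields above can be read into typed objects on the client side. The following is a minimal sketch, not part of any official SDK. It assumes the serialized result arrives as a JSON string and that optional fields may be absent or `null`. The names `DimensionScore`, `parse_rubric_result`, and the sample payload are hypothetical and used only for illustration.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DimensionScore:
    dimension: str
    score: float
    reason: str


@dataclass
class RubricJudgeResult:
    # Field names mirror the table above; this class is a local helper,
    # not a type exported by any library.
    score: float
    verdict: str
    confidence: float
    evidence: List[str] = field(default_factory=list)
    failed_criteria: List[str] = field(default_factory=list)
    dimension_scores: List[DimensionScore] = field(default_factory=list)
    error: Optional[str] = None


def parse_rubric_result(raw: str) -> RubricJudgeResult:
    """Parse a JSON-serialized rubric result (assumed format) into a dataclass."""
    data = json.loads(raw)
    # Treat a missing or null list as empty (an assumption about the payload).
    dims = [DimensionScore(**d) for d in (data.get("dimension_scores") or [])]
    return RubricJudgeResult(
        score=float(data["score"]),
        verdict=data["verdict"],
        confidence=float(data["confidence"]),
        evidence=list(data.get("evidence") or []),
        failed_criteria=list(data.get("failed_criteria") or []),
        dimension_scores=dims,
        error=data.get("error"),
    )


# Hypothetical payload, made up for this example.
sample = json.dumps({
    "score": 0.4,
    "verdict": "FAIL",
    "confidence": 0.9,
    "evidence": ["Response omits the required citation"],
    "failed_criteria": ["cites_sources"],
    "dimension_scores": [
        {"dimension": "accuracy", "score": 0.5, "reason": "One factual error"}
    ],
    "error": None,
})

result = parse_rubric_result(sample)
if result.error:
    print(f"evaluation failed: {result.error}")
else:
    print(result.verdict, result.score, result.failed_criteria)
```

Checking `error` before reading `verdict` avoids treating a failed evaluation as a genuine "FAIL" verdict. Whether the other fields are populated when `error` is set is not specified here, so the sketch makes no assumption about it.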

