Skip to main content
After an agent run, verifiers evaluate whether the task was completed successfully. Verifiers consume RunArtifacts and produce reward files.

Output Directory Structure

output/agent_run_{task_id}_{timestamp}/
  artifacts.json              # Full RunArtifacts
  verifier/
    reward.txt                # "1" or "0"
    reward.json               # { "reward": 1.0 }

reward.json Schema

{
  "reward": 1.0
}
FieldTypeDescription
rewardfloat1.0 for success, 0.0 for failure (partial scores possible)