Guard
Reliability Monitor
Track your AI systemβs consistency and accuracy with the Collinear AI Reliability Monitoring dashboard.
π₯ Interactive Walkthrough
Explore the Reliability Monitoring dashboard in action:
π Key Metrics Overview
Monitor the most important indicators of AI reliability:
- Total Volume β The number of queries processed.
- Reliability Index β A measure of how consistently your AI performs without failures or inaccuracies.
- Factual Error Rate β The percentage of responses that were factually incorrect.
- Contextual Error Rate β The frequency at which responses are incorrect due to misinterpretation of context.
π Critical Summaries
Understand trends over time with visual insights:
-
Total Queries vs. Flagged Queries View a time-series graph that compares the number of total queries with the number of flagged ones.
-
Flagged Categories Categorization of flagged queries to identify common reliability issues.
π§ Hallucination Categories
Identify and classify the types of hallucinations detected in AI responses:
- Logical β Incorrect reasoning or fallacies.
- Temporal β Misstatements about time, dates, or sequencing.
- Entity β Inaccuracies about named entities (people, places, organizations).
- Contextual β Misunderstandings within conversational context.
- Other β Unclassified or ambiguous errors.
π Live Data
Access detailed evaluation records in real time:
Field | Description |
---|---|
ID | Unique identifier for the conversation or query |
Conversation Prefix | Initial input or prompt given to the assistant |
Model Response | The generated response from the Model |
Judge Output | Evaluation or score given by the system or human judge |
Feedback | Reviewer feedback |
Categories | Labeled types of hallucination or issues |
Was this page helpful?