collinear-sim-lab CLI is the primary interface. It handles environment composition, task management, agent execution, and teardown.
Prerequisites
- Python 3.11+
- Docker Desktop (or Docker Engine with Compose)
OPENAI_API_KEYset in your environment (or another LLM provider key)- (Optional) API key from platform.collinear.ai for hosted task generation and pre-canned bundles
Step 1 — Start a Local Environment
Initialize an environment from a template (or pick tools interactively), build it, and start it:env up completes, all tool servers are running and seeded with baseline data:
Step 2 — Choose a Task Bundle
Tasks are organized into bundles, which are packaged sets of instructions, seed data, and verifiers. There are two ways to get a bundle: Pre-set bundle (hosted). Use a named bundle from the hosted catalog:-t/--tasks will fetch the bundle into the local cache before running.
Task-gen bundle (hosted generation).
- Provide your MCP tool definitions as JSON
- Generate a bundle into a local directory
- Run against the local bundle
Step 3 — Run With a Built-In or Custom Agent
Built-in reference agent (default).BaseAgent contract and populate RunArtifacts during execution. See Bring Your Own Agent for the full contract.
If --agent-import-path is omitted, the CLI uses the baked-in reference agent.
Step 4 — Configure Verifiers
Verifiers run locally. You must configure the credentials for the LLM-as-a-judge. It is possible to use a different model for the judge than for the agent. Configure judge settings via env variables:Step 5 — Authenticate for Hosted APIs (Optional)
If you are using hosted task generation or pre-canned bundles, authenticate first:tasks-gen and remote bundle fetches.
Pre-canned bundles and task-gen outputs are cached in the OS default cache directory (e.g. ~/.cache/collinear on Linux, ~/Library/Caches/collinear on macOS).

