Workspace Isolation
A sandbox provides:- Isolated network — Each rollout gets its own Docker network with dedicated containers. Parallel rollouts cannot interfere with each other.
- Clean state — Every rollout starts from a freshly provisioned environment with seed data applied from scratch.
- Per-rollout configuration — Environment variables and configuration are scoped to the individual rollout.
Local Sandboxes
By default, sandboxes run locally via Docker Compose. Thesimlab env up command starts the environment, and each simlab tasks run creates an isolated workspace within it.
Remote Sandboxes (Daytona)
For cloud-based execution, Simulation Lab supports remote sandboxes via Daytona. This is useful for running evaluations at scale without local compute constraints.--daytona-api-key, [daytona].api_key in config.toml, SIMLAB_DAYTONA_API_KEY, or DAYTONA_API_KEY.
Why Sandboxes Matter
Sandbox isolation is what makes evaluation results trustworthy:- Parallel execution — Run multiple rollouts concurrently to gather statistical confidence without interference.
- Reproducibility — Same environment, same seed data, same starting conditions every time.
- Safety — Agents interact with simulated services in isolation. Nothing leaks to production systems or other rollouts.

