Define the Trait Basis

A SteerConfigInput (the input type the SDK accepts today) describes the trait basis grid that drives sampling. Treat it as the source of truth for personas and intents, and keep all trait intensities between 0 and 2.
from collinear.schemas.steer import SteerConfigInput

trait_basis: SteerConfigInput = {
    "ages": ["13-17", "18-24", "25-34", "35-44", "45-54", "55-64", "65+"],
    "genders": ["male", "female", "other"],
    "occupations": ["Unemployed", "Employed", "Student", "Retired", "Not in Labor Force"],
    "intents": [
        "search_flights",
        "make_booking",
        "modify_booking",
        "cancel_booking_request_refund",
        "online_checkin",
        "seat_selection_or_upgrade",
        "boarding_pass_retrieval",
        "check_flight_status",
        "track_baggage",
        "rebook_due_to_disruption",
        "manage_frequent_flyer_account",
        "redeem_or_earn_miles",
        "request_lounge_access",
    ],
    "traits": {
        "impatience": [0, 1, 2],
        "confusion": [0, 1, 2],
        "skeptical": [0, 1, 2],
    },
    "locations": ["USA", "Canada", "UK", "Australia", "other"],
    "languages": ["English", "Spanish", "French", "other"],
    "tasks": ["airline support"],
}
The SDK enforces the canonical age buckets and occupation taxonomy above. If a combination pairs "Retired" with an age bucket below "35-44", that persona is skipped rather than sent downstream.
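To see how that validity rule shapes the grid, here is a minimal sketch (not the SDK's actual implementation) that enumerates age/occupation combinations and drops the ones the SDK would skip; the `is_valid` helper and `YOUNG_BUCKETS` set are illustrative names, not SDK API:

```python
from itertools import product

# Canonical buckets from the trait basis above.
ages = ["13-17", "18-24", "25-34", "35-44", "45-54", "55-64", "65+"]
occupations = ["Unemployed", "Employed", "Student", "Retired", "Not in Labor Force"]

# Hypothetical restatement of the rule: "Retired" paired with an age
# bucket below "35-44" is skipped rather than sent downstream.
YOUNG_BUCKETS = {"13-17", "18-24", "25-34"}

def is_valid(age: str, occupation: str) -> bool:
    return not (occupation == "Retired" and age in YOUNG_BUCKETS)

# 7 ages x 5 occupations = 35 raw pairs; 3 invalid pairs are dropped.
valid = [(a, o) for a, o in product(ages, occupations) if is_valid(a, o)]
```

The same filtering applies before any trait or intent expansion, so invalid persona cores never multiply into downstream combinations.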

Controlling Sample Volume

Client.simulate exposes levers for scale and pacing:
  • k: number of trait basis combinations to draw; omit it to use all combinations.
  • num_exchanges: user/assistant pairs per conversation. Each exchange adds one user line and one assistant line.
  • batch_delay: seconds to sleep between samples to avoid rate limits.
  • mix_traits: set True to mix two traits per persona (requires at least two traits defined by the Trait Basis API).
  • steer_temperature and steer_max_tokens: forwarded to the Trait Basis service when generating user turns.
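As a back-of-envelope check on what mix_traits implies for the trait basis above (a sketch; the SDK's pairing logic may differ), two distinct traits drawn from three defined traits yield three possible unordered pairs per persona:

```python
from itertools import combinations

# Traits defined in the trait basis above.
traits = ["impatience", "confusion", "skeptical"]

# With mix_traits=True, two distinct traits are combined per persona;
# combinations() enumerates the unordered pairs available to the sampler.
pairs = list(combinations(traits, 2))
```

This is also why mix_traits requires at least two traits: with fewer, there are no pairs to draw.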
simulations = client.simulate(
    steer_config=trait_basis,
    k=5,
    num_exchanges=4,
    batch_delay=0.5,
    mix_traits=True,
    steer_temperature=0.6,
    steer_max_tokens=200,
)
Each SimulationResult contains the conversation prefix (conv_prefix), the assistant’s final response (response), and the resolved trait basis attributes (steer in the SDK today).

Custom Prompts and Tone

The assistant system prompt can be overridden as follows:
runner = client.simulation_runner
runner.ASSISTANT_PROMPT_TEMPLATE = "You are a concierge..."
If you leave templates unset, the runner falls back to the SDK defaults defined in SimulationRunner.ASSISTANT_PROMPT_TEMPLATE.

Logging and Progress

  • Set progress=False on simulate to disable the tqdm progress bar (useful in headless environments).

Saving Outputs

Convert results into JSONL for downstream tooling:
from pathlib import Path
import json

out_path = Path("data/simulations.jsonl")
out_path.parent.mkdir(parents=True, exist_ok=True)

with out_path.open("w", encoding="utf-8") as f:
    for sim in simulations:
        f.write(json.dumps(
            {
                "persona": sim.steer.model_dump() if sim.steer else None,
                "conversation": sim.conv_prefix,
                "assistant_response": sim.response,
            },
            ensure_ascii=False,
        ) + "\n")
Pair this with the recipes in the dedicated page to plug simulations into evaluation pipelines.
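For the consuming side of that pipeline, a minimal reader can reload the JSONL line by line; the `load_simulations` helper below is a sketch whose keys ("persona", "conversation", "assistant_response") mirror the writer snippet above, so adjust them if your schema differs:

```python
import json
from pathlib import Path

# Reload one JSON object per line, skipping blank lines.
def load_simulations(path: Path) -> list[dict]:
    with path.open(encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

Streaming line by line keeps memory flat even for large simulation runs, which is the main reason to prefer JSONL over a single JSON array here.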