Skip to main content

Module config

Module config 

Source
Expand description

Configuration: which provider runs skills, the default platforms and models a run fans out across, and the model used for natural-language evals.

Config is loaded from a YAML file (default skilltest.yaml) and then refined by CLI overrides (see Config::apply_overrides).

Structs§

ApiJudgeConfig
Settings for judging evals and the simulated user with a direct model API call instead of running them through a harness. This trades the harness’s auth-portability for a single fast HTTP round trip per judge call (no agent-loop cold start), with normalized token usage surfaced into the report.
CommandConfig
Settings for a custom provider command speaking the JSON-lines protocol (see docs/protocol.md). Used by the bundled skilltest-fake-provider and any provider you write yourself.
Config
The full configuration for a run.
OneharnessConfig
Settings for the default oneharness provider, which runs each prompt on a harness via oneharness run.
Overrides
CLI-supplied overrides. None/empty fields leave the config value in place.

Enums§

ApiVendor
Which model vendor’s API the direct-API judge talks to.
JudgeConfig
How evals and the simulated user are judged, independent of the provider that runs the skill. Absent (the default) means the run’s provider judges too (e.g. the oneharness judge_harness).
ProviderConfig
Which provider backs a run.