atomr-infer-cli
The
atomr-infer servebinary. Boots an actor system, applies every[[deployment]]in your project file, mounts the gateway.
Quick start
…and curl http://127.0.0.1:8080/v1/chat/completions against it.
Subcommands
| Subcommand | What it does |
|---|---|
atomr-infer serve --config <path> |
Parse the project file, build the actor system, register every deployment, mount the gateway, wait for Ctrl+C. |
atomr-infer status --config <path> |
Print the deployments in the project file (validate without running). |
atomr-infer cost-report |
Per-deployment cost — talks to a running MetricsActor. (Phase 6 stub.) |
atomr-infer rotate-credentials <name> |
Triggers RemoteSessionActor::rebuild on the named deployment. (Phase 6 stub.) |
Project file (TOML)
[]
= "production"
= "0.0.0.0:8080"
[[]]
= "gpt-4o-mini"
= "gpt-4o-mini"
= "open_ai"
= 2
[]
= 50
= "queue" # queue | reject | fallback
[[]]
= "tinyllama-local"
= "TinyLlama-1.1B-Chat-Q4_0"
= "candle"
= 1
= 1
Build profiles
| Build | Use case |
|---|---|
cargo build -p atomr-infer-cli --no-default-features --features remote-only |
Pure-remote router; no GPU deps in the binary. |
cargo build -p atomr-infer-cli --features all-remote |
All four remote providers + pipeline. |
cargo build -p atomr-infer-cli --features default-prod |
The doc's recommended production preset. |