cargo-ai™
Build AI-powered CLI tools from a single JSON definition.
Define declarative agents in JSON, hatch native executables locally, and share them in minutes.
Cargo AI is an open-source CLI for building auditable AI-powered CLI tools from a single JSON definition. Define inputs, schema, and actions once, hatch a native executable with cargo ai hatch, then inspect, run, and share it on your terms.
Cargo AI keeps agent behavior readable, auditable, and understandable through a single JSON definition.
Why Cargo AI
- Declarative by Design: define exactly what the agent does, what actions it can take, and keep the behavior easy to inspect.
- Open Source and Fully Auditable: inspect the generated code, understand what ships, and keep control of the runtime.
- Handles Real Inputs: work with text, images, URLs, and common files.
- Supports Advanced Logic: add conditions and follow-up behavior without hand-building a custom app.
- Real Actions, Not Just Prompts: run local commands, call child agents, pass command-line arguments, and send email follow-ups.
- Choose Your Own AI: use OpenAI models today or open-source models through Ollama, with room for more providers over time.
- You Own the Output: hatch a local executable and generated code that you can keep, modify, and run wherever you want.
- Portable Across macOS, Linux, and Windows: keep one readable agent definition and hatch it for the systems you care about.
- Easy to Share Through
cargo-ai.org: create a free account to publish definitions in minutes so other people can hatch them locally on their own machines. - No Extra Token Plumbing Required: use your existing Codex workflow when it fits, or bring your own model access when you want direct provider control.
- Built for AI-Assisted Iteration: keep the agent readable, diffable, and easy to improve with tools like Codex.
- Built to Grow With You: start with one clear definition, then add commands, email actions, and shared definitions as your workflow expands.
A concise JSON definition keeps the agent easy to read, review, diff, and improve without losing trust in what it does.
Quick Start
0. Install Cargo
Cargo AI requires Rust and Cargo. If you do not already have them, install Rust with rustup using the official guide for macOS, Linux, or Windows. This usually takes a few minutes.
Official install guide: Install Rust
After installation, verify Cargo is available:
1. Install cargo-ai
2. Choose your model setup
Option A: recommended if you use ChatGPT Plus or above
Includes Codex at no additional cost. This is the easiest path today. cargo-ai uses your Codex login, so no separate API key is required.
If you do not already have Codex installed, get it here: Codex CLI setup
Option B: direct provider control
Use this path if you want an explicit model profile with direct provider credentials and no Codex dependency.
Option C: open-source models with Ollama
Use this path if you want to run cargo-ai without ChatGPT or OpenAI at all.
Install Ollama here: Get Ollama
Then pull a model such as mistral and add a local profile:
3. Hatch a sample agent
On Windows, run adder_test.exe or just adder_test.
4. Register an account
Define agent email alerts with cargo-ai.org and manage your agents in one place. Keep them private, or share them instantly with anyone in the world.
Optional: set a custom public handle
If you want a specific public handle, set it here. Otherwise, cargo-ai.org assigns one automatically, and you can change it later.
Once registered, you can push an agent definition to your account repository and hatch it locally:
The Core Mental Model
[!TIP] You do not need to author this by hand. The fastest path is to tell Codex exactly what kind of agent you want and let it update the file for you. Read this section so the structure is easy to recognize, then review the result and verify exactly what the agent does. When you're ready for that loop, jump to Best First Workflow in Codex.
Cargo AI keeps the authoring model intentionally small:
inputsOrdered model-facing input such astext,url, orimage.agent_schemaThe typed response you expect back.actionsWhat to do after the response is validated, including the orderedrunsteps inside each action.
The next section expands those same pieces from minimal snippets into richer patterns.
A minimal agent looks like this:
That JSON becomes a compiled local executable through:
For Windows users, run my_agent.exe or just my_agent.
You can also override or inject runtime input without editing the JSON. Generated agents accept flags such as --input-text, --input-url, and --input-file. By default, runtime input flags replace the baked inputs array for that run. Use --input-mode append to keep baked inputs first, or --input-mode prepend to place runtime inputs before the baked inputs.
Start Simple, Then Expand
Use these snippets to recognize how inputs, agent_schema, and actions grow as the agent becomes more capable.
Click linked labels to open full runnable examples.
Inputs
Use the input types that fit the job.
URL input:
Image input:
File input:
Multiple inputs with related scoring:
You can override the baked inputs any time you run the generated agent. By default, runtime input flags replace the configured inputs for that execution, and the runtime input order is preserved exactly as you pass it on the command line. Use --input-mode append to keep baked inputs first, or --input-mode prepend to keep runtime inputs first.
agent_schema
The agent_schema is the output contract for the agent. Start simple, then add more structure as the agent becomes more capable.
Minimal output contract:
Add clearer field meaning with descriptions:
agent_schema can include any number of top-level string, integer, number, and boolean fields, plus optional description, string enum, and numeric bounds where supported.
Then expand into richer constraints and exact output choices:
actions
actions define what the agent is allowed to do after it produces the top-level structured output.
Action logic uses JSON Logic.
Within an action, run steps execute in order after the action's JSON Logic condition evaluates true.
Start with one simple local action:
Then expand into multiple action types:
You can keep actions simple or mix local executables, email alerts, child-agent handoffs, and generated image artifacts in the same agent definition. The next section shows how to sequence multiple run steps and control them with when.
run
run is the ordered step list inside an action.
Start with one simple step:
Then expand into a multi-step workflow:
Use run to sequence multiple side effects in order. exec steps can capture output, status, or errors for later steps, generate_image can write a single local image artifact, and when lets later steps react to success or failure without leaving the agent definition.
First-slice image generation uses an explicit model and a local output path. For the default OpenAI account transport, use a tool-capable mainline model such as gpt-5.2. For a direct OpenAI API token and URL, use an image model such as gpt-image-1.
You can also target individual run steps to specific runtime platforms:
Or target multiple platforms with an array:
Child agents
Use child agents when one agent needs to hand work to another agent.
- Point to a child agent that lives next to the parent file, such as
./child_reporter. - By default, an agent can call child agents up to
5levels deep. Override that with--max-agent-depth. - By default, the parent plus any child agents share a total runtime budget of
600seconds. Override that with--max-runtime-in-sec. - A parent can pass inputs to a child and record whether the child succeeded or failed.
- Child
agentsteps may setinput_modetoreplace,append, orprependwhen they also provide childinputs. - A parent cannot automatically pull the child's structured return fields back into its own output.
Example:
Build In Any Editor
You can build a cargo-ai agent in any editor you want. If you want to check whether the definition is valid before exporting a binary, run:
If your config file already matches the agent name, the shorthand works too:
When the file checks cleanly, use the Codex workflow below for the fastest iteration loop.
Best First Workflow in Codex
If you want the fastest authoring loop, start in a new folder and let Codex build the agent definition with you.
This creates AGENTS.md plus helper files under .cargo-ai/guidance/ so Codex knows the Cargo AI contract.
Then tell Codex: I want to build a Cargo AI agent. Describe what the agent should do, what inputs it should accept, what structured output it should return, and any follow-up actions you want.
Ask Codex to:
- build the JSON definition
- run
cargo ai hatch my_agent --config ./my_agent.json --check - update the JSON until the check passes
Then review the generated JSON yourself to make sure it matches your intent.
Cargo AI works best when the definition stays small, understandable, and easy to verify as you iterate.
Account-Backed Flows
After registration, you can use Cargo AI as more than a local hatching tool:
- store and retrieve agent definitions through your account
- hatch from your own hosted definitions
- hatch public definitions from another owner's handle
- use account-aware email workflows
Examples:
# Hatch your own hosted definition
# Validate scaffold and compile path without exporting a binary
# Hatch a public definition from another handle
Where To Go Next
When you want deeper details, use these files:
- Examples:
- JSON/schema reference:
- Actions and authoring patterns:
- Hatch/check workflow:
- Troubleshooting:
Notes
cargo ai hatch --checkvalidates scaffold and compile behavior withcargo checkwithout exporting a binary.- Generated binaries use your configured/default profile unless you override runtime flags.
- Scheduling is not built into Cargo AI today. To run an agent on a schedule, use your operating system scheduler such as
cronon macOS/Linux or Task Scheduler on Windows. We know scheduling matters and expect this area to expand over time. - Cargo AI recommends manual upgrade via:
License
MIT. See LICENSE.