decapod 0.47.29

Decapod is the daemonless, local-first control plane that agents call on demand to align intent, enforce boundaries, and produce proof-backed completion across concurrent multi-agent work. 🦀
Documentation

Get running

cargo install decapod
decapod init

That's it.

decapod init asks about your project and creates .decapod/, a local folder your agent uses to remember intent, rules, context, specs, task ownership, and proof.

Your workflow does not change.

You keep talking to your agent like normal. The agent calls Decapod when it needs project memory, intent, boundaries, specs, todos, or proof.

Decapod runs, rehydrates repo state, returns what the agent needs, writes explicit artifacts when useful, and exits.

Decapod is the shared backhaul beneath Claude, Codex, Kilo, OpenCode, and other coding agents. Different providers get the same repo-native memory, boundaries, task ownership, and proof surface.


In plain English

AI coding agents move fast, but they often lose the plot.

They forget what you meant, pull too much context, skip dependencies, touch files they should not touch, and call work "done" without proving it.

Decapod gives your agent a repo-native checkpoint system.

Before the agent acts, Decapod helps it clarify intent, gather only the context it needs, respect project boundaries, and understand what proof will be required.

After the agent acts, Decapod helps check the result, record what changed, and preserve the evidence in the repo.

You keep talking to your agent normally. The agent checks in with Decapod when the work needs intent, context, boundaries, dependencies, feedback, or proof to become explicit.

The simplest model:

You talk to the agent.
The agent talks to Decapod.
Decapod keeps the work bounded, remembered, specified, and provable.

How agents use it

Agents invoke Decapod as a short-lived local utility:

  1. Clarify intent before spending model context.
  2. Assemble bounded context instead of loading the whole repo.
  3. Claim task ownership and respect workspace boundaries.
  4. Emit specs, plans, summaries, or proof artifacts when useful.
  5. Validate the result before returning completion.

If uncertainty is high, the agent should ask for clarification before inference-heavy work or irreversible edits.

Decapod is daemonless. Agents call it like cat, awk, or grep: short-lived, host-agnostic, local, repo-native, and only when needed.

See the canonical router in constitution/core/DECAPOD.md.

AI agents do not fail because they lack tools. They fail because they lose intent, skip dependencies, mutate context unsafely, learn from noisy signals, and return vibes instead of proof.


The loop

     User
       │
       ▼
    Agent ───────┐
       │         │
       │    ┌────▼────┐
       │    │ Decapod │
       │    │ (check) │
       │    └────┬────┘
       │         │
       ├─────────┤
       │         │
     Model     Agent
       │         │
       └────┬────┘
            ▼
          User

Decapod is not the agent.

Decapod is the governance kernel the agent calls when intent, context, boundaries, dependencies, feedback, ownership, or proof need to become explicit.

Humans mostly experience Decapod through the quality of the agent's work: clearer intent, smaller context, safer changes, better specs, and proof-backed completion.


What Decapod does

  1. Clarifies intent — what is the goal?
  2. Bounds context — what does the agent actually need?
  3. Defines scope — what is in bounds for this task?
  4. Generates specs — what should be built, changed, or preserved?
  5. Tracks ownership — who owns which task or artifact?
  6. Enforces boundaries — what must not be touched?
  7. Governs adaptation — what feedback may change future behavior?
  8. Requires proof — what makes the work complete?

Decapod resolves only what is relevant to the user's intent. Your agent gets surgical context, not the whole repo and not the entire constitution.


Why it exists

Agent workbenches improve the session.

Decapod improves the shared substrate behind those sessions.

Agents act in private context. Decapod makes the durable parts of their work public to the repo: intent, resolved context, boundaries, todos, specs, validation, feedback-derived proposals, and proof artifacts.

A task started by Claude Code should be auditable by Codex, resumable by Gemini CLI, and verifiable by Kilo or OpenCode. The source of truth lives in .decapod/, not chat history, IDE state, or provider memory.

Classical acceptance pipelines made completion criteria explicit. Decapod applies that intent earlier in agent work: before inference, it shapes context and boundaries; after implementation, it requires inspectable proof. Human review still matters, but unreviewed agent-speed work needs machine-checkable governance before promotion.

Decapod absorbs common agent failure modes:

  • ambiguity
  • context waste
  • boundary drift
  • forgotten dependencies
  • unsafe mutation
  • noisy feedback loops
  • instruction drift
  • unverifiable "done"

Governed feedback loops

Agents need feedback loops, but feedback loops are only safe when they are governed.

A correction from a human, a failed test, a review comment, a Slack reaction, or an observed workflow pattern can all be useful signals. But signals should not silently rewrite how an agent behaves.

Decapod treats durable behavior changes as reviewable state.

feedback signal
  → interpreted against intent
  → scoped to the affected behavior
  → proposed as a spec, rule, or instruction change
  → reviewed before activation
  → preserved with proof and provenance

The point is not to make agents self-improve without control.

The point is to let agents learn without drifting.

Decapod makes the loop explicit:

  • What signal was consumed?
  • What behavior would change?
  • What scope does the change affect?
  • Who reviewed it?
  • Can it be rolled back?
  • What proof shows the change improved the work?

Feedback loops help agents improve.

Decapod keeps improvement from becoming unmanaged mutation.


The shared substrate

.decapod/
  generated/
    specs/         # human-visible intent, architecture, and work specs
    context/       # deterministic context capsules and summaries
    artifacts/     # verification output, proof, provenance
  governance/      # todos, claims, workunits
  data/            # durable repo-native state
  config.toml      # project shape and agent-facing configuration
  OVERRIDE.md      # local rules that override embedded defaults

This is what persists.

Not the chat transcript.

Most of Decapod is background machinery for agents. Humans inspect the generated files Decapod intentionally surfaces: specs, context summaries, verification output, proof, and provenance.


The constitution

Decapod ships with an embedded engineering constitution: over 100 declarative documents covering architecture, security, performance, testing, knowledge graphs, claims, proof surfaces, interfaces, evaluation criteria, and workflows.

Everything an engineering org usually keeps in scattered docs, tribal memory, review culture, and unwritten taste becomes executable guidance your agent can consult.

Recent research has confirmed what Decapod was built around from the start: AI coding agents waste significant context on irrelevant files. — arXiv:2602.11988

Your agent does not guess. It reads the constitution, cites claim IDs, follows gates, asks for clarification, and produces proof.


Your interface

Agents read .decapod/OVERRIDE.md for plain-English project rules that override the embedded constitution:

.decapod/
  OVERRIDE.md    # your rules, overriding embedded defaults

Agents read .decapod/config.toml for project-level configuration:

name = "my-project"
summary = "What this repo does"
primary_languages = ["rust", "typescript"]
architecture = "cli"

config.toml captures project context and setup preferences:

  • project name and summary
  • primary language or languages
  • architecture type
  • generated agent entrypoints such as CLAUDE.md, GEMINI.md, and others

Keep durable rules in OVERRIDE.md. Keep project shape in config.toml.


What humans inspect

Humans do not need to babysit the command surface. The useful interface is the generated evidence:

  • specs and plans that show intended behavior
  • context summaries that explain what the agent used
  • task ownership and handoff records
  • validation output and proof artifacts
  • explicitly surfaced files in pull requests

The command surface exists for agents and automation. The human-facing result is a repo that remembers why work happened and how completion was proven.


Proof lives in the repo

Every run leaves operational evidence in .decapod/:

  • captured intent → generated/specs/INTENT.md
  • resolved context → generated/context/
  • todos and dependencies: governance/todos.jsonl
  • verification results: generated/artifacts/
  • proof artifacts: generated/artifacts/provenance/

The generated files are the human-visible proof surface. They can be inspected locally, reviewed in pull requests, archived with the codebase, and used by the next agent invocation to re-establish state.

The repo remembers. Chat history does not.


Agent workbench gaps

Workbenches optimize Decapod preserves
The current session Reusable repo-native knowledge
Worker throughput Shared substrate quality
Provider-specific context Explicit intent, boundaries, and proof
Session-scoped memory Durable state in .decapod/
Prompt tweaks Reviewable instruction changes
Agent improvement Governed adaptation

Use whatever agent you already use: Claude, Codex, Gemini, Cursor, Kilo, OpenCode, or another provider.


Before / after

Before

User: "build auth"
Agent: [full repo in prompt]
       → generates
       → commits

After

User: "build auth"
Agent: [Decapod]
       → intent: auth system
       → context: src/auth/
       → specs generated
       → work completed
       → [Decapod]
       → proof: verified
       → commits

Guarantees

  • Daemonless — runs on demand
  • Repo-native — state lives in .decapod/
  • Local-first — no SaaS control plane required
  • Provider-agnostic — works across agent workbenches
  • Agent-first — optimized for programmatic invocation by coding agents
  • Proof-gated — VERIFIED means gates passed
  • Boundary-aware — protected paths and branches are enforced
  • Feedback-governed — durable behavior changes require explicit scope, review, and provenance

Contributing

git clone https://github.com/DecapodLabs/decapod
cd decapod
cargo build && cargo test

Docs


Support