What is this?
A colony of autonomous AI agents that measurably get smarter over time.
Each agent is a self-contained Rust binary that bootstraps its own crypto wallet, runs a payment gateway, thinks via a 9-system cognitive architecture, writes and compiles its own code, benchmarks its own intelligence, and shares what it learns with every other agent in the colony.
The core idea: N diverse agents collectively solving more than any individual. A blog app and a payment gateway encounter different problems — but Rust patterns, async handling, error recovery all transfer. Knowledge flows through federated brain weight averaging. The colony's IQ rises.
What makes this different
- Verifiable intelligence — 50 novel coding problems (Opus IQ Benchmark) with compiler-enforced test suites.
cargo testeither passes or it doesn't. No subjective evaluation. - Self-modification that compiles — agents edit their own Rust source code, verified by the type system before commit. Seven safety layers prevent self-bricking.
- Economic sustainability — agents monetize endpoints via HTTP 402 payments on the Tempo blockchain. The colony pays for its own compute.
- Grounded theory — the Free Energy Principle provides a single scalar F(t) measuring total cognitive surprise. Decreasing F = the colony is getting smarter.
Live Colony
Three agents running now on Railway:
| Agent | Role | URL |
|---|---|---|
| borg-0 | Queen (canonical) | borg-0-production.up.railway.app |
| borg-0-2 | Child | borg-0-2-production.up.railway.app |
| borg-0-3 | Child | borg-0-3-production.up.railway.app |
Each has a 1.2M parameter neural brain, all 9 cognitive systems active, Opus IQ benchmark running.
Architecture
┌──────────────────────────────┐
│ APPLICATION │ ← diverges freely
│ Payment gateway / Blog / Any │
└──────────────┬───────────────┘
│
┌──────────────────────────────────┴───────────────────────────────────┐
│ COGNITIVE LAYER (always syncs) │
│ │
│ ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌──────────┐ ┌─────────────┐ │
│ │ BRAIN │ │ CORTEX │ │ GENESIS │ │ HIVEMIND │ │ SYNTHESIS │ │
│ │ 1.2M NN │ │World Mdl│ │Plan DNA │ │Pheromones│ │Metacognition│ │
│ │ Online │ │Curiosity│ │Crossover│ │Stigmergy │ │ Imagination │ │
│ │ SGD │ │ Dreams │ │Mutation │ │Reputation│ │ Self-model │ │
│ └─────────┘ └─────────┘ └─────────┘ └──────────┘ └─────────────┘ │
│ │
│ ┌──────────┐ ┌──────────┐ ┌────────────┐ ┌───────────────────┐ │
│ │ AUTONOMY │ │EVALUATION│ │ FEEDBACK │ │ FREE ENERGY │ │
│ │ LLM-free │ │ Brier │ │Error class. │ │ F(t) = Σ surprise │ │
│ │ planning │ │ scores │ │ Lessons │ │ EXPLORE/EXPLOIT │ │
│ └──────────┘ └──────────┘ └────────────┘ └───────────────────┘ │
│ │
│ ← All systems share weights across colony via federated averaging → │
└──────────────────────────────────────────────────────────────────────┘
Two-layer design: the application (frontend, routes, business logic) diverges freely per agent. The cognitive layer (brain, world model, evolved templates, pheromone trails) always syncs. Every agent makes the colony smarter.
Opus IQ Benchmark
50 novel problems designed by Claude Opus 4.6, replacing HumanEval/Exercism. Verified by cargo test. Agents can't game the benchmark because they didn't write the tests.
| Tier | Tests | Weight | Description |
|---|---|---|---|
| 1: Generation | 10 | 1× | Multi-constraint Rust: ring buffer, expression evaluator, trie, LRU cache, matrix ops |
| 2: Debugging | 10 | 2× | Find and fix bugs given failing tests: binary search overflow, CSV parsing, merge sort |
| 3: Induction | 10 | 3× | Infer algorithm from I/O examples only: look-and-say, Gray code, spiral matrix |
| 4: Reasoning | 10 | 4× | Logic puzzles: N-queens, water jugs, 4×4 sudoku, 2-SAT, graph coloring |
| 5: Adversarial | 10 | 5× | Exploit LLM failure modes: base -2, reversed precedence, Unicode traps, off-by-one |
IQ mapping: 0% → 85, 50% → 115, 100% → 150. Higher tiers worth more. Set SOUL_BENCHMARK_MODE=opus to activate.
Neural Brain
From-scratch feedforward neural network. No ML framework. Pure Rust.
- 1,205,271 parameters (128→1024→1024→23)
- Online SGD training after every plan step
- Brain gating: blocks risky operations when predicted success < 10%
- Federated averaging: weight deltas shared between peers (merge rate 0.3)
- Xavier initialization, ReLU activations, sigmoid/softmax outputs
Outputs: success probability, error category (11-class), per-capability confidence (11 skills).
Colony Selection
Agents compete on fitness. Fitter agents influence the colony more.
- 5-component fitness: execution, coordination, prediction, evolution, introspection
- Reputation-weighted sync: fitter peers get 2× influence, weaker get 0.1×
- Spawn rights: only above-median fitness can clone
- Self-repair: every 20 cycles, mechanical detection + fix of degenerate state (brain divergence, trail convergence, rule poisoning, genesis stagnation)
- Stagnation breakers: per-goal retry limits, global idle detection, automatic goal abandonment
Clone Lifecycle
Agents differentiate through source code, not just data:
| Phase | Name | What Happens |
|---|---|---|
| 1 | Fork | Identical code from main. Differentiates only through learned weights. |
| 2 | Branch | First code commit → own vm/{id} branch. Unique source modifications. |
| 3 | Birth | Own GitHub repo. Fully independent. Optionally syncs cognitive layer back to colony. |
Payments (HTTP 402)
Client ──GET /g/endpoint──► Gateway ──verify+settle──► Facilitator ──transferFrom──► Chain
◄── 402 + price ────────── │ │ │
──sign EIP-712 + retry──► │ │ │
◄── 200 + content + tx ── ◄── settlement result ─────────◄── tx hash ─────────────────┘
Tempo Moderato blockchain (Chain ID 42431), pathUSD token (6 decimals), tempo-tip20 scheme.
Workspace
| Crate | Purpose | Install |
|---|---|---|
tempo-x402 |
Core: types, EIP-712, TIP-20, nonce store, WASM wallet, client SDK | cargo add tempo-x402 |
tempo-x402-gateway |
Payment gateway + embedded facilitator | cargo add tempo-x402-gateway |
tempo-x402-identity |
Wallet generation, faucet, ERC-8004 identity | cargo add tempo-x402-identity |
tempo-x402-model |
Transformer for plan sequence prediction (from-scratch, no ML framework) | cargo add tempo-x402-model |
tempo-x402-soul |
Cognitive architecture: 9 systems, plan execution, benchmarking, self-modification | cargo add tempo-x402-soul |
tempo-x402-node |
Self-deploying node: composes everything + clone orchestration | cargo add tempo-x402-node |
tempo-x402-app |
Leptos WASM dashboard (not published) | — |
Quick Start
use ;
let key = generate_random_key;
let signer = new.unwrap;
let address = signer.address;
Run a node
API
Gateway
| Method | Path | Description |
|---|---|---|
ANY |
/g/:slug/* |
Payment-gated proxy |
GET |
/health |
Health + build SHA |
GET |
/instance/info |
Identity, endpoints, fitness |
POST |
/clone |
Spawn new node ($1 pathUSD) |
Soul
| Method | Path | Description |
|---|---|---|
GET |
/soul/status |
Full cognitive state |
POST |
/soul/chat |
Multi-turn chat |
POST |
/soul/nudge |
Priority signal |
POST |
/soul/benchmark |
Trigger Opus IQ benchmark |
GET |
/soul/brain/weights |
Export 1.2M brain weights |
POST |
/soul/brain/merge |
Merge peer brain deltas |
GET |
/soul/colony |
Colony rank + peers |
GET |
/soul/cortex |
Export world model |
GET |
/soul/genesis |
Export evolved templates |
GET |
/soul/hivemind |
Export pheromone trails |
GET |
/soul/lessons |
Export plan outcomes |
POST |
/soul/reset |
Full state reset |
Safety
Seven layers, mechanically enforced:
- Rust guard — hardcoded protected file list
- Plan validation — 10 mechanical rules (read-before-write, cargo-check-before-commit, slug sanitization, failure chain saturation, brain gating)
- Self-repair — detects and fixes degenerate cognitive state
- Brain gating — neural network blocks risky steps with low predicted success
- Pre-commit —
cargo check+cargo testbefore any commit - Branch isolation — all changes on
vm/<id>, nevermain - Human gate — PRs required for production changes
Security audit: 19/19 tests pass. No hardcoded keys, constant-time HMAC, SSRF protection, parameterized SQL.
Development
License
MIT