Open Pincery

Open Pincery is an open-source multi-agent platform for durable, event-driven AI agents. Each agent is a continuous entity with a stable identity, append-only event log, and wake/sleep lifecycle. Agents wake on messages, webhooks, or timers, work until done, and rest. Configure them by conversation. Orchestrate fleets via async messaging.

Status: v5 shipped (v1.0.0 on crates.io, 2026-04-20). Full v1–v3 feature set (CAS lifecycle, event log, LLM-powered wake/sleep, maintenance projections, HTTP API, PostgreSQL persistence, graceful shutdown, Docker Compose, rate limiting, HMAC webhook ingress, agent management, structured JSON logging, Prometheus metrics, /health + /ready split, CI, signed releases, operator runbooks); v4 self-host hardening (non-root container UID 10001, runtime budget caps, webhook-secret rotation, pcy CLI, vanilla-JS control plane, published API stability contract); v5 operator onramp (one-command demo, Caddy/TLS overlay, /api/login, bootstrap hardening).

Next: v6 is a documentation iteration — no code ships. It synthesizes the project's strategic direction into docs/input/north-star-2026-04.md, which becomes the ground floor for v7 onward: memory substrate (pgvector), codebase steward as the first Tier 1 mission, reasoner abstraction across provider / model / data-governance class, and the groundwork for a sovereign substrate that can run a one-person company. See the north star for the full direction and the twelve Durable Bets.

Why Another Agent Platform?

Most AI agent frameworks treat agents as ephemeral function calls — stateless, session-scoped, and gone between requests. Open Pincery inverts that. An agent is a continuous entity: the same identity, the same accumulated history, the same ongoing responsibilities, persisting across every interaction and every span of time between them.

The architecture draws on event sourcing, the actor model, and distributed systems engineering to give agents properties that matter for real work:

Durable identity — agents maintain a prose self-description that evolves through conversation, not config files
Append-only event log — the complete record of everything that happened, queryable and replayable
Wake/sleep lifecycle — agents rest by default, wake on events, work until done, then rest again
Async inter-agent messaging — agents communicate by message passing with no shared transcript
Shell as universal tool — one programmable executor instead of dozens of individual tool definitions
Self-configuration — reshape an agent's behavior by talking to it; the maintenance process captures changes durably

Architecture at a Glance

┌──────────────────────────────────────────────────────┐
│                    Open Pincery                      │
│                                                      │
│  ┌─────────┐  ┌─────────┐  ┌─────────┐               │
│  │ Agent A │  │ Agent B │  │ Agent C │  ...          │
│  │         │  │         │  │         │               │
│  │ Identity│  │ Identity│  │ Identity│               │
│  │WorkList │  │WorkList │  │WorkList │               │
│  │EventLog │  │EventLog │  │EventLog │               │
│  └────┬────┘  └─────┬───┘  └──────┬──┘               │
│       │  async msg  │  async msg  │                  │
│       └─────────────┼─────────────┘                  │
│                     │                                │
│  ┌──────────────────┴──────────────────┐             │
│  │         Runtime Harness             │             │
│  │  Wake/Sleep · Maintenance · CAS     │             │
│  └──────────────────┬──────────────────┘             │
│                     │                                │
│  ┌──────────────────┴──────────────────┐             │
│  │     PostgreSQL (event store)        │             │
│  └─────────────────────────────────────┘             │
└──────────────────────────────────────────────────────┘
        ▲            ▲            ▲
        │            │            │
    Webhooks      Timers      Messages

Each agent has its own event stream, identity projection, and work list. The runtime harness orchestrates wake/sleep cycles using CAS (compare-and-swap) to ensure exactly one wake is active per agent at any time. PostgreSQL is the single source of truth — no message brokers, no cloud-specific services.

Core Concepts

Concept	Description
Continuous Agent	A persistent entity with durable identity, work list, and append-only event log. Not a session.
Wake Cycle	A bounded active episode: wake on event → reason → use tools → sleep when done.
Between-Wakes Maintenance	A single LLM call after each wake updates identity, work list, and produces a wake summary.
Event Log	Append-only record of everything: messages, tool calls, timer firings, wake boundaries. Source of truth.
Projections	Identity and work list — free-form prose derived from the event log, maintained incrementally.
Shell Tool	A programmable executor where agents write programs, not individual tool calls. Intermediate data stays out of the prompt.
Semantic Stop	No kill switch. Say "stop" in a message. The agent reads it and stops. Say "carry on" and it resumes.

Stack

Concern	Choice
Runtime	Rust
Database	PostgreSQL
HTTP/API	axum
Async	tokio
SQL	sqlx (compile-time checked)
Sandbox	zerobox (per-tool isolation)
Credentials	Vault/proxy model (LLM never sees real secrets)

Security Model

Six defense layers, from innermost to outermost:

Zerobox — deny-by-default per-tool process sandboxing with secret injection via proxy
OneCLI — credential vault where agents authenticate with proxy tokens; real credentials injected at the gateway
Prompt injection defense — delimiter enforcement, instruction hierarchy, canary tokens, rate limiting
Greywall — outer host-level sandbox wrapping the entire runtime
Database security — Postgres RLS, compile-time checked queries, append-only audit
Webhook/API security — HMAC-SHA256 verification, SHA-256 dedup, rate limiting, TLS

Deployment Modes

Open Pincery is designed for four deployment modes. Self-hosting is first-class — the runtime functions without any proprietary control plane.

Mode	Description
`self_host_individual`	Single-user, local binary + Postgres
`self_host_team`	Team deployment with optional split topology
`saas_managed`	Hosted service with GitHub OAuth
`enterprise_self_hosted`	Entra/OIDC, SCIM, BYOK

How It Differs

Property	Open Pincery	Typical Agent Frameworks
Agent lifetime	Continuous — persists across interactions	Ephemeral — created per request
Memory	Event-sourced log + prose projections	Chat history, RAG, or vector DB bolt-on
Concurrency control	CAS lifecycle, exactly-one-wake	None or implicit
Configuration	Conversation-driven, durable	Code, YAML, config files
Tool model	Programmable shell executor	Fixed tool registry
Multi-agent	Async message passing, no shared state	Shared transcripts or orchestrator patterns
Infrastructure	Rust + Postgres, no cloud lock-in	Python + various cloud services

Project Structure

src/
  api/                # HTTP handlers (agents, events, messages, bootstrap)
  background/         # Listener (NOTIFY/LISTEN) and stale recovery
  runtime/            # Wake loop, maintenance, LLM client, tools, prompt assembly
  models/             # Database models (agents, events, projections, etc.)
  config.rs           # Environment configuration
  db.rs               # Pool creation + migrations
  error.rs            # Unified error type
  auth.rs             # Token hashing
migrations/           # PostgreSQL schema (16 migrations)
tests/                # Integration tests (25 tests across 14 files)
docker-compose.yml    # App + PostgreSQL (Docker deploy)
docs/
  input/              # Architecture specs, TLA+ model, design docs
  reference/          # Audit reports, adoption plans
scaffolding/          # Scope, design, readiness, experiment log

Getting Started

To explore the architecture:

git clone https://github.com/RCSnyder/open-pincery.git
cd open-pincery

Start with:

docs/input/technical-stack.md — implementation stack and crate choices
docs/input/OpenPinceryAgent.tla — formal TLA+ specification of the agent state machine. Copy into TLA+ Process Studio for visualizing the state machine of the system.
docs/input/security-architecture.md — six-layer security model
docs/input/best-practices.md — practices mapped to academic research

References

The Continuous Agent Architecture that Open Pincery implements is described in detail in an upcoming publication by the original author, to be released under MIT license.

Academic Foundation

The platform's design practices are informed by emerging research in agentic software engineering:

Agentic Pipelines in Embedded Software Engineering: Emerging Practices and Challenges. arXiv:2601.10220 [cs.SE]. https://doi.org/10.48550/arXiv.2601.10220

Additional Resources

Agentic Strategy Lab — Deliverables — research and frameworks for agentic system design considerations. Only that page specifically.

License

Licensed under either of

Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Quick Start

Prerequisites

Docker 24+
Rust toolchain (for building the pcy CLI)
An LLM API key (OpenRouter default, OpenAI-compatible also supported)

Build the pcy CLI

cargo build --release --bin pcy

Then add it to your PATH:

# Linux / macOS / Git Bash
export PATH="$PWD/target/release:$PATH"

# Windows PowerShell
$env:PATH = "$PWD\target\release;$env:PATH"

Or copy the binary to a directory already on your PATH:

# Linux / macOS
sudo cp target/release/pcy /usr/local/bin/

# Windows PowerShell (admin)
Copy-Item target\release\pcy.exe C:\Windows\System32\

Web UI (fastest path)

Prepare env:

cp .env.example .env

Set non-placeholder values in .env for:

OPEN_PINCERY_BOOTSTRAP_TOKEN
LLM_API_KEY

Start stack and wait for health:

docker compose up -d --wait
curl -fsS http://localhost:8080/ready

Open the UI:

http://localhost:8080

Bootstrap once (copy the returned session token into the UI login panel):

pcy --url http://localhost:8080 bootstrap --bootstrap-token "$OPEN_PINCERY_BOOTSTRAP_TOKEN"

One-command end-to-end smoke test

If you just want to confirm the stack works, run:

pcy --url http://localhost:8080 demo --bootstrap-token "$OPEN_PINCERY_BOOTSTRAP_TOKEN"

This bootstraps (or logs in if already bootstrapped), creates a throwaway agent, sends it a message, waits up to 60s for a real LLM reply, and prints it. If this succeeds, your database, runtime, and LLM provider are all wired up correctly.

pcy CLI path

Use this if you prefer terminal-first operations.

If OPEN_PINCERY_URL=http://localhost:8080 is exported, the shortest command forms are:

pcy bootstrap --bootstrap-token "$OPEN_PINCERY_BOOTSTRAP_TOKEN"
pcy agent create "my-agent"
pcy message AGENT_ID "hello from cli"
pcy events AGENT_ID

Equivalent explicit URL form:

# one-time bootstrap
pcy --url http://localhost:8080 bootstrap --bootstrap-token "$OPEN_PINCERY_BOOTSTRAP_TOKEN"

# create an agent
pcy --url http://localhost:8080 agent create "my-agent"

# send a message
pcy --url http://localhost:8080 message AGENT_ID "hello from cli"

# read events
pcy --url http://localhost:8080 events AGENT_ID

You can run the full scripted flow with:

bash scripts/smoke.sh

On Windows PowerShell:

powershell -ExecutionPolicy Bypass -File .\scripts\smoke.ps1

curl/HTTP appendix

curl -X POST http://localhost:8080/api/bootstrap \
  -H "Authorization: Bearer YOUR_BOOTSTRAP_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"email": "admin@example.com", "name": "Admin"}'

curl -X POST http://localhost:8080/api/agents \
  -H "Authorization: Bearer SESSION_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"name": "my-agent"}'

curl -X POST http://localhost:8080/api/agents/AGENT_ID/messages \
  -H "Authorization: Bearer SESSION_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"content": "Hello, what can you do?"}'

curl http://localhost:8080/api/agents/AGENT_ID/events \
  -H "Authorization: Bearer SESSION_TOKEN"

From Signed Release Binary

If you do not want to build from source, use release artifacts from GitHub Releases (AC-20).

# Example verification flow (replace filenames with your release assets)
cosign verify-blob open-pincery-linux-x86_64 \
  --signature open-pincery-linux-x86_64.sig \
  --certificate open-pincery-linux-x86_64.pem \
  --certificate-identity-regexp "https://github.com/RCSnyder/open-pincery/.*" \
  --certificate-oidc-issuer "https://token.actions.githubusercontent.com"

Troubleshooting

Anchor index:

bootstrap-401
rate-limit-429
silent-wake
compose-up-failed
already-bootstrapped
lost-session-token
log-format-json
metrics-scrape
backup-one-liner

bootstrap-401

Confirm .env has the same OPEN_PINCERY_BOOTSTRAP_TOKEN you pass to pcy bootstrap.
Verify the compose stack loaded your env: docker compose config | grep OPEN_PINCERY_BOOTSTRAP_TOKEN.

rate-limit-429

Unauthenticated routes are limited to 10 req/min and authenticated routes to 60 req/min.
Back off and retry after the Retry-After header.

silent-wake

Check logs: docker compose logs -f app.
Confirm LLM_API_KEY is valid and LLM_API_BASE_URL points to your provider.

compose-up-failed

Check docker compose logs -f app and docker compose logs -f db for startup errors.
Confirm required .env values are set (OPEN_PINCERY_BOOTSTRAP_TOKEN, LLM_API_KEY, LLM_API_BASE_URL).
If this is a stale local state issue, run reset (docker compose down -v) and retry.

already-bootstrapped

/api/bootstrap is one-time initialization. To get a new session token, use pcy login --bootstrap-token <token> or POST /api/login.
If you need a full clean reset, run docker compose down -v and re-bootstrap.

lost-session-token

Run pcy login --bootstrap-token "$OPEN_PINCERY_BOOTSTRAP_TOKEN" to get a fresh session token.
The bootstrap token in .env is your recovery credential — keep it safe.

log-format-json

Set LOG_FORMAT=json in .env, restart with docker compose up -d, then stream logs:

docker compose logs -f app

metrics-scrape

Set METRICS_ADDR=127.0.0.1:9090 in .env and restart.
Scrape metrics:

curl -fsS http://127.0.0.1:9090/metrics | head

backup-one-liner

docker compose exec db pg_dump -U open_pincery open_pincery > backup.sql

See docs/runbooks/db-restore.md for restore steps.

Reset (wipe local state)

docker compose down -v

This removes the Postgres volume and all local data.

Going public with HTTPS

Default compose bindings are loopback-only (127.0.0.1) for safety. To expose Open Pincery over HTTPS with Caddy:

Edit Caddyfile.example with your real domain and email.
Start with overlay:

docker compose -f docker-compose.yml -f docker-compose.caddy.yml up -d

Confirm Caddy is serving 80/443 and reverse-proxying to app:8080.

Observability (optional)

Structured logs: set LOG_FORMAT=json for JSON lines suitable for log pipelines.
Prometheus metrics: set METRICS_ADDR=127.0.0.1:9090 to expose /metrics on a dedicated listener.
Operator runbooks: see docs/runbooks/ for stale wake, DB restore, rollback, rate-limit tuning, and webhook debugging.

API

Method	Path	Description
GET	`/health`	Liveness (always 200 while serving)
GET	`/ready`	Readiness (DB + migrations + bg tasks)
GET	`/metrics`	Prometheus scrape (opt-in, own port)
POST	`/api/bootstrap`	One-time admin setup
POST	`/api/login`	New session via bootstrap token
POST	`/api/agents`	Create agent (returns webhook_secret)
GET	`/api/agents`	List agents
GET	`/api/agents/:id`	Agent detail with projections
PATCH	`/api/agents/:id`	Update agent name/enabled status
DELETE	`/api/agents/:id`	Soft-delete (disable with reason)
POST	`/api/agents/:id/messages`	Send message (triggers wake)
GET	`/api/agents/:id/events`	Event log
POST	`/api/agents/:id/webhook/rotate`	Rotate per-agent webhook secret
POST	`/api/agents/:id/webhooks`	Webhook ingress (HMAC-SHA256)

Compatibility note: older docs may refer to /api/agents/:id/rotate-webhook-secret. The shipped v4/v5 route is /api/agents/:id/webhook/rotate.

All /api/* routes (except bootstrap) require Authorization: Bearer <session_token>. Webhook routes require X-Signature header with HMAC-SHA256 of the body.

Rate limits: 10 requests/minute (unauthenticated), 60 requests/minute (authenticated). Exceeding the limit returns 429 Too Many Requests with a Retry-After header.

Tests

cargo test -- --test-threads=1

open-pincery 1.0.1