zeph 0.12.6

Lightweight AI agent with hybrid inference, skills-first architecture, and multi-channel I/O
zeph-0.12.6 is not a library.

The AI agent that respects your resources.

Single binary. Minimal hardware. Maximum context efficiency.

Crates.io docs CI codecov MSRV License: MIT


Zeph is a Rust AI agent built around one principle: every token in the context window must earn its place. Skills are retrieved semantically, tool output is filtered before injection, and the context compacts automatically under pressure — keeping costs low and responses fast on hardware you already own.

curl -fsSL https://github.com/bug-ops/zeph/releases/latest/download/install.sh | sh
zeph init   # interactive setup wizard
zeph        # start the agent

[!TIP] cargo install zeph also works. Pre-built binaries and Docker images are on the releases page.


What's inside

Feature Description
Hybrid inference Ollama, Claude, OpenAI, any OpenAI-compatible API, or fully local via Candle (GGUF). Multi-model orchestrator with fallback chains and EMA latency routing. → Providers
Skills-first architecture YAML+Markdown skill files with BM25+cosine hybrid retrieval. Bayesian re-ranking, 4-tier trust model, and self-learning evolution — skills improve from real usage. The load_skill tool lets the LLM fetch the full body of any skill outside the active TOP-N set on demand. → Skills · → Self-learning
Context engineering Semantic skill selection, command-aware output filters, tool-pair summarization, and reactive middle-out compaction keep the window efficient under any load. → Context
Semantic memory SQLite + Qdrant with MMR re-ranking, temporal decay, cross-session recall, implicit correction detection, and credential scrubbing. → Memory
IDE integration (ACP) Stdio, HTTP+SSE, or WebSocket transport. Session modes, live tool streaming, LSP diagnostics injection, file following, usage reporting. Works in Zed, Helix, VS Code. → ACP
Multi-channel I/O CLI, Telegram, TUI dashboard — all with streaming. Voice and vision input supported. → Channels
MCP & A2A MCP client with full tool exposure to the model. A2A agent-to-agent protocol for multi-agent orchestration. → MCP · → A2A
Sub-agents Spawn isolated agents with scoped tools, skills, and zero-trust secret delegation — defined as Markdown files. → Sub-agents
Instruction files Drop zeph.md (or CLAUDE.md / AGENTS.md) in your project root. Zeph auto-detects and injects them into every system prompt — project rules, conventions, and domain knowledge applied automatically. Changes are picked up live via filesystem watching (500 ms debounce) — no restart required. → Instruction Files
Defense-in-depth Shell sandbox, SSRF protection, skill trust quarantine, secret zeroization, audit logging, unsafe_code = "deny" workspace-wide. → Security
Document RAG zeph ingest <path> indexes .txt, .md, .pdf into Qdrant. Relevant chunks surface automatically on each turn. → Document loaders
Daemon & scheduler HTTP webhook gateway with bearer auth. Cron-based periodic tasks and one-shot deferred tasks with SQLite persistence — add, update, or cancel tasks at runtime via natural language using the built-in scheduler skill. Background mode. → Daemon
Single binary ~15 MB, no runtime dependencies, ~50 ms startup, ~20 MB idle memory.
┌─ Skills (3/12) ────────────────────┐┌─ MCP Tools ─────────────────────────┐
│  web-search  [████████░░] 82% (117)││  - filesystem/read_file             │
│  git-commit  [███████░░░] 73%  (42)││  - filesystem/write_file            │
│  code-review [████░░░░░░] 41%   (8)││  - github/create_pr                 │
└────────────────────────────────────┘└─────────────────────────────────────┘

Documentation

Full documentation — installation, configuration, guides, and architecture reference — at bug-ops.github.io/zeph.

Contributing

See CONTRIBUTING.md. Found a vulnerability? Use GitHub Security Advisories.

License

MIT