zeph-0.12.5 is not a library.
The AI agent that respects your resources.
Single binary. Minimal hardware. Maximum context efficiency.
Zeph is a Rust AI agent built around one principle: every token in the context window must earn its place. Skills are retrieved semantically, tool output is filtered before injection, and the context compacts automatically under pressure — keeping costs low and responses fast on hardware you already own.
|
[!TIP]
cargo install zephalso works. Pre-built binaries and Docker images are on the releases page.
What's inside
| Feature | Description |
|---|---|
| Hybrid inference | Ollama, Claude, OpenAI, any OpenAI-compatible API, or fully local via Candle (GGUF). Multi-model orchestrator with fallback chains and EMA latency routing. → Providers |
| Skills-first architecture | YAML+Markdown skill files with BM25+cosine hybrid retrieval. Bayesian re-ranking, 4-tier trust model, and self-learning evolution — skills improve from real usage. The load_skill tool lets the LLM fetch the full body of any skill outside the active TOP-N set on demand. → Skills · → Self-learning |
| Context engineering | Semantic skill selection, command-aware output filters, tool-pair summarization, and reactive middle-out compaction keep the window efficient under any load. → Context |
| Semantic memory | SQLite + Qdrant with MMR re-ranking, temporal decay, cross-session recall, implicit correction detection, and credential scrubbing. → Memory |
| IDE integration (ACP) | Stdio, HTTP+SSE, or WebSocket transport. Session modes, live tool streaming, LSP diagnostics injection, file following, usage reporting. Works in Zed, Helix, VS Code. → ACP |
| Multi-channel I/O | CLI, Telegram, TUI dashboard — all with streaming. Voice and vision input supported. → Channels |
| MCP & A2A | MCP client with full tool exposure to the model. A2A agent-to-agent protocol for multi-agent orchestration. → MCP · → A2A |
| Sub-agents | Spawn isolated agents with scoped tools, skills, and zero-trust secret delegation — defined as Markdown files. → Sub-agents |
| Instruction files | Drop zeph.md (or CLAUDE.md / AGENTS.md) in your project root. Zeph auto-detects and injects them into every system prompt — project rules, conventions, and domain knowledge applied automatically. → Instruction Files |
| Defense-in-depth | Shell sandbox, SSRF protection, skill trust quarantine, secret zeroization, audit logging, unsafe_code = "deny" workspace-wide. → Security |
| Document RAG | zeph ingest <path> indexes .txt, .md, .pdf into Qdrant. Relevant chunks surface automatically on each turn. → Document loaders |
| Daemon & scheduler | HTTP webhook gateway with bearer auth, cron-based task scheduler with SQLite persistence, background mode. → Daemon |
| Single binary | ~15 MB, no runtime dependencies, ~50 ms startup, ~20 MB idle memory. |
┌─ Skills (3/12) ────────────────────┐┌─ MCP Tools ─────────────────────────┐
│ web-search [████████░░] 82% (117)││ - filesystem/read_file │
│ git-commit [███████░░░] 73% (42)││ - filesystem/write_file │
│ code-review [████░░░░░░] 41% (8)││ - github/create_pr │
└────────────────────────────────────┘└─────────────────────────────────────┘
Documentation
Full documentation — installation, configuration, guides, and architecture reference — at bug-ops.github.io/zeph.
Contributing
See CONTRIBUTING.md. Found a vulnerability? Use GitHub Security Advisories.