zeph-0.13.0 is not a library.
The AI agent that respects your resources.
Single binary. Minimal hardware. Maximum context efficiency.
Zeph is a Rust AI agent built around one principle: every token in the context window must earn its place. Skills are retrieved semantically, tool output is filtered before injection, and the context compacts automatically under pressure — keeping costs low and responses fast on hardware you already own.
|
[!TIP]
cargo install zephalso works. Pre-built binaries and Docker images are on the releases page.
What's inside
| Feature | Description |
|---|---|
| Hybrid inference | Ollama, Claude, OpenAI, any OpenAI-compatible API, or fully local via Candle (GGUF). Multi-model orchestrator with fallback chains, EMA latency routing, and adaptive Thompson Sampling for exploration/exploitation-balanced model selection. → Providers |
| Skills-first architecture | YAML+Markdown skill files with BM25+cosine hybrid retrieval. Bayesian re-ranking, 4-tier trust model, and self-learning evolution — skills improve from real usage. Agent-as-a-Judge feedback detection with adaptive regex/LLM hybrid analysis. The load_skill tool lets the LLM fetch the full body of any skill outside the active TOP-N set on demand. → Skills · → Self-learning |
| Context engineering | Semantic skill selection, command-aware output filters, tool-pair summarization, proactive context compression (reactive + proactive strategies), and reactive middle-out compaction keep the window efficient under any load. → Context |
| Semantic memory | SQLite + Qdrant with MMR re-ranking, temporal decay, query-aware memory routing (keyword/semantic/hybrid), cross-session recall, implicit correction detection, and credential scrubbing. → Memory |
| IDE integration (ACP) | Stdio, HTTP+SSE, or WebSocket transport. Session modes, live tool streaming, LSP diagnostics injection, file following, usage reporting. Works in Zed, Helix, VS Code. → ACP |
| Multi-channel I/O | CLI, Telegram, TUI dashboard — all with streaming. Voice and vision input supported. → Channels |
| MCP & A2A | MCP client with full tool exposure to the model. A2A agent-to-agent protocol for multi-agent orchestration. → MCP · → A2A |
| Sub-agents | Spawn isolated agents with scoped tools, skills, and zero-trust secret delegation — defined as Markdown files. 4-level resolution priority (CLI > project > user > config), permission_mode (default/accept_edits/dont_ask/bypass_permissions/plan), fine-grained tools.except denylists, background fire-and-forget execution, max_turns limits, persistent memory scopes (user/project/local) with MEMORY.md injection, persistent JSONL transcript storage with /agent resume for continuing completed sessions, and lifecycle hooks (SubagentStart/SubagentStop at config level, PreToolUse/PostToolUse per agent with pipe-separated matchers). Manage definitions with `zeph agents list |
| Instruction files | Drop zeph.md (or CLAUDE.md / AGENTS.md) in your project root. Zeph auto-detects and injects them into every system prompt — project rules, conventions, and domain knowledge applied automatically. Changes are picked up live via filesystem watching (500 ms debounce) — no restart required. → Instruction Files |
| Defense-in-depth | Shell sandbox, SSRF protection, skill trust quarantine, secret zeroization, audit logging, unsafe_code = "deny" workspace-wide. → Security |
| Document RAG | zeph ingest <path> indexes .txt, .md, .pdf into Qdrant. Relevant chunks surface automatically on each turn. → Document loaders |
| Daemon & scheduler | HTTP webhook gateway with bearer auth. Cron-based periodic tasks and one-shot deferred tasks with SQLite persistence — add, update, or cancel tasks at runtime via natural language using the built-in scheduler skill. Background mode. → Daemon |
| Single binary | ~15 MB, no runtime dependencies, ~50 ms startup, ~20 MB idle memory. |
┌─ Skills (3/12) ────────────────────┐┌─ MCP Tools ─────────────────────────┐
│ web-search [████████░░] 82% (117)││ - filesystem/read_file │
│ git-commit [███████░░░] 73% (42)││ - filesystem/write_file │
│ code-review [████░░░░░░] 41% (8)││ - github/create_pr │
└────────────────────────────────────┘└─────────────────────────────────────┘
Documentation
Full documentation — installation, configuration, guides, and architecture reference — at bug-ops.github.io/zeph.
Contributing
See CONTRIBUTING.md. Found a vulnerability? Use GitHub Security Advisories.