zeph-skills

SKILL.md parser, registry, embedding matcher, and hot-reload for Zeph.

Overview

Parses SKILL.md files (YAML frontmatter + markdown body) from the skills/ directory, maintains an in-memory registry with hot-reload support, and formats selected skills into LLM system prompts. Supports semantic matching via Qdrant embeddings and self-learning skill evolution with trust scoring.

Key modules

Module	Description
`loader`	SKILL.md parser (YAML frontmatter + markdown)
`registry`	In-memory skill registry with hot-reload
`matcher`	Keyword-based skill matching
`qdrant_matcher`	Semantic skill matching via Qdrant; BM25 hybrid search with RRF fusion when `hybrid_search = true`
`evolution`	Self-learning skill generation and refinement; handles `FailureKind`-tagged rejections and triggers improvement cycles
`trust`	`SkillTrust` — Wilson score Bayesian re-ranking (`posterior_weight`, `posterior_mean`); `check_trust_transition()` for auto-promote/demote; re-exports `TrustLevel` from `zeph-tools`
`watcher`	Filesystem watcher for skill hot-reload
`prompt`	Skill-to-prompt formatting (`full`, `compact`, `auto` modes via `SkillPromptMode`); injects `reliability="N%"` and `uses="N"` health XML attributes
`manager`	`SkillManager` — install, remove, verify, and list external skills (`~/.config/zeph/skills/`)

Re-exports: SkillError, SkillTrust, TrustLevel (from zeph-tools), compute_skill_hash

Prompt modes

The prompt_mode config option ([skills] section) controls how skills are serialized into the LLM system prompt:

Mode	Description
`full`	Full XML format with complete skill body (default)
`compact`	Condensed XML with name, description, and trigger list only
`auto`	Selects `compact` when context budget is below threshold, `full` otherwise

All modes include reliability="N%" and uses="N" XML attributes derived from the Wilson score posterior, so the model is aware of each skill's historical reliability.

Self-learning and re-ranking

Skills accumulate outcomes over time. After each use, the Wilson score lower-bound is recomputed and stored as posterior_weight and posterior_mean. check_trust_transition() evaluates whether accumulated evidence justifies a trust level change:

Sufficient high-quality outcomes → promote toward Trusted
Repeated failures or rejections → demote toward Quarantined

The /skill reject <name> <reason> command records a typed FailureKind rejection immediately, persisting it to the outcome_detail column (migration 018).

Hybrid search configuration

[skills]
cosine_weight  = 0.7   # weight of cosine similarity in RRF fusion (default: 0.7)
hybrid_search  = true  # enable BM25 + cosine hybrid search (default: true)

[!NOTE] When hybrid_search = true, BM25 keyword scores are computed locally and fused with Qdrant cosine scores using Reciprocal Rank Fusion. This improves recall for exact-match queries while preserving semantic ranking quality for paraphrase queries.

Installation

cargo add zeph-skills

License

MIT

zeph-skills 0.13.0