Vectorless
A hierarchical, reasoning-native document intelligence engine.
Features
- Tree-based indexing — Documents are organized as hierarchical trees, not flat vectors
- LLM-driven retrieval — Uses reasoning to navigate document structure
- Multi-format support — Markdown, PDF, HTML, DOCX (planned)
- Workspace persistence — LRU-cached storage with lazy loading
- Configurable retrieval — Pluggable retriever strategies (LLM navigate, beam search, MCTS)
Quick Start
use ;
async
Configuration
Create config.toml in your project root:
[]
= "gpt-4o-mini"
= "https://api.openai.com/v1"
= "sk-..."
[]
= "gpt-4o"
= "llm_navigate"
= 3
[]
= "./workspace"
Status
Early development. Core functionality works:
- ✅ Markdown indexing with LLM summaries
- ✅ Tree-based retrieval via LLM navigation
- ✅ Workspace persistence with LRU cache
- 🚧 PDF/HTML/DOCX parsing
- 🚧 Additional retriever strategies
License
Apache-2.0