cerebro-1.1.6 has been yanked.
Cerebro 🧠
A blazing-fast AI memory layer that enables teams of specialized agents to collaborate through a shared cognitive architecture.
Features
- 🚀 Minimal Overhead: Powered by a lean async pipeline in Rust, designed for high-scale agentic workloads.
- 🔌 Universal Storage: Trait-based backends — swap between MemoryVectorStore, PgVectorStore, or Qdrant.
- 🧠 Pluggable Compute: Route embeddings through local models (Candle) or remote APIs (OpenAI, Anthropic).
- 🔄 Active Consolidation: Background "Sleep Cycle" worker for autonomous memory pruning and semantic organization.
- 🔍 Hybrid Search: Native RRF (Reciprocal Rank Fusion) combining keyword and vector retrieval for highest precision.
- 🐝 SwarmForge: Multi-agent swarming engine with sequential, parallel, and hierarchical orchestration patterns.
- 🤖 Universal LLM: Supports Ollama, OpenAI, Gemini, Anthropic, and any OpenAI-compatible API.
- 🌐 MCP Ready: Native Model Context Protocol server (cerebro-mcp) for AI desktop apps.
- 🦀 Multi-Language: Native Python (PyO3) and WASM bindings.
- 📄 Complex Ingestion: PDF extraction and HTML-aware semantic chunking.
Getting Started
[]
= "1.1.6"
1. Working with the Memory Engine
Store and retrieve semantic memory using Reciprocal Rank Fusion (hybrid search).
use *;
use Arc;
async
2. Building a Multi-Agent Swarm
Orchestrate a team of agents that share Cerebro's memory layer.
use *;
use *;
use Arc;
async
Supported LLM Providers
| Provider | Config Variant | Target Use Cases |
|---|---|---|
| Ollama | LlmProvider::Ollama |
Privacy-first local models (Llama 3, Mistral, Phi) |
| Anthropic | LlmProvider::Anthropic |
Deep reasoning and coding (Claude 3.5 Sonnet) |
| OpenAI | LlmProvider::OpenAI |
General agentic workflows (GPT-4o, o3) |
| Google Gemini | LlmProvider::Gemini |
Multimodal data ingestion (Gemini 1.5 Pro) |
| Universal API | LlmProvider::OpenAICompatible |
Fast inference engines (Groq, Together, vLLM) |