Cerebro π§
A blazing-fast AI memory layer that enables teams of specialized agents to collaborate through a shared cognitive architecture.
Features
- π Minimal Overhead: Powered by a lean async pipeline in Rust, designed for high-scale agentic workloads.
- π Universal Storage: Trait-based backends β swap between MemoryVectorStore, PgVectorStore, or Qdrant.
- π§ Pluggable Compute: Route embeddings through local models (Candle) or remote APIs (OpenAI, Anthropic).
- π Active Consolidation: Background "Sleep Cycle" worker for autonomous memory pruning and semantic organization.
- π Hybrid Search: Native RRF (Reciprocal Rank Fusion) combining keyword and vector retrieval for highest precision.
- πΈοΈ Graphify: LLM-powered dynamic entity extraction into Neo4j or in-memory Knowledge Graphs.
- 𧬠Advanced Cognitive Architecture: Built-in time-traveling event-sourced memory, swarm immunology, and 3D spatial semantic navigation.
- π SwarmForge: Multi-agent swarming engine with sequential, parallel, and hierarchical orchestration patterns.
- π€ Universal LLM: Supports Ollama, OpenAI, Gemini, Anthropic, and any OpenAI-compatible API.
- π MCP Ready: Native Model Context Protocol server (cerebro-mcp) for AI desktop apps.
- π¦ Multi-Language: Native Python (PyO3) and WASM bindings.
- π Complex Ingestion: PDF extraction and HTML-aware semantic chunking.
Getting Started
[]
= "1.1.8"
1. Working with the Memory Engine
Store and retrieve semantic memory using Reciprocal Rank Fusion (hybrid search).
use *;
use Arc;
async
2. Building a Multi-Agent Swarm
Orchestrate a team of agents that share Cerebro's memory layer.
use *;
use *;
use Arc;
async
3. Model Context Protocol (MCP)
Cerebro acts as a native Model Context Protocol server for your local IDE (Cursor, Claude Desktop).
Compile the server:
Add the generated binary to your IDE's MCP config:
Supported LLM Providers
| Provider | Config Variant | Target Use Cases |
|---|---|---|
| Ollama | LlmProvider::Ollama |
Privacy-first local models (Llama 3, Mistral, Phi) |
| Anthropic | LlmProvider::Anthropic |
Deep reasoning and coding (Claude 3.5 Sonnet) |
| OpenAI | LlmProvider::OpenAI |
General agentic workflows (GPT-4o, o3) |
| Google Gemini | LlmProvider::Gemini |
Multimodal data ingestion (Gemini 1.5 Pro) |
| Universal API | LlmProvider::OpenAICompatible |
Fast inference engines (Groq, Together, vLLM) |