trusty-memory
Memory palace MCP server (HTTP/SSE) backed by usearch (HNSW) vector store,
redb metadata and knowledge-graph stores, and fastembed embeddings. Stores
and retrieves natural-language memories organized into named "palaces"
(namespaces), with an optional knowledge-graph layer for structured triples.
Claude Code integration uses trusty-memory serve --stdio — a direct
stdio JSON-RPC MCP server that forwards every request to the running HTTP
daemon and returns daemon responses verbatim. No Unix domain socket is
involved.
A DEPRECATED trusty-memory-mcp-bridge shim binary is also installed by
cargo install trusty-memory so that existing .mcp.json configs that still
reference the old bridge name keep working without per-project changes.
To update your config, manually set the trusty-memory entry to
"command": "trusty-memory", "args": ["serve", "--stdio"] in your .mcp.json
or ~/.claude/mcp.json.
Integrates with Claude Code and any other MCP-aware client as a first-class long-term memory backend.
System Requirements
- RAM: 512 MB minimum; 1 GB+ recommended (ONNX embedding model loads ~22 MB, usearch index scales with corpus size)
- Disk: ~100 MB for the model cache on first run
(
~/Library/Application Support/trusty-memory/on macOS,~/.local/share/trusty-memory/on Linux) - Rust: 1.88+ (if building from source)
Installation
From GitHub Releases (recommended for binary users)
Prebuilt binaries are available for macOS (Apple Silicon) and Linux (x86_64).
-
Download the latest release from GitHub Releases:
- Look for assets tagged
trusty-memory-v0.15.0 - Download the archive for your platform:
- macOS arm64 (Apple Silicon):
trusty-memory-v0.15.0-aarch64-apple-darwin.tar.gz - Linux x86_64:
trusty-memory-v0.15.0-x86_64-unknown-linux-gnu.tar.gz
- macOS arm64 (Apple Silicon):
- Look for assets tagged
-
Extract and install:
-
Verify the installation:
From Source with Cargo
Requires Rust 1.91 or later (install Rust).
This builds from the latest commit on main and installs the binary to ~/.cargo/bin/. Make sure ~/.cargo/bin/ is on your PATH.
Installing trusty-memory produces four binaries in one command: trusty-memory, trusty-memory-mcp-bridge (deprecated shim — forwards to serve --stdio), trusty-bm25-daemon, and trusty-console.
To install a specific version:
With Homebrew (recommended)
Or install directly without tapping:
Homebrew provides:
- Automatic updates via
brew upgrade trusty-memory - Standard macOS / Linux PATH integration
- Easy dependency management
Prerequisites & Special Cases
Prerequisites
None — the daemon is self-contained and requires no external databases or configuration files to start.
Optional: OpenRouter API Key
The embedded memory UI includes a chat panel that requires an OpenRouter API key for the language model integration. Set OPENROUTER_API_KEY in your environment or enter it in the UI to enable chat features.
Chat is optional; the daemon fully functions without it.
Note: Embedded Svelte UI
This crate embeds a Svelte admin UI (built and compiled into the binary). The UI is pre-built and included in releases; no additional steps are needed. The embedded UI runs on http://127.0.0.1:<port> — see the daemon output for the live port.
Verify Installation
All installations can be verified by running:
Expected output: the semantic version of the installed binary (e.g., trusty-memory 0.15.0).
Quick Start
Start the daemon
By default, serve self-spawns a detached background daemon (alias for
trusty-memory start) and returns control to the shell so you keep your
prompt. The daemon binds HTTP/SSE on a dynamic port in the 7070..=7079
range (with OS fallback) and writes the resolved address to its
discovery file. Pass --foreground to keep the daemon inline (used by
launchd / systemd / Docker), or --http <ADDR> to pin a specific address.
Check the listening port
# Shell substitution — stdout is clean (logs go to stderr):
Exits non-zero with a message on stderr when no daemon is running, so shell substitution fails cleanly.
Browser dashboard + REST API
The same trusty-memory serve daemon serves the embedded Svelte admin UI
at the bound address (printed by trusty-memory monitor web once the
daemon is running) and a REST API under /api/v1/.
Key REST API field names (verified against src/web.rs and src/service.rs):
- Recall endpoints (
GET /api/v1/palaces/{id}/recallandGET /api/v1/recall) accept the query string asq(notquery):?q=my+search+term&top_k=5. - Drawer-create body (
POST /api/v1/palaces/{id}/drawers) expects acontentfield (nottext) for the drawer body.
Bind to a named palace
When all tool calls should default to one palace namespace, use --palace:
With a default set, the palace argument becomes optional in every MCP tool
call.
Claude Code Integration
Run trusty-memory setup once — it installs the launchd LaunchAgent
(macOS), pre-warms the embedder cache, and patches every Claude settings
file it finds with the canonical MCP server entry.
Canonical MCP config (0.15.3+)
The recommended entry in .mcp.json or ~/.claude/mcp.json is:
trusty-memory serve --stdio is a pure daemon-bridge proxy: it ensures the
HTTP daemon is running (auto-starting it if absent), then forwards every
JSON-RPC request to POST /rpc on the daemon and returns the response
verbatim. No Unix domain socket is involved. The stdio process never opens
the redb write-lock directly, so it co-exists safely with the running HTTP
daemon.
If you are switching from the legacy kuzu-memory server, run
trusty-memory migrate kuzu-memory to rewrite all Claude settings files
automatically (see Migrating from kuzu-memory below).
Compatibility shim (for existing installations)
If your .mcp.json still references trusty-memory-mcp-bridge, the shim
binary (installed alongside trusty-memory since 0.15.3) forwards to
serve --stdio automatically. You will see a one-line deprecation warning
on stderr (which Claude Code ignores). To update your config manually, set
the trusty-memory entry to:
There is no automated command to rewrite trusty-memory-mcp-bridge entries
(unlike the kuzu-memory migration). The shim will be removed in a future
minor version.
Claude Code auto-discovers .mcp.json on project open. The daemon must be
running (started either by trusty-memory setup's LaunchAgent or by
trusty-memory start / trusty-memory serve).
Available MCP Tools
The MCP server registers 25 tools (authoritative source:
src/tools/definitions.rs tool_definitions, asserted by the
tool_definitions_lists_all_tools test). All are exposed via both the MCP
protocol (over the serve --stdio path) and the HTTP API (/api/v1/). The
palace argument is required unless the server was started with --palace <name>. The tables below cover the primary surface; the full roster also
includes memory_note, memory_recall_all, palace_delete,
palace_update, palace_compact, kg_gaps, kg_bootstrap, add_alias,
discover_aliases, list_prompt_facts, remove_prompt_fact,
get_prompt_context, memory_send_message, upgrade, and
console_metrics.
Memory tools
| Tool | Arguments | Description |
|---|---|---|
memory_remember |
palace, text, room?, tags? |
Store a memory. Returns the drawer_id. |
memory_recall |
palace, query, top_k? |
Hybrid BM25+vector recall (L0/L1/L2 layers). |
memory_recall_deep |
palace, query, top_k? |
Deep recall (L3 — slower, higher recall). |
memory_list |
palace, room?, tag?, limit? |
List stored memories, optionally filtered. |
memory_forget |
palace, drawer_id |
Delete a specific memory by ID. |
Palace management tools
| Tool | Arguments | Description |
|---|---|---|
palace_create |
name, description?, force? |
Create a new palace namespace. See Palace as Project for naming rules. Pass force=true to bypass project-slug validation and create a palace under an arbitrary app/tenant slug (chat-session-manager use case). |
palace_list |
— | List all palaces and their IDs. |
palace_info |
palace |
Palace metadata and statistics. |
Chat session tools
A redb-backed, per-palace chat store for applications using trusty-memory as a
conversation session manager. Turns are stored verbatim and bypass the
memory_remember signal/noise + dedup gates.
| Tool | Arguments | Description |
|---|---|---|
chat_session_create |
palace, session_id?, title? |
Create (or reference) a session. Returns { session_id, created_at, message_count }. |
chat_session_add_turn |
palace, session_id, role, content |
Append a user/assistant/system turn (creates the session if missing). Returns { message_count, updated_at }. |
chat_session_get |
palace, session_id |
Full session with all turns in order. |
chat_session_list |
palace, limit?=50, offset?=0 |
Paginated session metadata. Returns { sessions, total_count }. |
Knowledge graph tools
| Tool | Arguments | Description |
|---|---|---|
kg_assert |
palace, subject, predicate, object, confidence?, provenance? |
Assert a knowledge triple. |
kg_query |
palace, subject |
Query triples by subject. |
Dream and status tools
| Tool | Arguments | Description |
|---|---|---|
memory_dream |
palace? |
Run a consolidation cycle (merge near-duplicates, prune, compact). |
dream_consolidate_room |
palace, room?, max_age_days?=7 |
On-demand, synchronous LLM consolidation scoped to one room (omit room for all rooms). Summarises facts older than max_age_days into canonical drawers, then evicts the superseded originals. Task drawers are always skipped. No-op when no inference backend is configured. Returns { summary_facts_created, facts_evicted }. |
memory_status |
— | Global statistics (total drawers, vectors, KG triples). |
Task drawers (protected memory)
DrawerType::Task is a drawer classification for goals, milestones, and
checkpoints that an application must re-derive across sessions. Task drawers are
protected from the dream cycle: they are never evicted (content-prune,
dedup-merge, or age/importance prune) and never consolidated into summaries,
regardless of age or importance. An optional completed_at timestamp marks a
task done — this makes it eligible for manual cleanup but never triggers
automatic eviction. Store a Task drawer by classifying a write as Task; it
will survive every subsequent dream cycle until explicitly deleted.
Dream Cycle: Semantic Consolidation (issue #87)
The memory_dream tool (and the background idle-dream timer) now includes an
inference-backed semantic consolidation phase after the existing NLP passes
(dedup, prune, closet-refresh):
What it does
- Groups palace drawers into batches of up to 8 (configurable via
DreamConfig). - Sends each batch to an LLM backend (OpenRouter or local Ollama) with a
structured prompt asking for three actions:
- Alias — two terms refer to the same concept (e.g.
"ts"→"trusty-search"). Asuperseded_byKG triple is written to link the alias to its canonical form. - Merge — a cluster of overlapping drawers is collapsed into one canonical
drawer. The original drawers are preserved (additive-only policy); a
superseded_byKG triple is written from each original to the new canonical drawer so lineage is traceable. - Flag — a drawer contradicts another and should be reviewed by a human.
Flagged drawers are logged at
WARNlevel but not deleted.
- Alias — two terms refer to the same concept (e.g.
- Each batch response is cached by a SHA-256 key over the drawer IDs + content, so repeated dream cycles do not re-spend LLM tokens on stable content.
- A per-cycle call budget (
max_calls_per_cycle, default 20) prevents runaway LLM spend on large palaces.
Configuration
Semantic consolidation is enabled when at least one inference backend is available:
| Priority | When used |
|---|---|
| OpenRouter | OPENROUTER_API_KEY env var is set, or DreamConfig.openrouter_api_key is non-empty |
| Ollama (local) | DreamConfig.local_model_enabled = true AND no OpenRouter key is present |
| Disabled (no-op) | Neither backend is available — the phase is silently skipped |
The consolidation model defaults to anthropic/claude-haiku-4-5 for low cost.
Override via DreamConfig.semantic.model.
# ~/.trusty-memory/config.toml — set these to enable semantic consolidation
[]
= "sk-or-v1-..." # enables semantic consolidation via OpenRouter # pragma: allowlist secret
[]
= false # set true + unset api_key to use Ollama instead
= "http://127.0.0.1:11434"
= "llama3"
Testing
All consolidation tests use MockInference — no real API calls are made during
cargo test. Live-inference tests are marked #[ignore]:
# Unit + integration tests (no network)
# Live-network tests (requires OPENROUTER_API_KEY)
Inter-project messaging (issue #99)
| Tool | Arguments | Description |
|---|---|---|
memory_send_message |
to_palace, purpose, content, from_palace? |
Deliver a message to another palace's inbox. |
Plus two CLI subcommands:
trusty-memory send-message --to <palace> --purpose <p> --content <text> [--from <palace>]— non-MCP entry point. Posts to the daemon'sPOST /api/v1/messages.trusty-memory inbox-check [--palace <id>]— installed as a Claude CodeSessionStarthook bysetup. Reads unread messages from the cwd-derived palace, prints them to stdout (Claude Code injects stdout as session context), and atomically marks them read.
Design
A message is a drawer in the recipient's palace carrying a namespaced tag envelope. No new schema, no new database — just convention:
| Tag | Example | Meaning |
|---|---|---|
msg:v1 |
(literal) | Marker tag for the v1 envelope. |
msg:from=<palace> |
msg:from=trusty-tools |
Sender palace id. |
msg:to=<palace> |
msg:to=claude-mpm |
Recipient palace id (audit). |
msg:purpose=<text> |
msg:purpose=task |
Free-text purpose / category. |
msg:sent_at=<rfc3339> |
msg:sent_at=2026-05-25T12:34:56+00:00 |
UTC send timestamp. |
msg:read=<bool> |
msg:read=false |
Receiver-flipped read flag. |
Addressing
Sender and recipient palaces are addressed by repo slug. The slug is derived from the working directory by:
- Take the basename of
git rev-parse --show-toplevel(or cwd, when not in a git checkout). - Strip a trailing
.gitsuffix if present. - Lowercase.
- Replace every run of whitespace or
_with a single-. - Strip every character outside
[a-z0-9-]. - Collapse consecutive
-and trim leading/trailing-.
Examples (all resolve to trusty-tools):
/Users/bob/Projects/trusty-tools,
/Users/bob/Projects/Trusty_Tools,
/Users/bob/Projects/trusty tools,
/Users/bob/Projects/.trusty-tools.git.
No central registry; sender and receiver agree on the slug out of band.
Delivery
The receiver's trusty-memory setup installs trusty-memory inbox-check as
a SessionStart hook in every Claude Code settings file it finds (alongside
the existing UserPromptSubmit prompt-context hook). On every new Claude
Code session, the hook:
- Resolves the receiver palace slug from cwd.
- Fetches unread messages from
GET /api/v1/messages?palace=<slug>&unread_only=true. - Prints each as a Markdown block to stdout — Claude Code injects stdout as session context.
- Atomically marks each delivered message read via
POST /api/v1/messages/mark_read.
The mark-read step uses an in-memory compare-and-swap on the palace's
drawer table so two concurrent sessions opening at once cannot
double-deliver: exactly one observes read=false and flips the flag, the
other returns false and emits nothing.
Every failure path in inbox-check degrades to exit 0 with empty stdout
so a missing or slow daemon never blocks Claude Code session start.
Migration from claude-mpm /mpm-message
This primitive replaces the Python /mpm-message skill in claude-mpm
(which wrote to ~/.claude-mpm/messaging.db via a process-local SQLite
file). The companion ticket in claude-mpm is #557; data migration is
out of scope here.
Palace as Project
(Issue #88) Palace names are anchored to the project they belong to.
One palace per project
When you create a new palace, trusty-memory walks upward from the current
working directory looking for project markers (.git, Cargo.toml,
pyproject.toml, package.json, go.mod) and derives a canonical slug
from the project root's directory name:
/Users/bob/Projects/trusty-tools → trusty-tools
/Users/bob/Projects/My_App! → my-app
/Users/bob/Projects/my project → my-project
New palaces must be named with the derived slug:
# Inside /Users/bob/Projects/trusty-tools — this succeeds
)
# Any other name fails with a descriptive error:
# "palace name 'notes' does not match the project slug 'trusty-tools'.
# Either use 'trusty-tools' or use 'personal' for non-project memories."
)
The personal palace
When operating outside any project directory (no .git / Cargo.toml /
etc. found anywhere above CWD), only the special palace name personal is
allowed. This is the "no project, no problem" escape hatch for global notes,
one-off sessions, and personal task lists.
# From any directory without a project root:
) # always succeeds
) # fails — no project root detected
Existing palaces are grandfathered
The enforcement applies only to new palace creation. Every palace created before this feature was added continues to work without any migration. Existing palaces remain read-write and are never auto-renamed.
doctor --fix-palaces
The doctor --fix-palaces subcommand audits existing palaces and reports
which ones are orphaned (name does not correspond to a detectable project
directory on disk):
# Dry-run audit (no filesystem changes):
# Include rename suggestions for orphaned palaces (still read-only):
Output example:
✅ trusty-tools — project palace ok
⚠️ old-project — orphaned (no matching project directory found on disk)
→ rename suggested: old-project → personal
❌ broken-palace — empty (no palace.json; directory may be a leftover)
· palace audit: 1 ok, 1 orphaned, 1 empty.
The --fix flag prints rename suggestions but does not mutate the
filesystem. Actual renaming (merging orphaned data into personal) is
planned for a future release. The doctor audit is purely advisory for
existing palaces.
Web UI
When running in HTTP mode, the embedded Svelte admin dashboard is available at:
http://127.0.0.1:<port>/
The dashboard provides:
- Real-time palace overview (drawer counts, vector counts, KG triple counts)
- Live event stream (palace created, drawer added/deleted, dream completed)
- Manual dream (consolidation) trigger
- Palace-scoped memory browsing
Configuration
Environment variables
| Variable | Default | Description |
|---|---|---|
RUST_LOG |
warn |
Tracing filter. E.g. RUST_LOG=info or RUST_LOG=trusty_memory=debug. |
OPENROUTER_API_KEY |
— | Enables chat completions via OpenRouter (/api/v1/chat). |
TRUSTY_DATA_DIR_OVERRIDE |
— | Override the data directory (intended for tests). |
Config file
~/.trusty-memory/config.toml (created on first run if absent):
[]
= ""
= "anthropic/claude-3.5-sonnet"
[]
= false
= "http://127.0.0.1:11434"
= "llama3"
Data directory
Memories and vector indexes persist under the OS-standard data directory:
- macOS:
~/Library/Application Support/trusty-memory/<palace-id>/ - Linux:
~/.local/share/trusty-memory/<palace-id>/
Each palace directory contains:
kg.redb— redb store for drawer metadata and knowledge-graph triplesindex.usearch— usearch vector index (approximate nearest-neighbour)recall.redb— recall analytics log (redb; migrated fromrecall.dbon first open)l1_cache.json— top-15 drawers by importance (L1 hot cache, rebuilt at each write)palace.json— palace metadata (name, description, created_at)
Architecture
trusty-memory (this crate) trusty-common `memory-core` feature
axum HTTP/SSE server ──────► PalaceRegistry
Unix domain socket ──────► usearch vector index (index.usearch)
embedded Svelte UI redb metadata + KG (kg.redb)
30 MCP tools fastembed (AllMiniLML6V2Q)
trusty-memory-mcp-bridge (separate binary, PR #149)
Claude Code stdio ◄──pipe──► trusty-memory UDS
The memory-core feature of trusty-common owns the storage engine: usearch for approximate
nearest-neighbor search, redb for drawer metadata and knowledge-graph triples, and
fastembed for 384-dim text embeddings. The MCP server (trusty-memory) is a
thin protocol layer on top.
The embedded Svelte UI is compiled at build time and served via rust-embed —
no separate web server or Node.js installation is needed at runtime.
Feature Flags
| Flag | Default | Description |
|---|---|---|
axum-server |
enabled | Compiles the HTTP server, SSE endpoint, and axum-based REST API. Disable with default-features = false when embedding only the in-process MCP tools (e.g. from open-mpm). |
# Full daemon build — no change needed (axum-server is on by default)
= { = true }
# rlib consumer — omit the HTTP stack
= { = true, = false }
Migration
From kuzu-memory (MCP config, issue #278)
If you're switching from the legacy kuzu-memory Python MCP server, rewrite
every Claude mcpServers config entry in one command:
# Dry-run: see what would change
# Apply: rewrite all Claude settings files atomically
Knowledge-graph hygiene (issue #278)
Auto-KG extraction skips drawers tagged cross-project-qa, test, or
fixture so synthetic content never pollutes the graph. Drawers deleted via
memory_forget cascade-delete their derived triples automatically.
The REST endpoint DELETE /api/v1/palaces/{id}/kg/triples/{triple_id} lets
you surgically remove a single active triple; triple_id is the base64url
encoding of subject + "\0" + predicate.
From kuzu-memory data (issue #277)
Import entities and relations from a kuzu-memory store.redb into a
trusty-memory palace:
# Dry-run: see what would be imported
# Apply the import (idempotent — safe to re-run)
# Cap at 100 entities
Each kuzu-memory entity becomes one drawer; each relation becomes one KG triple. Re-running is idempotent: entity IDs are SHA-256-derived UUIDs so duplicate imports produce the same drawer ID and are silently skipped.
Development
# Build and run (background daemon, dynamic port)
# Run inline on a specific address (foreground, useful for debuggers)
# Tests
# Storage-engine tests live in trusty-common behind the memory-core feature:
# Check only (faster)
License
Licensed under the MIT License.