Standalone guarantee: AgenticCodebase is independently installable and operable. Integration with AgenticMemory/AgenticVision is optional, never required.
Autonomic defaults (phase 4): profile-driven operations (
ACB_AUTONOMIC_PROFILE=desktop|cloud|aggressive) now control backup/maintenance posture, while compile keeps rolling output backups and collective cache self-maintenance active by default. Storage migration is policy-gated (ACB_STORAGE_MIGRATION_POLICY=auto-safe|strict|off) with checkpointed auto-safe behavior. SLA-aware collective maintenance throttling is enabled (ACB_SLA_MAX_REGISTRY_OPS_PER_MIN) plus health-ledger snapshots (ACB_HEALTH_LEDGER_DIRorAGENTRA_HEALTH_LEDGER_DIR) for CLI and collective surfaces. Default ledger location is~/.agentra/health-ledger. Optional tuning:ACB_AUTO_BACKUP,ACB_AUTO_BACKUP_RETENTION,ACB_AUTO_BACKUP_DIR,ACB_COLLECTIVE_CACHE_MAINTENANCE_SECS,ACB_HEALTH_LEDGER_EMIT_SECS.
AI agents can't understand code across sessions.
Your agent reads a file, analyzes one function, and forgets. Next session -- blank slate. It can't recall the architecture it mapped yesterday. It can't trace the impact chain from three conversations ago. It can't search its own understanding of your codebase.
RAG over source files doesn't work. You get "similar text," never "what breaks if I change this?". Embedding chunks loses all structure -- no call graphs, no dependency chains, no type relationships. Grep is fast but flat.
AgenticCodebase compiles your repository into a navigable concept graph stored in a single binary file. Not "search your source code." Your agent has a map -- functions, classes, modules, imports, call chains, type hierarchies -- all connected, all queryable in microseconds.
# Compile any repository (Python, Rust, TypeScript, Go)
# Query it
Four languages. Twenty-four query types. One file holds everything. Sub-microsecond lookups. Works with Claude Desktop, VS Code, Cursor, Windsurf, and any MCP-compatible client.
Benchmarks
Rust core. Tree-sitter parsing. Binary .acb format. Real numbers from cargo bench --release:
| Operation | 1K units | 10K units | Notes |
|---|---|---|---|
| Graph build | 388 us | 3.77 ms | Semantic analysis + edge resolution |
| Write .acb | 169 us | 2.29 ms | LZ4-compressed binary format |
| Read .acb | 473 us | 4.91 ms | Memory-mapped I/O |
| Query (10K graph) | Latency | Notes |
|---|---|---|
| Symbol lookup (exact) | 14.3 us | Hash-based, O(1) |
| Dependency graph (depth 5) | 925 ns | BFS traversal |
| Impact analysis | 1.46 us | With risk scoring |
| Call graph (depth 3) | 1.27 us | Bidirectional |
All benchmarks on Apple M4 Pro, macOS, Rust 1.90.0
--release. Criterion 0.5 with 100 iterations after warm-up.
Why AgenticCodebase?
| Approach | Finds symbols | Traces dependencies | Predicts impact | Persists across sessions | Sub-ms queries |
|---|---|---|---|---|---|
| grep / ripgrep | partial | no | no | no | yes |
| LSP / IDE | yes | partial | no | no | varies |
| RAG over source | partial | no | no | yes | no |
| AgenticCodebase | yes | yes | yes | yes | yes |
Quickstart
Install
Compile a codebase
# Parse and compile a repository into a .acb graph
# View graph metadata
# Query symbols
# Impact analysis -- what breaks if I change unit 42?
# Code prophecy -- what's likely to break next?
MCP Server
# Start the MCP server (stdio transport)
Configure in Claude Desktop (claude_desktop_config.json):
See the Full Install Guide for VS Code, Cursor, and Windsurf configuration.
The Query Engine
AgenticCodebase provides 24 query types across three tiers:
Core Queries (8)
| Query | CLI | Description |
|---|---|---|
| Symbol lookup | acb query ... symbol -n <name> |
Find code units by name (exact, prefix, contains) |
| Dependency graph | acb query ... deps -u <id> |
Forward dependencies with depth control |
| Reverse dependency | acb query ... rdeps -u <id> |
Who depends on this unit? |
| Call graph | acb query ... calls -u <id> |
Function call chains (callers + callees) |
| Similarity | acb query ... similar -u <id> |
Structurally similar code units |
| Type hierarchy | via library API | Inheritance and implementation chains |
| Containment | via library API | Module/class nesting relationships |
| Pattern match | via library API | Structural code pattern detection |
Built Queries (5)
| Query | CLI | Description |
|---|---|---|
| Impact analysis | acb query ... impact -u <id> |
Risk-scored change impact with test coverage |
| Coverage | via library API | Test coverage mapping |
| Trace | via library API | Execution path tracing |
| Path | via library API | Shortest path between two units |
| Reverse | via library API | Reverse call/dependency chains |
Novel Queries (11)
| Query | CLI | Description |
|---|---|---|
| Prophecy | acb query ... prophecy |
Predict which units will break next |
| Stability | acb query ... stability -u <id> |
Stability score with contributing factors |
| Coupling | acb query ... coupling |
Detect tightly coupled unit pairs |
| Collective | via library API | Cross-repository pattern extraction |
| Temporal | via library API | Git history evolution analysis |
| Dead code | via library API | Unreachable code detection |
| Concept | via library API | Abstract concept clustering |
| Migration | via library API | Language migration planning |
| Test gap | via library API | Missing test identification |
| Drift | via library API | Code drift detection over time |
| Hotspot | via library API | Change frequency hotspot analysis |
Architecture
AgenticCodebase models source code as a directed graph G = (U, E) where each vertex is a typed CodeUnit and each edge carries a semantic relationship.
Compilation Pipeline
Source Files (Py/Rust/TS/Go)
-> tree-sitter Parse
-> Semantic Analysis
-> Graph Builder
-> .acb Binary
Binary Format (.acb)
| Section | Size | Description |
|---|---|---|
| Header | 128 B | Magic, version, counts, offsets |
| Unit Table | 96N B | Fixed-size unit records (O(1) access) |
| Edge Table | 40M B | Fixed-size edge records |
| String Pool | Variable | LZ4-compressed names and paths |
| Feature Vectors | Variable | f32 embedding arrays |
Code Unit Types (13)
Function, Method, Class, Struct, Enum, Interface, Trait, Module, Import, Variable, Constant, TypeAlias, Macro
Edge Types (18)
Calls, CalledBy, Imports, ImportedBy, Contains, ContainedBy, Inherits, InheritedBy, Implements, ImplementedBy, Uses, UsedBy, Returns, Accepts, Overrides, OverriddenBy, Tests, TestedBy
Install
See the Full Installation Guide for all options including MCP server setup, library usage, and build-from-source instructions.
Integration with Agentic Ecosystem
AgenticCodebase is part of the Agentic ecosystem:
- AgenticMemory -- Persistent, navigable memory for AI agents
- AgenticVision -- Visual memory and image understanding for AI agents
- AgenticCodebase -- Semantic code understanding for AI agents
All three share the MCP protocol for seamless AI agent integration. Run all three servers together for an agent with memory, vision, and code understanding.
Implementation
- Language: Rust
- Source files: 58
- Lines of code: 13,709
- Tests: 432 (38 unit + 394 integration)
- Benchmarks: 21 Criterion benchmarks
- Clippy warnings: 0
- Supported languages: Python, Rust, TypeScript, Go
Contributing
See CONTRIBUTING.md for development setup and guidelines.
License
MIT -- see LICENSE for details.