ast-outline
Fast, AST-based code-navigation toolkit for source files — read the shape of a file (signatures with line numbers, no method bodies), the true public API of a package, the dependency graph between files, and search the repo by symbol or behaviour. Twelve subcommands, one binary, built for LLM coding agents and humans who'd rather not waste tokens reading every file just to understand a codebase.
ast-outline is written in Rust, leveraging the incredibly fast ast-grep bindings for tree-sitter, and uses rayon to parse your entire workspace concurrently in milliseconds. The show/digest/implements commands were inspired by dim-s/code-outline. The Rust code itself originated from a larger code-agent project and is not a direct port.
Purpose
ast-outline exists to make LLM coding agents faster, cheaper, and smarter
when navigating unfamiliar code.
Modern agentic coding tools explore codebases by reading files directly. That's reliable but has a massive cost: on a 1000-line file, the agent pays for 1000 lines of tokens just to answer "what methods exist here?" — and reading is only one of several questions an agent has. "Who imports this?" "What's the public API?" "Are there cycles?" "Where in the repo is the login flow?" — each one historically required dozens of file reads or noisy greps.
ast-outline collapses each of those questions into a single command:
- Shape over bytes.
map/digest/showgive you signatures and line ranges instead of method bodies — typically a 95% token saving vs reading the file.implementsfinds subclasses with AST accuracy, nogrepfalse positives. - Published API in one call.
surfaceresolvespub usere-exports (Rust),__all__(Python), barrel files (TypeScript),exportclauses (Scala) so you see the surface a downstream user actually sees — not the union of every public item per file. - Dependency graph for free.
deps/reverse-deps/cycles/graphbuild a file-level import graph (Rust, Python, TS/JS, Java, C#, Kotlin, Scala, Go) cached at.ast-outline/deps/. Usereverse-depsbefore refactoring to know the blast radius.cyclesexits non-zero — wire it into a CI gate.graphemits the full dependency graph (text by default,--jsonfor JSON). - Hybrid semantic search.
searchruns BM25 + dense embeddings viapotion-code-16M(a static, no-inference model — ~64 MB, runs on CPU in microseconds).find-relatedreturns chunks structurally similar to one you already have, with a dep-graph-aware boost when a graph cache exists. - Twelve native MCP tools. Every CLI command is also exposed as an MCP tool —
ast-outline install --mcp <agent>wires it into Claude Code, Cursor, Gemini, Codex, or VS Code Copilot in one line.
The workflow
Before ast-outline:
Agent: Read Player.cs # 1200 lines of tokens
Agent: Read Enemy.cs # 800 lines of tokens
Agent: Read DamageSystem.cs # 400 lines of tokens
Agent: grep "IDamageable" src/ # noisy, lots of false matches
...
With ast-outline:
Agent: ast-outline surface . # one-page true public API of the crate/package
Agent: ast-outline digest src/Combat # ~100 lines, whole module's structure
Agent: ast-outline implements IDamageable # precise list, no grep noise
Agent: ast-outline search "damage handling" # hybrid BM25 + dense semantic, ranked
Agent: ast-outline show Player.cs TakeDamage # just the method body
Agent: ast-outline reverse-deps Player.cs # who imports this — blast radius before refactor
Agent: ast-outline cycles src/ # find import cycles via Tarjan SCC
Result: same understanding, a fraction of the tokens, a fraction of the round-trips.
For "what does this package actually expose?" — historically the most expensive question, since the answer was "read every file" — surface resolves the re-export graph and gives you the answer directly, often replacing dozens of file reads with a single call. For "what would break if I change this file?" — reverse-deps gives you the impact set in one call instead of agent-grepping for usages.
Supported languages
| Language | Extensions |
|---|---|
| Rust | .rs |
| C# | .cs |
| C++ | .cpp, .cc, .cxx, .hpp, .hh |
| Python | .py, .pyi |
| TypeScript | .ts, .tsx |
| JavaScript | .js, .jsx, .mjs, .cjs |
| Java | .java |
| Kotlin | .kt, .kts |
| Scala | .scala, .sc |
| Go | .go |
| PHP | .php |
| Ruby | .rb |
| SQL | .sql, .ddl, .dml |
| Markdown | .md, .markdown, .mdx, .mdown |
More coming soon! Adding another language is a single new adapter file leveraging the massive ast-grep language ecosystem.
What gets walked
ast-outline skips a lot of files when walking a directory — by design. Filters apply uniformly across every subcommand.
-
.gitignoreand friends — every level's.gitignore, your global gitignore,.git/info/exclude, and.ignorefiles (theignorecrate's convention used byripgrep/fd). -
Hardcoded denylist — directories almost no one wants walked, even if
.gitignoredoesn't list them:.git,node_modules,target,dist,build,__pycache__,.venv,venv,.cache,.idea,.vscode,.next,.nuxt,.turbo,.parcel-cache,.gradle,.tox,.mypy_cache,.pytest_cache,.ruff_cache,.eggs,.ast-outline, and a few others. -
.ast-outline-ignore— per-repo escape hatch. Same syntax as.gitignore. Useful for excluding paths fromast-outlinethat you don't want excluded from git itself, e.g. test fixtures or vendored corpora:# .ast-outline-ignore tests/fixtures/large_corpus/ benches/data/ *.generated.rs -
Extension allowlist — files are only opened if their extension is one ast-outline knows how to parse (the table above for map/digest/show/implements; a broader set for the search commands).
Want to see exactly what ast-outline walks? Compare ast-outline digest some/dir with rg --files some/dir — anything in rg but not the digest is being filtered by one of the layers above.
Install
Homebrew (macOS)
Cargo
This installs the ast-outline CLI globally into ~/.cargo/bin — make sure that's on your PATH.
Nix
You can run ast-outline directly with Nix without installing:
Or add it as a dependency in your Nix flake:
{
inputs = {
nixpkgs.url = "github:NixOS/nixpkgs/nixpkgs-unstable";
ast-outline.url = "github:aeroxy/ast-outline";
};
outputs = { self, nixpkgs, ast-outline }:
let
system = "x86_64-linux";
pkgs = nixpkgs.legacyPackages.${system};
in {
devShells.${system}.default = pkgs.mkShell {
buildInputs = [ ast-outline.packages.${system}.default ];
};
};
}
Quick start
# Map the structure of one file
# Map a whole directory (recurses supported extensions in parallel)
# Print the exact source of one specific method
# Compact public-API map of a whole module
# True public surface (resolves `pub use` / `__all__`, not every `pub` item)
# Every class that inherits/implements a given type
# Dependency graph: forward, reverse, cycles, full
# Hybrid BM25 + dense semantic search (builds an index on first call)
# Find code semantically similar to a given file:line
# Build / refresh / inspect the per-repo search index
# Output a prompt snippet to steer LLM agents
# Machine-readable JSON (stable schema, great for tooling)
Using with LLM coding agents
This is the main use case. The fastest path is ast-outline install,
which writes the agent prompt snippet (and, where supported, a real
Read-interceptor hook) into your coding agent's config.
# Install into every supported CLI it can detect on your system.
# Or pick a single target.
# See exactly what would change before writing.
# Per-repo install (default is global).
# Remove everything we wrote.
# Quick visibility.
Supported targets: claude-code, gemini, tabnine, cursor,
aider, codex, copilot. Claude Code, Gemini, and Tabnine also get
a tool-call hook that intercepts Read on supported source files when
they exceed --min-lines (default 200) and substitutes the map output.
The other targets receive the prompt only.
Claude Code subagent shadowing
Claude Code has isolated subagents (Explore, Plan, general-purpose) that run in
their own context and cannot see the main CLAUDE.md. ast-outline install
automatically shadows these subagents with .claude/agents/<Name>.md files
containing the full ast-outline prompt.
When you run ast-outline install --target claude-code, you get:
CLAUDE.md— main agent prompt (global or local per-repo).claude/settings.json—Readtool hook.claude/agents/Explore.md— Explore subagent with the prompt injected
This solves the "why doesn't my subagent use ast-outline?" problem — subagents
now get the prompt automatically. Legacy manual ~/.claude/agents/Explore.md files
are wrapped in marker blocks in-place (non-breaking).
Skills for manual installation
A skills/ folder is included in the repo for users who prefer manual setup:
# Clone or download the repo
# Copy the skill to your user skills directory
# Then manually invoke from Claude Code
This works alongside ast-outline install — the skill definition tells Claude Code
how to invoke the ast-outline CLI with proper tool schemas and documentation.
Manual install via ast-outline prompt (e.g. project-level):
|
Works with
- Claude Code (+ custom subagents like
Explore,codebase-scout) - Cursor agent mode
- Aider
- Copilot Chat / Workspace
- Any custom agent on the Claude / OpenAI / Gemini APIs
- Humans (the colored terminal format is highly readable;
showis a nice alternative togrep -A 20)
Output format
The format is designed to be LLM-friendly: Python-style indentation,
line-number suffixes in L<start>-<end> form, doc-comments preserved.
The header summarises scale and flags partial parses.
When you run it yourself, you'll see a gorgeous ANSI-colored output. Don't worry, the terminal colors are automatically stripped when piped to a file or consumed by an agent's shell hook!
Rust
# src/core.rs (490 lines, 3 types, 12 methods, 5 fields)
pub struct Declaration L10-120
pub kind: DeclarationKind L12
pub name: String L15
pub fn lines_suffix(&self) -> String L30-48
show with ancestor context
ast-outline show <file> <Symbol> prints a # in: ... breadcrumb
between the header and the body so you know what the extracted code is
nested inside, without a second map call:
# Player.cs:30-48 Game.Player.PlayerController.TakeDamage (method)
# in: namespace Game.Player → public class PlayerController : MonoBehaviour, IDamageable
/// <summary>Apply damage.</summary>
public void TakeDamage(int amount) { ... }
JSON output
Add --json to any command to get the full symbol graph as stable,
structured JSON instead of formatted text — ideal for editors, language
servers, CI tooling, or any script that needs to consume the data
programmatically.
Every JSON document includes a schema field that is bumped on breaking
changes, so downstream tooling can guard on it:
| Schema | Command |
|---|---|
ast-outline.outline.v1 |
default outline, digest --json |
ast-outline.show.v1 |
show --json |
ast-outline.implements.v1 |
implements --json |
ast-outline.surface.v1 |
surface --json |
ast-outline.deps.v1 |
deps --json |
ast-outline.reverse-deps.v1 |
reverse-deps --json |
ast-outline.cycles.v1 |
cycles --json |
ast-outline.graph.v1 |
graph --json |
ast-outline.search.v1 |
search --json |
ast-outline.related.v1 |
find-related --json |
ast-outline.index-stats.v1 |
index --stats --json |
MCP server
Run ast-outline as a Model Context Protocol
server over stdio so any MCP-aware coding agent can call the same operations
as native tools — no shell parsing required:
The server speaks line-delimited JSON-RPC 2.0 on stdin/stdout and exposes twelve tools that map 1:1 to the CLI commands:
| Tool | Equivalent CLI | Returns |
|---|---|---|
map |
ast-outline map <paths> |
text, or ast-outline.map.v1 with json: true |
digest |
ast-outline digest <paths> |
text, or ast-outline.map.v1 with json: true |
show |
ast-outline show <path> <syms> |
text, or ast-outline.show.v1 with json: true |
implements |
ast-outline implements <type> <paths> |
text, or ast-outline.implements.v1 with json: true |
surface |
ast-outline surface [path] |
text, or ast-outline.surface.v1 with json: true |
deps |
ast-outline deps <file> |
text, or ast-outline.deps.v1 with json: true |
reverse_deps |
ast-outline reverse-deps <file> |
text, or ast-outline.reverse-deps.v1 with json: true |
cycles |
ast-outline cycles [path] |
text, or ast-outline.cycles.v1 with json: true |
graph |
ast-outline graph [path] |
text by default; json: true for ast-outline.graph.v1 |
search |
ast-outline search "<query>" |
text, or ast-outline.search.v1 with json: true |
find_related |
ast-outline find-related <file>:<line> |
text, or ast-outline.related.v1 with json: true |
index |
ast-outline index |
text, or ast-outline.index-stats.v1 with json: true |
Wire it into a client by pointing at the binary:
{
"mcpServers": {
"ast-outline": { "command": "ast-outline", "args": ["mcp"] }
}
}
The server is fully synchronous, has no extra runtime dependencies, and adds
roughly 1% to the binary size. The CLI itself is unaffected — none of the MCP
code runs unless you invoke the mcp subcommand.
Semantic search
ast-outline search runs hybrid retrieval over a per-repo index:
- BM25 for exact identifier matches and keyword density.
- Dense embeddings via
minishlab/potion-code-16M— a static (no inference)vocab × 256model that runs on CPU in microseconds. - Reciprocal Rank Fusion (k = 60) blends the two; alpha auto-resolves to 0.3 for symbol queries (
HandlerStack,Sinatra::Base— lean BM25) and 0.5 for natural language ("how does login work" — balanced). - A ranking pass adds definition boosts (3× for chunks that define a queried symbol), file-coherence boosts (multi-chunk hits in the same file lift the top chunk), file-stem matches for NL queries, and path-based penalties (test files 0.3×,
.d.tsstubs 0.7×,__init__.py0.5×).
ast-outline find-related <file>:<line> is the same engine in semantic-only mode, language-filtered, with the source chunk excluded — useful for "what else is structured like this?"
How indexing works
First call to search / find-related builds an index at .ast-outline/index/:
.ast-outline/
.gitignore # auto-written, contents: "*"
index/
meta.json # schema + model + chunk_count
chunks.bin # per-chunk content + line range + language
embeddings.f32 # chunk_count × 256 little-endian f32, mmap-friendly
bm25.bin # vocab + idf + postings
files.bin # per-file mtime + xxhash + chunk range
lock # advisory lock for concurrent writers
Subsequent calls walk the tree, compare (mtime, size) against files.bin, and only hash files where the cheap check fails. If anything changed, the index rebuilds automatically (a v2 will support partial updates against the same on-disk format). Steady-state cost on an unchanged 10k-file repo: ~30 ms of stat syscalls.
The model is downloaded once (~64 MB) on first use to ~/.cache/ast-outline/models/. It tries HuggingFace first, falls back to hf-mirror.com if blocked. TLS verification is disabled by default so corporate MITM proxies don't break setup; integrity is enforced via SHA-256 on every cached file. Set AST_OUTLINE_TLS_STRICT=1 to enforce strict TLS.
For more on what gets indexed (the five filter layers, .ast-outline-ignore syntax) see the "What gets walked" section above. For the security trade-offs around the TLS default, see the network-security wiki page on GitHub.
find-related quietly benefits from the dep graph too — when one is cached, results are reranked so files within depth 2 of the source (importer or importee) get a multiplicative boost. Disable with --no-dep-boost.
Dependency graph
ast-outline deps, reverse-deps, cycles, and graph build a file-level import graph for the project and answer different questions on it:
All four commands share one dep graph cached at .ast-outline/deps/graph.bin. First call builds it (~hundreds of ms for typical repos via the same ignore-respecting walk used by search); subsequent calls reuse it via mtime-based delta detection, with --rebuild to force a fresh build.
Resolution is per-language but shares one suffix-index resolver:
- Rust:
use crate::*/use super::*/mod foo;(with#[path]attribute support). - Python: relative imports (
from .x import y),__init__.pypackages, bareimport a.b. - TypeScript / JavaScript: relative paths with extension probing (
.ts → .tsx → .mts → .cts → .d.ts → .js → ... → .json),index.*fallback,tsconfig.jsonpathsaliases. - Java / Kotlin / Scala / C#: FQN suffix index built from each file's
package/namespacedeclaration. Inner classes resolve via strip-and-retry. - Go:
go.modmoduleprefix is stripped;import "mymod/pkg/foo"resolves topkg/foo/*.go(directory-as-package).
The four commands are also exposed as MCP tools for agents.
Architecture & Development
See the wiki on GitHub for details on how ast-outline leverages ast-grep internally and how you can add new language adapters.
Getting started
# With Cargo
# With Nix flake
Contributions welcome.