About Pathfinder
Pathfinder is an MCP (Model Context Protocol) server written in Rust that gives AI coding agents the same capabilities a human developer gets from an IDE — but without a GUI.
Instead of treating source code as flat text, Pathfinder understands your code structurally through Tree-sitter AST parsing and semantically through Language Server Protocol (LSP) integration. This means AI agents can navigate, search, edit, and validate code at the symbol level — functions, classes, methods — rather than fragile line-by-line string matching.
Why Pathfinder?
Traditional AI coding workflows suffer from:
- Fragile text edits — line-based search-and-replace breaks when code shifts.
- No compile-time feedback — agents write code blindly, with no way to know if edits introduce errors until tests run.
- Limited code understanding — flat text search misses semantic structure (e.g., a search hit in a comment vs. actual code).
Pathfinder solves these problems by providing:
- 🌳 AST-Aware Operations — navigate and edit at the symbol level using semantic paths (e.g.,
src/auth.ts::AuthService.login). - ✅ LSP Validation — every edit is validated against a real language server before being written to disk, catching type errors and compilation failures before they happen.
- 🔒 Optimistic Concurrency Control (OCC) — SHA-256 version hashes prevent conflicting writes and stale-data overwrites.
- 🔍 Semantic Search — filter search results by AST context (code-only, comments-only, or all) powered by ripgrep + Tree-sitter.
- 🛡️ Sandbox Security — a 3-tier file access model prevents path traversal attacks and unauthorized file access.
- 📊 Built-in Observability — per-engine telemetry (
ripgrep_ms,tree_sitter_parse_ms,lsp_ms) and optional--lsp-tracefor raw JSON-RPC debugging.
Key Features
- 🛠️ 18 MCP Tools — covering code navigation, semantic editing, file operations, search, and impact analysis.
- 🌐 7 Languages — native Tree-sitter support for Go, TypeScript, TSX, JavaScript, Python, Rust, and Vue SFCs.
- 🏗️ 5 Rust Crates — modular workspace architecture for clean separation of concerns.
- ⚡ Zero Configuration — auto-detects languages and LSP servers in your workspace.
- 🧪 Shadow Editor — a validation pipeline that catches introduced errors by diffing LSP diagnostics before and after each edit.
Getting Started
Prerequisites
- Rust toolchain (1.75+ recommended) — Install via rustup
- An MCP-compatible AI client — such as Antigravity, Claude Desktop, Cursor, or any tool supporting MCP stdio transport.
- (Optional) Language servers — for LSP validation support (e.g.,
goplsfor Go,typescript-language-serverfor TS/JS,rust-analyzerfor Rust,pyrightfor Python).
Installation
Note: Pre-built binaries are planned for future releases. For now, you need to build from source.
Build from source:
# Clone the repository
# Build in release mode
# The binary will be at target/release/pathfinder
Verify the installation:
Configuration
MCP Client Configuration
Add Pathfinder to your MCP client's server configuration. The exact format depends on your client.
Example (JSON config for most MCP clients):
With LSP trace enabled (for debugging):
CLI Usage
pathfinder [OPTIONS] <WORKSPACE_PATH>
Arguments:
<WORKSPACE_PATH> Path to the workspace root directory
Options:
--lsp-trace Enable raw LSP JSON-RPC tracing to stderr (DEBUG level)
-h, --help Print help
-V, --version Print version
Pathfinder communicates over stdio using the MCP protocol. Logs are emitted as structured JSON to stderr (since stdout is reserved for MCP transport).
Agent Directives
Pathfinder ships with a set of agent directives — pre-written rules and skills that teach your AI agent how to use Pathfinder tools correctly, reliably, and efficiently. Without these, the agent falls back to generic file-editing behaviour and misses most of Pathfinder's value.
Why this matters: An AI agent that doesn't know about semantic paths, OCC version hashes, or the difference between
read_source_fileandread_filewill make avoidable mistakes — calling the wrong tool, constructing malformed paths, or ignoring LSP validation feedback. The directives encode all of this knowledge directly into the agent's system context.
What's Included
The directives live in docs/agent_directives/ and mirror the rules and skills used during Pathfinder's own development:
docs/agent_directives/
├── rules/
│ └── pathfinder-tool-routing.md # Always-on routing rule: which Pathfinder tool to use for each action
└── skills/
└── pathfinder-workflow/
└── SKILL.md # On-demand skill: concrete workflows, OCC patterns, error recovery
rules/pathfinder-tool-routing.md — an always-on rule injected into every agent turn. It tells the agent:
- To prefer Pathfinder's semantic tools over built-in text tools whenever possible
- How to form correct semantic paths (e.g.,
src/auth.ts::AuthService.login) - Which tool to reach for each action (reading, editing, searching, creating)
- When to fall back gracefully if Pathfinder is unavailable
skills/pathfinder-workflow/SKILL.md — a detailed on-demand skill the agent activates when it needs deeper guidance. It covers:
- Step-by-step workflows for exploring, refactoring, implementing, auditing, and debugging
- OCC (Optimistic Concurrency Control) version hash chaining across sequential edits
- Multi-file and same-file batch edit patterns
- Vue SFC text targeting for
<template>/<style>zones - Efficient search with
filter_mode,exclude_glob,known_files,group_by_file, andis_regex ignore_validation_failuresguidance for flaky or unavailable LSP environments- Error recovery patterns for
SYMBOL_NOT_FOUND,VERSION_MISMATCH,Validation Skipped,TEXT_NOT_FOUND, and LSP validation failures
Setup by Client
Antigravity
Copy the directives into your project's .agents/ directory. Antigravity auto-discovers all rules and skills placed there:
# From your project root (not the Pathfinder repo)
The routing rule runs on every agent turn automatically (trigger: always_on). The workflow skill is activated on demand when the agent needs detailed guidance.
Claude Desktop / Cursor / Other MCP Clients
For clients that support system prompt injection or custom instructions, paste the content of rules/pathfinder-tool-routing.md into your system prompt or custom instructions field. Then reference skills/pathfinder-workflow/SKILL.md as additional context or attach it as a project document.
For clients that support agent rule files (e.g., .cursorrules, .clinerules), you can drop the routing rule content directly into those files.
General Approach
For any MCP-compatible client, the minimum effective setup is to inject the tool-routing rule into the agent's persistent context. This single file prevents the most common mistakes. The workflow skill is optional but significantly improves the quality of complex multi-step tasks.
Tools
Pathfinder exposes 18 tools organized into three categories. Every tool operates within the workspace sandbox and returns structured JSON responses.
🔍 Search & Navigation
| Tool | Description |
|---|---|
search_codebase |
Search for text patterns with AST-aware filtering. Set filter_mode to code_only (default), comments_only, or all. Use is_regex=true for multi-pattern searches (e.g., unwrap\(\)|expect\(). Token-efficiency parameters: known_files (suppress content for already-read files), group_by_file, exclude_glob. Returns matching lines with context and enclosing_semantic_path + version_hash per match. |
get_repo_map |
Generate a structural skeleton of the project — an indented tree of classes, functions, and type signatures with semantic path annotations. Token-budgeted for LLM context windows. Supports changed_since (git ref/duration), include_extensions, exclude_extensions, and include_imports (none/third_party/all) for focused exploration. Returns version_hashes per file and capabilities.lsp.per_language for upfront LSP status. |
read_symbol_scope |
Extract the exact source code of a single symbol (function, class, method) by its semantic path. Returns code, line range, and version hash. |
read_source_file |
Read an entire source file and extract its complete AST symbol hierarchy. Supports three detail levels: compact (default — source + flat symbol list), symbols (symbol tree only, no source), full (source + complete nested AST). Use start_line/end_line to restrict output to a region of interest. AST-only — only call on source files (.rs, .ts, .tsx, .go, .py, .vue, .jsx, .js); use read_file for config/docs files. |
read_with_deep_context |
Read a symbol's source code plus the signatures of all functions it calls. Ideal for understanding dependencies before editing. |
get_definition |
Jump to where a symbol is defined. Provide a semantic path to a reference and get the definition's file, line, and a code preview. |
analyze_impact |
Find all callers of a symbol (incoming) and all symbols it calls (outgoing). Essential for understanding the blast radius before refactoring. Returns version_hashes for all referenced files — use these directly as base_version for edits without a separate read step. |
✏️ AST-Aware Editing
All edit tools use the Shadow Editor validation pipeline — edits are validated against the LSP before being written to disk. Every edit requires a base_version (SHA-256 hash) for optimistic concurrency control.
| Tool | Description |
|---|---|
replace_body |
Replace the internal logic of a block-scoped construct (function, method, class body), keeping the signature intact. |
replace_full |
Replace an entire declaration including its signature, body, decorators, and doc comments. |
replace_batch |
Apply multiple edits atomically within a single file — back-to-front to avoid offset shifting, single OCC guard. Supports two targeting modes: Option A (semantic — semantic_path + edit_type) for code symbols, and Option B (text — old_text + context_line + replacement_text) for Vue <template>/<style> zones or any non-symbol region. Both modes can be mixed in one call. |
insert_before |
Insert new code before a target symbol. Use a bare file path (without ::) to insert at the top of a file. |
insert_after |
Insert new code after a target symbol. Use a bare file path (without ::) to append to the bottom of a file. |
delete_symbol |
Delete a symbol and all its associated decorators, attributes, and doc comments. |
validate_only |
Dry-run an edit without writing to disk. Pre-check risky changes and get the same validation results as a real edit. |
📁 File Operations
| Tool | Description |
|---|---|
read_file |
Read raw file content with pagination (start_line, max_lines). Best for configuration files (YAML, TOML, Dockerfile). For source code, prefer read_symbol_scope. |
write_file |
Write to configuration files. Supports full replacement or surgical search-and-replace via a replacements array. Not for source code — use the AST-aware edit tools instead. |
create_file |
Create a new file with initial content. Parent directories are created automatically. |
delete_file |
Delete a file. Requires base_version (OCC) to prevent deleting a file modified after you last read it. |
Architecture
Pathfinder is structured as a Rust workspace with 5 crates, each with a clear responsibility:
pathfinder/
├── crates/
│ ├── pathfinder/ # MCP server, CLI, tool routing
│ │ └── src/
│ │ ├── main.rs # CLI entry point (clap)
│ │ └── server/
│ │ ├── server.rs # MCP tool router
│ │ ├── types.rs # Parameter & response types
│ │ ├── helpers.rs # Shared utilities
│ │ └── tools/ # One module per tool category
│ │ ├── search.rs
│ │ ├── edit.rs
│ │ ├── navigation.rs
│ │ ├── file_ops.rs
│ │ ├── repo_map.rs
│ │ ├── source_file.rs
│ │ ├── symbols.rs
│ │ └── diagnostics.rs
│ │
│ ├── pathfinder-common/ # Shared types, errors, config, sandbox
│ ├── pathfinder-treesitter/ # The Surgeon — AST parsing & symbol extraction
│ ├── pathfinder-search/ # The Scout — ripgrep-powered code search
│ └── pathfinder-lsp/ # The Lawyer — LSP client & lifecycle management
│
├── docs/
│ ├── requirements/ # PRD and specifications
│ ├── research_logs/ # Design decisions and research
│ └── audits/ # Code audit findings
│
├── Cargo.toml # Workspace manifest
├── LICENSE # MIT License
└── README.md
The Three Engines
Pathfinder internally delegates work to three specialized engines, each abstracted behind a trait for testability:
| Engine | Crate | Trait | Responsibility |
|---|---|---|---|
| The Surgeon | pathfinder-treesitter |
Surgeon |
AST parsing, symbol extraction, semantic path resolution, repo map generation |
| The Scout | pathfinder-search |
Scout |
Ripgrep-powered full-text search with Tree-sitter enrichment for AST-aware filtering |
| The Lawyer | pathfinder-lsp |
Lawyer |
LSP process lifecycle, edit validation (Shadow Editor), go-to-definition |
Each engine can be mocked independently for unit testing, and the server gracefully degrades when an engine is unavailable (e.g., falls back to Tree-sitter heuristics when no LSP is running).
Core Concepts
Semantic Paths
Pathfinder identifies code symbols using semantic paths — a human-readable notation that mirrors how developers think about code structure:
src/auth.ts::AuthService.login # Method
src/utils/math.go::CalculateDiscount # Function
lib/models.py::User # Class
Format: <relative_file_path>::<Symbol>[.<Method>]
Optimistic Concurrency Control (OCC)
Every file read returns a version_hash (SHA-256 digest of the file content). Edit and delete operations require this hash as base_version — if the file has changed since you last read it, the operation is rejected. This prevents conflicting writes in multi-agent environments.
The Shadow Editor
For AST-aware edits, Pathfinder runs a "validation sandwich":
didOpen— notify LSP of original contentpull_diagnostics— capture baseline errorsdidChange— notify LSP of proposed editpull_diagnostics— capture post-edit errors- Revert — restore LSP state
- Diff — compare pre/post errors using a multiset algorithm that's resilient to line shifts
If new errors are introduced, the edit fails by default (overridable with ignore_validation_failures).
Supported Languages
Tree-sitter Support (Built-in, Zero Configuration)
Tree-sitter grammars are compiled directly into the Pathfinder binary — no external tools needed. All symbol extraction, semantic path resolution, and AST-aware filtering work out of the box.
| Language | Extension(s) | Notes |
|---|---|---|
| Go | .go |
Function, interface, struct, and type alias extraction |
| TypeScript | .ts |
Class, function, arrow function, interface, and type extraction |
| TSX | .tsx |
All TypeScript symbols plus JSX element extraction as child symbols |
| JavaScript | .js, .jsx |
Functions, classes, and JSX elements in .jsx files |
| Python | .py |
Function, class, and method extraction |
| Rust | .rs |
Functions, structs, enums, traits; impl block methods merged under their parent type |
| Vue SFC | .vue |
Multi-zone: <script> parsed as TypeScript (AST-aware), <template> and <style> accessible via text targeting in replace_batch |
LSP Support (Optional, Auto-detected)
Pathfinder automatically detects which language servers are available in your workspace by scanning for marker files (Cargo.toml, go.mod, tsconfig.json, etc.). LSP processes start lazily on first use and are shut down after an idle timeout.
To maximise validation coverage, install the language server(s) for your project:
| Language | LSP Server | Install Command | Auto-detect Marker |
|---|---|---|---|
| Rust | rust-analyzer |
rustup component add rust-analyzer |
Cargo.toml at workspace root |
| Go | gopls |
go install golang.org/x/tools/gopls@latest |
go.mod (scans up to depth 2) |
| TypeScript / JavaScript / JSX / TSX / Vue | typescript-language-server |
npm install -g typescript-language-server typescript |
tsconfig.json or package.json (depth 2) |
| Python | pyright |
npm install -g pyright |
pyproject.toml, setup.py, or requirements.txt (depth 2) |
Vue note: Pathfinder handles Vue SFC parsing internally with Tree-sitter. The
typescript-language-servervalidates the<script>block — no separatevolarorvue-language-serverinstallation is required.
LSP validation status: Every edit response includes a
validationfield. Ifvalidation_skipped: true, inspectvalidation_skipped_reason— possible values areno_lsp(no server detected),lsp_not_on_path(binary missing),lsp_start_failed,lsp_crash,lsp_timeout, andpull_diagnostics_unsupported. Callget_repo_mapupfront to see which languages have active LSP validation viacapabilities.lsp.per_language.
Observability
Pathfinder emits structured JSON logs to stderr with per-engine timing breakdowns:
Enable --lsp-trace for full JSON-RPC request/response logging at DEBUG level — useful for diagnosing LSP communication issues.
Security
Pathfinder implements a 3-tier sandbox model:
| Tier | Type | What It Blocks |
|---|---|---|
| Tier 1 | Hardcoded Deny (cannot be overridden) | .git/objects/, .git/HEAD, *.pem, *.key, *.pfx — security-critical paths |
| Tier 2 | Default Deny (overridable via config) | .env, node_modules/, vendor/, dist/, build/, __pycache__/ |
| Tier 3 | User-Defined | Patterns in .pathfinderignore (gitignore syntax) |
- All file paths are canonicalized and validated before any I/O operation.
- Path traversal attacks (e.g.,
../../etc/passwd) are rejected at Tier 1. - The
WorkspaceRoottype enforces that only valid, existing directories are accepted as workspace roots. - Tier 2 patterns can be selectively overridden via
SandboxConfig.allow_override; additional deny patterns can be added viaSandboxConfig.additional_deny.
Roadmap
- Core MCP server with stdio transport
- Tree-sitter-powered AST parsing (Go, TypeScript, TSX, JSX, JavaScript, Python, Rust, Vue SFC)
- Vue Single-File Component multi-zone parsing (
<script>,<template>,<style>) - JSX/TSX element extraction as addressable child symbols
- Ripgrep search with AST-aware filtering (
code_only,comments_only,all) - Search intelligence:
known_files,group_by_file,exclude_glob(E4) - Full suite of AST-aware edit tools with OCC
- Hybrid batch edits — semantic + text targeting in a single atomic call (E3.1)
- LSP integration with Shadow Editor validation
- LSP lifecycle management (auto-start, crash recovery, idle termination)
- Granular LSP skip reasons for actionable agent recovery (v5.1)
- Proactive capability reporting via
get_repo_map(capabilities.lsp.per_language) - 3-tier sandbox security model
- Per-engine observability and telemetry
-
get_repo_maptemporal filtering (changed_since) and extension filters (E6) -
read_source_filewith compact/symbols/full detail modes and line range filtering (E2) - Pre-built binaries for easy installation
- Additional language support (Java, C/C++, C#, etc.)
- Custom LSP server command overrides via configuration file
Contributing
Contributions are welcome! Pathfinder follows strict engineering practices:
- Fork the repository
- Create your feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'feat: add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
Development
# Run tests
# Run with clippy (pedantic + deny unwrap)
# Format
The workspace enforces
clippy::pedantic,deny(unwrap_used), anddeny(unsafe_code).
License
Distributed under the MIT License. See the LICENSE file for details.