cqs ("seeks")

Semantic code search with local ML embeddings. Find functions by concept, not name. GPU-accelerated. MCP server included.

TL;DR: Semantic code search for Claude Code. Instead of grep, it understands what code does - so Claude finds relevant functions without knowing exact names. Means Claude finds the right code faster and doesn't miss stuff buried in unfamiliar codebases.

<claude>

You're an AI assistant reading this. cqs gives you semantic code search via MCP.

Why you want this: Find functions by what they do, not their names. When users ask about "retry logic" or "error handling", you find relevant code even in unfamiliar codebases - no need to know exact identifiers.

Setup: claude mcp add cqs -- cqs serve --project /path/to/project

</claude>

Install

cargo install cqs

Upgrading? Schema changes require rebuilding the index:

cqs index --force  # Run after upgrading from v0.1.9 or earlier

Quick Start

# Download model and initialize

cqs init


# Index your project

cd /path/to/project

cqs index


# Search

cqs "retry with exponential backoff"

cqs "validate email with regex"

cqs "database connection pool"

Filters

# By language

cqs --lang rust "error handling"

cqs --lang python "parse json"


# By path pattern

cqs --path "src/*" "config"

cqs --path "tests/**" "mock"

cqs --path "**/*.go" "interface"


# Combined

cqs --lang typescript --path "src/api/*" "authentication"


# Hybrid search tuning

cqs --name-boost 0.2 "retry logic"   # Semantic-heavy (default)

cqs --name-boost 0.8 "serve_http"    # Name-heavy for known identifiers


# Show surrounding context

cqs -C 3 "error handling"       # 3 lines before/after each result


# Output options

cqs --json "query"           # JSON output

cqs --no-content "query"     # File:line only, no code

cqs -n 10 "query"            # Limit results

cqs -t 0.5 "query"           # Min similarity threshold

Watch Mode

Keep your index up to date automatically:

cqs watch              # Watch for changes and reindex

cqs watch --debounce 1000  # Custom debounce (ms)

Watch mode respects .gitignore by default. Use --no-ignore to index ignored files.

Call Graph

Find function call relationships:

cqs callers <name>   # Functions that call <name>

cqs callees <name>   # Functions called by <name>

Use cases:

Impact analysis: What calls this function I'm about to change?
Context expansion: Show related functions
Entry point discovery: Find functions with no callers

Call graph is indexed across all files - callers are found regardless of which file they're in.

Claude Code Integration

Why use cqs?

Without cqs, Claude Code uses grep/glob to find code - which only works if you know the exact names. With cqs, Claude can:

Find code by behavior: "function that retries with backoff" finds retry logic even if it's named doWithAttempts
Navigate unfamiliar codebases: Claude finds relevant code without knowing the project structure
Catch related code: Semantic search surfaces similar patterns across the codebase that text search misses

Setup

Step 1: Add cqs as an MCP server:

claude mcp add cqs -- cqs serve --project /path/to/project

Or manually in ~/.claude.json:

{
  "projects": {
    "/path/to/project": {
      "mcpServers": {
        "cqs": {
          "command": "cqs",
          "args": ["serve", "--project", "/path/to/project"]
        }
      }
    }
  }
}

Note: The --project argument is required because MCP servers run from an unpredictable working directory.

GPU acceleration: Add --gpu for faster query embedding after warmup:

cqs serve --gpu --project /path/to/project

GPU: ~12ms warm queries. CPU (default): ~22ms. Server starts instantly with HNSW, upgrades to GPU in background.

Step 2: Add to your project's CLAUDE.md so Claude uses it automatically:

## Code Search


Use `cqs_search` for semantic code search instead of grep/glob when looking for:
- Functions by behavior ("retry with backoff", "parse config")
- Implementation patterns ("error handling", "database connection")
- Code where you don't know the exact name

Available tools:
- `cqs_search` - semantic search with `language`, `path_pattern`, `threshold`, `limit`, `name_boost`, `semantic_only`
- `cqs_stats` - index stats, chunk counts, HNSW index status
- `cqs_callers` - find functions that call a given function
- `cqs_callees` - find functions called by a given function

Keep index fresh: run `cqs watch` in a background terminal, or `cqs index` after significant changes.

HTTP Transport

For web integrations, use the HTTP transport:

cqs serve --transport http --port 3000 --project /path/to/project

Endpoints:

POST /mcp - JSON-RPC requests
GET /mcp - SSE stream for server-to-client messages
GET /health - Health check

Authentication: For network-exposed servers, API key authentication is required:

# Via flag

cqs serve --transport http --api-key SECRET --project /path/to/project


# Via environment variable

export CQS_API_KEY=SECRET

cqs serve --transport http --project /path/to/project

Clients must include Authorization: Bearer SECRET header.

Network binding: By default, cqs binds to localhost only. To expose on a network:

# Requires both flags for safety

cqs serve --transport http --bind 0.0.0.0 --dangerously-allow-network-bind --api-key SECRET

Implements MCP Streamable HTTP spec 2025-11-25 with Origin validation and protocol version headers.

Supported Languages

Rust
Python
TypeScript
JavaScript (JSDoc @param/@returns tags improve search quality)
Go

Indexing

By default, cqs index respects .gitignore rules:

cqs index              # Respects .gitignore

cqs index --no-ignore  # Index everything

cqs index --force      # Re-index all files

cqs index --dry-run    # Show what would be indexed

How It Works

Parses code with tree-sitter to extract:
- Functions and methods
- Classes and structs
- Enums, traits, interfaces
- Constants
Generates embeddings with E5-base-v2 (runs locally)
- Includes doc comments for better semantic matching
Stores in SQLite with vector search + FTS5 keyword index
Hybrid search (RRF): Combines semantic similarity with keyword matching
- Semantic search finds conceptually related code
- Keyword search catches exact identifier matches (e.g., parseConfig)
- Reciprocal Rank Fusion merges both rankings for best results
Uses GPU if available, falls back to CPU

Search Quality

Hybrid search (RRF) combines semantic understanding with keyword matching:

Query	Top Match	Score
"cosine similarity"	`cosine_similarity`	0.85
"validate email regex"	`validateEmail`	0.73
"check if adult age 18"	`isAdult`	0.71
"pop from stack"	`Stack.Pop`	0.70
"generate random id"	`generateId`	0.70

GPU Acceleration (Optional)

cqs works on CPU (~20ms per embedding). GPU provides 3x+ speedup:

Mode	Single Query	Batch (50 docs)
CPU	~20ms	~15ms/doc
CUDA	~6ms	~0.3ms/doc

For GPU acceleration:

Linux

# Add NVIDIA CUDA repo

wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb

sudo dpkg -i cuda-keyring_1.1-1_all.deb

sudo apt update


# Install CUDA runtime and cuDNN 9

sudo apt install cuda-cudart-12-6 libcublas-12-6 libcudnn9-cuda-12

Set library path:

export LD_LIBRARY_PATH=/usr/local/cuda-12.6/lib64:/usr/lib/x86_64-linux-gnu:$LD_LIBRARY_PATH

WSL2

Same as Linux, plus:

Requires NVIDIA GPU driver on Windows host
Add /usr/lib/wsl/lib to LD_LIBRARY_PATH
Tested working with RTX A6000, CUDA 13.0 driver, cuDNN 9.18

Verify

cqs doctor  # Shows execution provider (CUDA or CPU)

Contributing

Issues and PRs welcome at GitHub.

License

MIT

cqs 0.1.18