mcp-compressor

mcp-compressor helps agents use large MCP servers without spending huge amounts of context on tool descriptions and schemas.

It can run as a CLI MCP proxy, or be embedded directly from Python, TypeScript, or Rust.

Documentation: https://atlassian-labs.github.io/mcp-compressor/
Repository: https://github.com/atlassian-labs/mcp-compressor/
Blog: https://www.atlassian.com/blog/developer/mcp-compression-preventing-tool-bloat-in-ai-agents/

Why mcp-compressor?

A powerful MCP server can expose dozens or hundreds of tools. Each tool has a name, description, input schema, and sometimes large nested JSON Schema details. Sending all of that to a model up front can waste thousands of tokens before the agent has done any useful work.

For example, large public MCP servers can spend thousands or tens of thousands of tokens on tool descriptions alone. That creates three problems:

models spend context on tools they may never call,
cost increases for every request that carries the tool list,
adding multiple MCP servers quickly overwhelms practical context budgets.

mcp-compressor changes the interaction pattern: the model sees a small compressed surface first, asks for the full schema only for the selected tool, and then invokes that tool.

Pattern 1: compressed MCP proxy

Use this when you want any MCP client to see a smaller tool surface.

sequenceDiagram
    participant Client as MCP client / agent
    participant Proxy as mcp-compressor
    participant Backend as Backend MCP server

    Client->>Proxy: tools/list
    Proxy->>Backend: tools/list
    Backend-->>Proxy: all backend tools and schemas
    Proxy-->>Client: compact wrapper tools
    Client->>Proxy: get_tool_schema(tool_name)
    Proxy-->>Client: full schema for selected tool
    Client->>Proxy: invoke_tool(tool_name, input)
    Proxy->>Backend: tools/call selected backend tool
    Backend-->>Proxy: tool result
    Proxy-->>Client: result

The frontend usually exposes only:

get_tool_schema
invoke_tool
optionally list_tools at max compression

Pattern 2: local proxy for SDKs and generated clients

Use this when your application wants to embed compression directly and call tools from code or shell commands.

flowchart LR
    App[Python / TypeScript / Rust app]
    SDK[CompressorClient]
    Proxy[Local session proxy\n/token auth]
    Generated[Generated CLI / Python / TS clients]
    Backend[MCP backend servers]

    App --> SDK
    SDK --> Proxy
    Generated --> Proxy
    Proxy --> Backend

The SDK starts a local session proxy for generated clients. Generated clients call that proxy using a session token. Your app does not need to spawn a mcp-compressor stdio subprocess.

Features

Compressed MCP proxy for existing MCP clients.
Compression levels: low, medium, high, max.
Python, TypeScript, and Rust SDKs with aligned CompressorClient APIs.
Local TypeScript tool compression for AI SDK-style in-process tools.
CLI Mode and Code Mode generated clients: shell commands plus Python/TypeScript functions.
Just Bash integration for command-oriented agents.
Remote streamable HTTP MCP backends.
OAuth support for providers that support browser authorization.
Tool filters and TOON output to further reduce context.
Atlassian MCP example with OAuth-first usage.

Quick example

=== "CLI"

```bash
mcp-compressor -c medium -- python server.py
```

=== "Python"

```python
from mcp_compressor import CompressorClient

with CompressorClient(
    servers={"alpha": {"command": "python", "args": ["server.py"]}},
    compression_level="medium",
) as proxy:
    print([tool.name for tool in proxy.tools])
    print(proxy.invoke("echo", {"message": "hello"}))
```

=== "TypeScript"

```ts
import { CompressorClient } from "@atlassian/mcp-compressor";

const proxy = await new CompressorClient({
  servers: { alpha: { command: "python", args: ["server.py"] } },
  compressionLevel: "medium",
}).connect();

try {
  console.log(proxy.tools.map((tool) => tool.name));
  console.log(await proxy.invoke("echo", { message: "hello" }));
} finally {
  proxy.close();
}
```

=== "Rust"

```rust
use mcp_compressor::compression::CompressionLevel;
use mcp_compressor::sdk::{CompressorClient, ServerConfig};
use serde_json::json;

let proxy = CompressorClient::builder()
    .server("alpha", ServerConfig::command("python").arg("server.py"))
    .compression_level(CompressionLevel::Medium)
    .build()
    .connect()
    .await?;

let result = proxy.invoke("echo", json!({ "message": "hello" })).await?;
```

Generated clients

CLI Mode generates shell commands:

mcp-compressor --cli-mode --server-name atlassian -- https://mcp.atlassian.com/v1/mcp
atlassian get-accessible-atlassian-resources

Code Mode generates Python or TypeScript functions:

# generated-py/atlassian.py
import atlassian

resources = atlassian.getAccessibleAtlassianResources()

import { getAccessibleAtlassianResources } from "./generated-ts/atlassian.ts";

const resources = await getAccessibleAtlassianResources();

See Code Mode and generated clients for more detail.

Where to go next

Install the package you need.
Run the quickstart.
Read how compression works.
Choose between CLI usage, SDK usage, generated clients, and Just Bash.

mcp-compressor 0.19.5