# embacle-server
Unified OpenAI-compatible REST API + MCP server that proxies requests to embacle LLM runners. Any client that speaks the OpenAI chat completions API or MCP protocol can use it without modification.
## Install
## Usage
```sh
# Start with default provider on localhost:3000

# Specify provider and bind address

# MCP-only mode via stdio (for editor/client integration)
```
## Endpoints
| Method | Path | Description |
|---|---|---|
| POST | `/v1/chat/completions` | Chat completion (streaming and non-streaming) |
| GET | `/v1/models` | List available providers and models |
| GET | `/health` | Per-provider readiness check |
| POST | `/mcp` | MCP Streamable HTTP (JSON-RPC 2.0) |
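Because the server speaks the standard OpenAI chat completions shape, any plain HTTP client works. A minimal Python sketch, assuming the default bind address of `localhost:3000` (the helper names and model string are illustrative, not part of the server's API):

```python
import json
import urllib.request

BASE_URL = "http://localhost:3000"  # assumed default bind address


def build_chat_request(model: str, prompt: str) -> dict:
    """Build a non-streaming OpenAI-style chat completion payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }


def chat(model: str, prompt: str) -> dict:
    """POST the payload to /v1/chat/completions and return the parsed reply."""
    req = urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(build_chat_request(model, prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Any OpenAI-compatible SDK pointed at this base URL should work the same way, since the request and response shapes match the chat completions API.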
## Model Routing
The `model` field determines which provider handles the request. Use a `provider:model` prefix to target a specific runner, or pass a bare model name to use the server's default provider.
```sh
# Explicit provider

# Default provider
```
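The routing rule amounts to splitting an optional provider prefix off the model string. A sketch of that logic (the function name and model strings are assumptions for illustration, not the server's internals):

```python
def route_model(model: str, default_provider: str) -> tuple[str, str]:
    """Resolve a model string to a (provider, model) pair.

    "provider:model" targets that provider explicitly; a bare model
    name falls back to the server's configured default provider.
    """
    if ":" in model:
        # Split only on the first colon, so model names keep any
        # colons of their own.
        provider, name = model.split(":", 1)
        return provider, name
    return default_provider, model
```

For example, `route_model("gemini:flash", "claude")` yields `("gemini", "flash")`, while `route_model("sonnet", "claude")` yields `("claude", "sonnet")`.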
## Multiplex
Pass an array of models to fan out the same prompt to multiple providers concurrently:
Streaming is not supported for multiplex requests.
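A multiplex request is the same chat completions payload with a list in the `model` field. A hedged sketch of building one, with the no-streaming rule enforced up front (function name and model strings are placeholders):

```python
def build_multiplex_request(models: list[str], prompt: str,
                            stream: bool = False) -> dict:
    """Fan the same prompt out to several providers in one request."""
    if stream:
        # The server does not support streaming for multiplex requests.
        raise ValueError("streaming is not supported for multiplex requests")
    return {
        "model": models,  # a list fans out to each listed provider
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
```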
## Authentication
Optional. Set `EMBACLE_API_KEY` to require bearer-token authentication. When unset, all requests are allowed (localhost dev mode).
```sh
EMBACLE_API_KEY=my-secret
```
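On the client side, the key travels as a standard `Authorization: Bearer` header. A sketch of header construction that mirrors the server's rule (an unset key means open access; the helper name is an assumption):

```python
import os


def auth_headers() -> dict:
    """Return request headers, adding a bearer token when a key is set."""
    headers = {"Content-Type": "application/json"}
    key = os.environ.get("EMBACLE_API_KEY")
    if key:
        headers["Authorization"] = f"Bearer {key}"
    return headers
```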
## Requirements
At least one supported CLI tool must be installed and authenticated:
- `claude` (Claude Code)
- `copilot` (GitHub Copilot)
- `cursor-agent` (Cursor Agent)
- `opencode` (OpenCode)
- `gemini` (Gemini CLI)
- `codex` (Codex CLI)
- `goose` (Goose CLI)
- `cline` (Cline CLI)
- `cn` (Continue CLI)
- `oz` (Warp)
- `kiro-cli` (Kiro CLI)
- `kilo` (Kilo Code)
## License
Apache-2.0 — see LICENSE-APACHE.
Full project documentation: github.com/dravr-ai/dravr-embacle