
# Dragen
CodeAct-style AI agents that write Python, not JSON.
When you ask an LLM to call tools via JSON schemas, you're asking it to work in a format it wasn't trained on. It can't loop over results, can't branch on conditions, can't compose tool outputs — it fills in one schema at a time and waits. But give it a Python sandbox and it writes code: loops, branches, error handling, multi-step reasoning — all in one shot.
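For instance, given a hypothetical `search` tool (the stub below is a stand-in, not a real registered tool), a code-writing agent can loop, branch, and chain results in a single generation — something a one-schema-at-a-time JSON caller cannot do:

```python
# What a CodeAct agent can emit in one step: loop, branch, and
# compose tool outputs. `search` is a toy stand-in for a real tool.
def search(query: str) -> list[str]:
    return [f"result about {query} #{i}" for i in range(3)]

findings = []
for topic in ["sandboxing", "bytecode VMs"]:
    results = search(topic)
    if results:                      # branch on a tool's output
        findings.append(results[0])  # compose: feed into the next step

summary = "; ".join(findings)
```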
That's the CodeAct pattern, and Dragen is a framework built around it. You register tools as Python functions, hand the agent a task, and it writes code to solve it. The code runs in a Littrs sandbox — a Python-to-bytecode compiler and stack VM that embeds directly into your application. No containers, no cloud sandboxing services, no `exec()`.
## Why not smolagents?
Smolagents pioneered CodeAct agents in the Python ecosystem, but its default local executor is a restricted interpreter — not a true sandbox. It blocks dangerous imports and restricts dunder attributes, but has known CVEs (CVE-2025-5120, CVE-2025-9959) that allow sandbox escapes through whitelisted modules. For real isolation, you need Docker, E2B, or Modal — each adding infrastructure, latency, and operational overhead.
Dragen takes a different approach. The Littrs sandbox compiles Python to bytecode and runs it on a stack-based VM with zero ambient capabilities — no filesystem, no network, no env vars, no dangerous imports. Resource limits are enforced at the VM level and cannot be caught by `try`/`except`. The only way sandboxed code can reach the outside world is through the tools you explicitly provide. All of this runs in-process: `cargo add dragen` or `pip install dragen` and you're done.
## What you get
- Secure sandbox — Littrs with resource limits, file mounting, and custom modules. Details
- Structured output — JSON Schema validation with self-correction. Works with Pydantic. Details
- Multi-agent pipelines — shared `Context` for typed data passing between agents. Details
- Parallel execution — `agent.map(tasks)` runs concurrent tasks on cloned agents. Details
- Any LLM — OpenAI, Anthropic, Groq, Ollama, or any compatible API via Tanukie
- Observable — event callbacks for every step of the agent loop. Details
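The `agent.map(tasks)` behavior — clone the agent once per task and run the clones concurrently — can be sketched with `asyncio`. The `Worker` class here is a toy stand-in, not Dragen's agent type:

```python
import asyncio
from copy import deepcopy

class Worker:
    """Toy stand-in for an agent that can be cloned and run concurrently."""
    def __init__(self, name: str):
        self.name = name

    async def run(self, task: str) -> str:
        await asyncio.sleep(0)  # yield to the event loop, as real LLM calls would
        return f"{self.name}: {task} done"

async def agent_map(agent: Worker, tasks: list[str]) -> list[str]:
    clones = [deepcopy(agent) for _ in tasks]          # one clone per task
    return await asyncio.gather(*(c.run(t) for c, t in zip(clones, tasks)))

results = asyncio.run(agent_map(Worker("a"), ["t1", "t2", "t3"]))
```

`asyncio.gather` preserves task order, so results line up with the input list even though the clones run concurrently.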
## Installation

Rust:

```sh
cargo add dragen
```

Python:

```sh
pip install dragen
```
## Quick Start
### Rust

```rust
// Identifiers below are reconstructed; see DOCS.md for the exact API.
use dragen::tool;

/// Search the web for information.
///
/// Args:
///     query: The search query
#[tool]
async fn search(query: String) -> String {
    format!("Results for: {query}")
}
```
### Python

```python
# Identifiers below are reconstructed; see DOCS.md for the exact API.
from dragen import Agent, tool

@tool
def search(query: str) -> str:
    """Search the web for information."""
    return f"Results for: {query}"

agent = Agent(tools=[search])  # model/provider configuration omitted
result = agent.run("What's new in sandboxed Python VMs?")
```
## Examples
### Structured output with self-correction
Pass a schema and the agent retries until the output validates:
```python
# Reconstructed example; field and parameter names are illustrative —
# see DOCS.md for the exact API.
from pydantic import BaseModel

class Review(BaseModel):
    summary: str
    sentiment: str  # positive, negative, neutral
    confidence: float

agent = Agent(tools=[...])
result = agent.run("Classify this customer review: ...", output_schema=Review)
review = result.output  # a validated Review instance
```
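Conceptually, the self-correction loop feeds validation errors back into the prompt until the output parses. A minimal, library-free sketch of that idea — the `validate` and `llm` functions here are toy stand-ins, not Dragen's internals:

```python
import json

def validate(data: dict) -> list[str]:
    """Toy stand-in for JSON Schema validation."""
    errors = []
    if data.get("sentiment") not in {"positive", "negative", "neutral"}:
        errors.append("sentiment must be positive, negative, or neutral")
    return errors

# Fake LLM: the first answer is invalid; the retry (with feedback) is valid.
replies = iter(['{"sentiment": "great"}', '{"sentiment": "positive"}'])
def llm(prompt: str) -> str:
    return next(replies)

prompt = "Classify the review."
for attempt in range(3):
    data = json.loads(llm(prompt))
    errors = validate(data)
    if not errors:
        break
    prompt += f"\nFix these validation errors: {errors}"
```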
### Multi-agent pipeline with shared context
Agents pass typed data to each other through a shared `Context`:
```python
# Reconstructed example; see DOCS.md for the exact API.
ctx = Context()

# Planner researches and writes a plan
plan = planner.run("Research the topic and write a plan", context=ctx)

# Writer reads the plan and produces content
draft = writer.run("Write the article following the plan", context=ctx)
article = draft.output
```
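Dragen's `Context` API is not reproduced here, but the underlying idea — a shared, typed key/value store that one agent writes and the next reads — can be sketched in plain Python:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class Context:
    """Minimal stand-in: a shared store agents read from and write to."""
    data: dict[str, Any] = field(default_factory=dict)

    def set(self, key: str, value: Any) -> None:
        self.data[key] = value

    def get(self, key: str) -> Any:
        return self.data[key]

ctx = Context()
ctx.set("plan", ["intro", "body", "conclusion"])        # planner writes
sections = [f"Section: {s}" for s in ctx.get("plan")]   # writer reads
```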
### Recursive Language Model (RLM)
RLMs let an LLM recursively call itself to process inputs far beyond its context window. The long input lives in the sandbox as a variable — the agent writes code to slice, examine, and summarize chunks, accumulating results across iterations:
```python
# Reconstructed example; see DOCS.md for the exact API.
agent = Agent(tools=[...])
document = open("transcripts.txt").read()  # e.g. 500K tokens
sandbox = Sandbox(variables={"document": document})
summary = agent.run("Summarize the key decisions", sandbox=sandbox)
```
The agent writes code like `chunk = document[0:5000]`, processes it, then `chunk = document[5000:10000]`, accumulating findings in a list variable across iterations — recursively decomposing the input without ever exceeding its context window.
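The same slice-and-accumulate pattern, runnable as plain Python on a toy document (the chunk size and bookkeeping are illustrative):

```python
# Slice the document, process each chunk, accumulate findings.
document = "x" * 12_000          # stand-in for a 500K-token input
chunk_size = 5_000
findings = []

pos = 0
while pos < len(document):
    chunk = document[pos:pos + chunk_size]
    # In an RLM, this line would be a recursive model call on the chunk.
    findings.append(f"chunk@{pos}: {len(chunk)} chars")
    pos += chunk_size

summary = "\n".join(findings)
```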
### Custom sandbox with limits and file access
Pre-configure a sandbox with resource limits and mounted files:
```python
# Reconstructed example; parameter names are illustrative —
# see DOCS.md for the exact API.
sandbox = Sandbox(max_memory_mb=256, max_steps=1_000_000, mounts=["data/report.csv"])
agent = Agent(sandbox=sandbox, tools=[...])
result = agent.run("Analyze the mounted report")
```
For the full feature reference, see DOCS.md. More examples in examples/.
## Citation
If you use Dragen in your research, please cite it as: