pub struct AgentBuilder { /* private fields */ }
Expand description
Builder for constructing an Agent with custom configuration.
Supports two modes:
- Standalone – the builder loads the model itself (simple, one agent):
let agent = AgentBuilder::new()
.backend(Backend::Cpu)
.model_path("model.gguf")
.build()
.expect("Failed to build agent");
- Shared engine – multiple agents share one model (no redundant loading):
let engine = Arc::new(InferenceEngine::load(InferenceConfig {
backend: Backend::Vulkan,
model_path: "model.gguf".into(),
n_gpu_layers: 99,
..Default::default()
}).unwrap());
let agent_a = AgentBuilder::new()
.engine(engine.clone())
.system_prompt("You are agent A.")
.build().unwrap();
let agent_b = AgentBuilder::new()
.engine(engine.clone())
.system_prompt("You are agent B.")
.build().unwrap();
Implementations§
impl AgentBuilder
impl AgentBuilder
pub fn new() -> Self
pub fn engine(self, engine: Arc<InferenceEngine>) -> Self
pub fn engine(self, engine: Arc<InferenceEngine>) -> Self
Use a shared InferenceEngine instead of loading a new model.
When set, backend(), model_path(), n_gpu_layers(), dll_*(),
cache_dir(), and app_name() are ignored — the engine already
has those configured.
pub fn backend(self, backend: Backend) -> Self
pub fn backend(self, backend: Backend) -> Self
Set the compute backend (CPU, CUDA, Vulkan, etc.)
pub fn model_path(self, path: &str) -> Self
pub fn model_path(self, path: &str) -> Self
Path to the GGUF model file.
pub fn n_gpu_layers(self, n: i32) -> Self
pub fn n_gpu_layers(self, n: i32) -> Self
Number of layers to offload to GPU (-1 = all, 0 = none).
pub fn explicit_dll_path(self, path: PathBuf) -> Self
pub fn explicit_dll_path(self, path: PathBuf) -> Self
Explicit path to the llama.cpp DLL (bypasses download).
pub fn dll_version(self, version: &str) -> Self
pub fn dll_version(self, version: &str) -> Self
DLL version tag to download.
pub fn chat_template(self, template: &str) -> Self
pub fn chat_template(self, template: &str) -> Self
Set a custom chat template (Jinja).
pub fn system_prompt(self, prompt: &str) -> Self
pub fn system_prompt(self, prompt: &str) -> Self
System prompt that instructs the model on its role and tool usage.
pub fn max_iterations(self, n: usize) -> Self
pub fn max_iterations(self, n: usize) -> Self
Maximum agent loop iterations (0 = unlimited).
pub fn max_tokens_per_completion(self, n: usize) -> Self
pub fn max_tokens_per_completion(self, n: usize) -> Self
Maximum tokens per model completion.
pub fn temperature(self, temp: f32) -> Self
pub fn temperature(self, temp: f32) -> Self
Sampling temperature.
pub fn repeat_penalty(self, p: f32) -> Self
pub fn repeat_penalty(self, p: f32) -> Self
Repetition penalty.
pub fn stop_sequence(self, stop: &str) -> Self
pub fn stop_sequence(self, stop: &str) -> Self
Add a stop sequence.
pub fn auto_approve(self) -> Self
pub fn auto_approve(self) -> Self
Auto-approve all tool calls (YOLO mode — dangerous!).
pub fn permission_callback(
self,
cb: impl Fn(&PermissionRequest) -> PermissionDecision + Send + Sync + 'static,
) -> Self
pub fn permission_callback( self, cb: impl Fn(&PermissionRequest) -> PermissionDecision + Send + Sync + 'static, ) -> Self
Set a permission callback for interactive approval.
pub fn skip_builtin_tools(self) -> Self
pub fn skip_builtin_tools(self) -> Self
Skip registering built-in tools (bash, read, write, edit, glob).
pub fn skills_path(self, path: PathBuf) -> Self
pub fn skills_path(self, path: PathBuf) -> Self
Add an extra directory to search for skills.
pub fn activate_skill(self, name: &str) -> Self
pub fn activate_skill(self, name: &str) -> Self
Explicitly activate a skill by name.
pub fn no_agents_md(self) -> Self
pub fn no_agents_md(self) -> Self
Disable AGENTS.md discovery entirely.
pub fn scheduler(self, scheduler: Arc<InferenceScheduler>) -> Self
pub fn scheduler(self, scheduler: Arc<InferenceScheduler>) -> Self
Set an inference scheduler to limit concurrent inferences.
Use InferenceScheduler::new(1) to serialize all inference (one
agent at a time), or a higher value for controlled parallelism.
Without a scheduler, agents run truly parallel (safe, but GPU-heavy).
pub fn build(self) -> Result<Agent, AgentError>
pub fn build(self) -> Result<Agent, AgentError>
Build the agent.
If a shared engine was provided via .engine(), it is reused.
Otherwise, a new engine is created from the standalone fields.