/// LLM provider configuration — a single config struct for any provider.
///
/// Two optional fields control routing:
/// - `api_key`: `None` → read from env vars (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, etc.)
/// - `base_url`: `None` → auto-detect the provider from the model name; `Some` → custom endpoint
pub struct LlmConfig {
    /// Model identifier (e.g. "gpt-4o", "claude-3-haiku").
    pub model: String,
    /// Explicit API key; `None` falls back to provider env vars.
    pub api_key: Option<String>,
    /// Custom endpoint base URL; `None` auto-detects the provider from `model`.
    pub base_url: Option<String>,
    /// Sampling temperature.
    pub temp: f64,
    /// Maximum output tokens, if capped.
    pub max_tokens: Option<u32>,
    /// OpenAI prompt cache key — caches the system prompt prefix server-side.
    pub prompt_cache_key: Option<String>,
    /// Vertex AI project ID (enables Vertex routing when set).
    pub project_id: Option<String>,
    /// Vertex AI location (default: "global").
    pub location: Option<String>,
    /// Force the Chat Completions API instead of the Responses API. Needed for
    /// OpenAI-compatible endpoints that don't support `/responses` (e.g.
    /// Cloudflare AI Gateway compat, OpenRouter, local models).
    pub use_chat_api: bool,
}
LLM provider configuration — single config for any provider.
Two optional fields control routing:
- `api_key`: `None` → read the key from env vars (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, etc.)
- `base_url`: `None` → auto-detect the provider from the model name; `Some` → use a custom endpoint
use sgr_agent::LlmConfig;
let c = LlmConfig::auto("gpt-4o"); // env vars
let c = LlmConfig::with_key("sk-...", "claude-3-haiku"); // explicit key
let c = LlmConfig::endpoint("sk-or-...", "https://openrouter.ai/api/v1", "gpt-4o"); // custom
let c = LlmConfig::auto("gpt-4o").temperature(0.9).max_tokens(2048); // builder

Fields

- model: String
- api_key: Option<String>
- base_url: Option<String>
- temp: f64
- max_tokens: Option<u32>
- prompt_cache_key: Option<String> — OpenAI prompt cache key; caches the system prompt prefix server-side.
- project_id: Option<String> — Vertex AI project ID (enables Vertex routing when set).
- location: Option<String> — Vertex AI location (default: "global").
- use_chat_api: bool — Force the Chat Completions API instead of the Responses API. Needed for OpenAI-compatible endpoints that don't support /responses (e.g. Cloudflare AI Gateway compat, OpenRouter, local models).
Implementations

impl LlmConfig

pub fn auto(model: impl Into<String>) -> Self
Auto-detect provider from model name, use env vars for auth.
pub fn with_key(api_key: impl Into<String>, model: impl Into<String>) -> Self
Explicit API key, auto-detect provider from model name.
pub fn endpoint(
    api_key: impl Into<String>,
    base_url: impl Into<String>,
    model: impl Into<String>,
) -> Self
Custom OpenAI-compatible endpoint (OpenRouter, Ollama, LiteLLM, etc.).
pub fn vertex(project_id: impl Into<String>, model: impl Into<String>) -> Self
Vertex AI — uses gcloud ADC for auth (no API key needed).
pub fn temperature(self, t: f64) -> Self
Set temperature.
pub fn max_tokens(self, m: u32) -> Self
Set max output tokens.
pub fn prompt_cache_key(self, key: impl Into<String>) -> Self
Set OpenAI prompt cache key for server-side system prompt caching.
pub fn compaction_model(&self) -> String
Infer a cheap/fast model for compaction based on the primary model.
pub fn for_compaction(&self) -> Self
Create a compaction config — cheap model, low max_tokens.