pub struct RequestConfig {
pub temperature: Option<f64>,
pub max_tokens: Option<u32>,
pub top_p: Option<f64>,
pub top_k: Option<u32>,
pub min_p: Option<f64>,
pub presence_penalty: Option<f64>,
pub response_format: Option<ResponseFormat>,
pub tools: Vec<Tool>,
pub tool_choice: Option<ToolChoice>,
pub user_id: Option<String>,
pub session_id: Option<String>,
pub llm_path: Option<String>,
}
Configuration for a single LLM request.
Overrides the provider’s default settings on a per-request basis. All fields are optional; unset fields fall back to the provider’s defaults.
§Basic Usage
use multi_llm::RequestConfig;
let config = RequestConfig {
temperature: Some(0.7),
max_tokens: Some(1000),
..Default::default()
};§With Tools
use multi_llm::{RequestConfig, Tool, ToolChoice};
let weather_tool = Tool {
name: "get_weather".to_string(),
description: "Get weather for a city".to_string(),
parameters: serde_json::json!({"type": "object", "properties": {}}),
};
let config = RequestConfig {
tools: vec![weather_tool],
tool_choice: Some(ToolChoice::Auto),
..Default::default()
};

§Sampling Parameters
| Parameter | Range | Effect |
|---|---|---|
| temperature | 0.0 to 2.0 | Randomness (0 = deterministic, 2 = very random) |
| top_p | 0.0 to 1.0 | Nucleus sampling threshold |
| top_k | 1+ | Limit vocabulary to the top K tokens |
| presence_penalty | -2.0 to 2.0 | Discourage repetition |
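To make the table concrete, here is a minimal, self-contained sketch of how temperature, top_k, top_p, and min_p typically interact when filtering a token distribution. The function names and logic are illustrative assumptions, not this crate's implementation; real providers apply these parameters server-side.

```rust
/// Apply temperature to raw logits, then softmax into probabilities.
fn softmax_with_temperature(logits: &[f64], temperature: f64) -> Vec<f64> {
    let t = temperature.max(1e-6); // guard against division by zero
    let scaled: Vec<f64> = logits.iter().map(|l| l / t).collect();
    let max = scaled.iter().cloned().fold(f64::NEG_INFINITY, f64::max);
    let exps: Vec<f64> = scaled.iter().map(|l| (l - max).exp()).collect();
    let sum: f64 = exps.iter().sum();
    exps.iter().map(|e| e / sum).collect()
}

/// Indices of tokens that survive top-k, top-p, and min-p filtering,
/// ordered from most to least likely.
fn filter_tokens(probs: &[f64], top_k: usize, top_p: f64, min_p: f64) -> Vec<usize> {
    // Sort token indices by descending probability.
    let mut order: Vec<usize> = (0..probs.len()).collect();
    order.sort_by(|&a, &b| probs[b].partial_cmp(&probs[a]).unwrap());

    let top_prob = probs[order[0]];
    let mut kept = Vec::new();
    let mut cumulative = 0.0;
    for &i in order.iter().take(top_k) {            // top-k: at most K tokens
        if probs[i] < min_p * top_prob { break; }   // min-p: cutoff relative to top token
        kept.push(i);
        cumulative += probs[i];
        if cumulative >= top_p { break; }           // top-p: nucleus cutoff
    }
    kept
}

fn main() {
    let logits = [2.0, 1.0, 0.2, -1.0];
    let probs = softmax_with_temperature(&logits, 0.7);
    // Temperature below 1.0 sharpens the distribution toward the top token.
    assert!(probs[0] > 0.5);
    let kept = filter_tokens(&probs, 3, 0.95, 0.05);
    // The most likely token always survives filtering.
    assert_eq!(kept[0], 0);
    println!("kept {} of {} tokens", kept.len(), logits.len());
}
```

Note how the filters compose: lowering temperature, lowering top_p, or lowering top_k each shrinks the candidate set, which is why combining aggressive values for all of them can make output nearly deterministic.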
§Fields
temperature: Option<f64>

Temperature for response randomness.

- 0.0: Deterministic (always pick the most likely token)
- 0.7: Balanced (a good default for most tasks)
- 1.0+: More creative/random

Range: 0.0 to 2.0 (provider-dependent)
max_tokens: Option<u32>

Maximum tokens to generate in the response.
Limits response length. The actual response may be shorter if the model completes its thought naturally.
top_p: Option<f64>

Top-p (nucleus) sampling parameter.

Sample only from the smallest set of tokens whose cumulative probability exceeds this threshold. Lower values = more focused, higher values = more diverse. Range: 0.0 to 1.0 (typically 0.9 to 0.95)
top_k: Option<u32>

Top-k sampling parameter.
Only consider the top K most likely tokens at each step. Lower values = more focused. Not all providers support this.
min_p: Option<f64>

Min-p sampling parameter.
Filter tokens below this probability relative to the top token. Range: 0.0 to 1.0. Not all providers support this.
presence_penalty: Option<f64>

Presence penalty to discourage repetition.
Positive values reduce likelihood of repeating tokens that have appeared. Range: -2.0 to 2.0 (typically 0.0 to 1.0)
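As a rough sketch of the mechanism (illustrative names and logic, not this crate's or any provider's actual implementation): a positive presence penalty subtracts a flat amount from the logit of every token that has already appeared in the output, regardless of how many times it appeared.

```rust
use std::collections::HashSet;

/// Subtract `penalty` from the logit of each token already seen in the output.
/// The penalty is flat: it applies once per distinct token, not per occurrence.
fn apply_presence_penalty(logits: &mut [f64], seen: &HashSet<usize>, penalty: f64) {
    for (token, logit) in logits.iter_mut().enumerate() {
        if seen.contains(&token) {
            *logit -= penalty;
        }
    }
}

fn main() {
    let mut logits = vec![3.0, 2.0, 1.0];
    // Token 0 has already been generated in this response.
    let seen: HashSet<usize> = [0].into_iter().collect();
    apply_presence_penalty(&mut logits, &seen, 1.5);
    // Token 0's logit dropped from 3.0 to 1.5, so token 1 is now the favorite.
    assert_eq!(logits, vec![1.5, 2.0, 1.0]);
}
```

A negative penalty inverts the effect, boosting tokens that have already appeared, which is why the range extends down to -2.0.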
response_format: Option<ResponseFormat>

Response format for structured output.
When set, the model attempts to return JSON matching the schema.
Use with LlmProvider::execute_structured_llm() for best results.
tools: Vec<Tool>

Tools available for this request.
Define functions the LLM can call. See Tool for details.
tool_choice: Option<ToolChoice>

Strategy for tool selection.
Controls whether tools are optional, required, or disabled.
See ToolChoice for options.
user_id: Option<String>

User ID for analytics and cache analysis.
Helps track cache hit rates per user and debug user-specific issues.
session_id: Option<String>

Session ID for session-level analytics.
Track cache performance and behavior within a conversation session.
llm_path: Option<String>

LLM path context for distinguishing call types.
Useful when your application has multiple LLM call paths (e.g., “chat”, “analysis”, “summarization”).
§Trait Implementations

impl Clone for RequestConfig

fn clone(&self) -> RequestConfig

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.