Struct ModelRequest

pub struct ModelRequest {
    pub model: String,
    pub messages: Vec<Message>,
    pub system: SystemPrompt,
    pub max_tokens: Option<u32>,
    pub temperature: Option<f32>,
    pub top_p: Option<f32>,
    pub top_k: Option<u32>,
    pub stop_sequences: Vec<String>,
    pub tools: Arc<[ToolSpec]>,
    pub tool_choice: ToolChoice,
    pub parallel_tool_calls: Option<bool>,
    pub response_format: Option<ResponseFormat>,
    pub end_user_id: Option<String>,
    pub seed: Option<i64>,
    pub reasoning_effort: Option<ReasoningEffort>,
    pub provider_extensions: ProviderExtensions,
    pub continued_from: Vec<ProviderEchoSnapshot>,
}

One model invocation, before encoding to vendor wire format.

Built by users (or higher-level recipes) and handed to Codec::encode. Codecs produce vendor-shaped JSON; the IR is the canonical surface and never carries vendor-specific fields directly.
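
As a rough illustration of building a request by hand, the sketch below relies on the `Default` impl listed further down this page and on struct update syntax. The `Message` type and the reduced field set are hypothetical stand-ins, not the crate's real definitions.

```rust
// Hypothetical, trimmed-down stand-ins for the real IR types.
#[derive(Clone, Debug, Default, PartialEq)]
struct Message {
    role: String,
    text: String,
}

#[derive(Clone, Debug, Default, PartialEq)]
struct ModelRequest {
    model: String,
    messages: Vec<Message>,
    max_tokens: Option<u32>,
    temperature: Option<f32>,
    stop_sequences: Vec<String>,
}

/// Build a request by setting a few fields and defaulting the rest.
fn build_request() -> ModelRequest {
    ModelRequest {
        model: "claude-opus-4-7".to_string(),
        messages: vec![Message {
            role: "user".to_string(),
            text: "Summarise this document.".to_string(),
        }],
        max_tokens: Some(1024),
        // Every other field keeps its `Default` value (None or empty).
        ..Default::default()
    }
}
```

Leaving the sampling fields at `None` means the vendor defaults apply, as described in the field docs below.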

Fields§

§model: String

Vendor model identifier (e.g. claude-opus-4-7, gpt-4.1).

§messages: Vec<Message>

Conversation up to this turn. Must contain at least one user message for most providers; codecs reject empty lists at encode time.

§system: SystemPrompt

Ordered system-prompt blocks. Empty = “no system prompt” (codecs treat as if the field were absent). Per-block crate::ir::CacheControl is honored natively by codecs that support it (Anthropic, Bedrock Converse for Claude); other codecs concatenate block text and emit LossyEncode warnings when any block is cached.
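
The concatenation fallback described above might look roughly like the following. `SystemBlock`, `Warning`, the `cached` flag, and the blank-line separator are all assumptions made for this sketch; the real `SystemPrompt` and `CacheControl` types may be shaped differently.

```rust
// Hypothetical stand-ins; the real SystemPrompt / CacheControl types may differ.
#[derive(Clone, Debug, PartialEq)]
struct SystemBlock {
    text: String,
    cached: bool, // stands in for a per-block CacheControl marker
}

#[derive(Debug, PartialEq)]
enum Warning {
    LossyEncode(&'static str),
}

/// Fallback for a codec without native per-block caching:
/// join the block text and warn if any cache marker had to be dropped.
fn flatten_system(blocks: &[SystemBlock]) -> (Option<String>, Vec<Warning>) {
    if blocks.is_empty() {
        // An empty prompt is treated as if the field were absent.
        return (None, Vec::new());
    }
    let text = blocks
        .iter()
        .map(|b| b.text.as_str())
        .collect::<Vec<_>>()
        .join("\n\n"); // separator is an assumption of this sketch
    let mut warnings = Vec::new();
    if blocks.iter().any(|b| b.cached) {
        warnings.push(Warning::LossyEncode("system: cache control dropped"));
    }
    (Some(text), warnings)
}
```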

§max_tokens: Option<u32>

Hard cap on output tokens. None = vendor default.

§temperature: Option<f32>

Sampling temperature in the range [0.0, 2.0]. Codecs clamp the value to the vendor's supported range.
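
A minimal sketch of the clamping step, assuming the codec knows its vendor's bounds. The function name and the bounds passed in are hypothetical; only `f32::clamp` is a real standard-library call.

```rust
/// Clamp an optional temperature into a vendor's accepted range.
/// `None` is passed through so the vendor default still applies.
/// Assumes `vendor_min <= vendor_max` (f32::clamp panics otherwise).
fn clamp_temperature(requested: Option<f32>, vendor_min: f32, vendor_max: f32) -> Option<f32> {
    requested.map(|t| t.clamp(vendor_min, vendor_max))
}
```

For a hypothetical vendor that accepts [0.0, 1.0], a requested 1.5 would be sent as 1.0.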

§top_p: Option<f32>

Nucleus sampling parameter.

§top_k: Option<u32>

Top-k sampling parameter — restrict candidate-token sampling to the k most-likely tokens. None defers to the vendor default.

Codec mapping (CLAUDE.md §“Provider IR promotion”; native on Anthropic, Gemini, Bedrock Converse on Claude — three vendors, criterion satisfied):

Anthropic, Bedrock Converse on Claude — pass-through to the Messages API top_k field.
Gemini — pass-through to generationConfig.topK.
OpenAI Chat / OpenAI Responses — LossyEncode (no native parameter).
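
The mapping above can be read as a small decision table. The sketch below encodes it with hypothetical `Vendor` and `TopKEncoding` enums; the wire field paths are taken from the list above, while everything else is illustrative and not the crate's codec API.

```rust
#[derive(Clone, Copy, Debug, PartialEq)]
enum Vendor {
    Anthropic,
    BedrockConverseClaude,
    Gemini,
    OpenAiChat,
    OpenAiResponses,
}

#[derive(Debug, PartialEq)]
enum TopKEncoding {
    /// Emit the value under this wire field.
    Field(&'static str, u32),
    /// No native parameter: drop the value and report a LossyEncode warning.
    Lossy,
    /// Nothing requested; the vendor default applies.
    Omitted,
}

fn encode_top_k(vendor: Vendor, top_k: Option<u32>) -> TopKEncoding {
    let k = match top_k {
        Some(k) => k,
        None => return TopKEncoding::Omitted,
    };
    match vendor {
        Vendor::Anthropic | Vendor::BedrockConverseClaude => TopKEncoding::Field("top_k", k),
        Vendor::Gemini => TopKEncoding::Field("generationConfig.topK", k),
        Vendor::OpenAiChat | Vendor::OpenAiResponses => TopKEncoding::Lossy,
    }
}
```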

§stop_sequences: Vec<String>

Sequences that, when produced, halt generation.

§tools: Arc<[ToolSpec]>

Tools advertised to the model. Empty = no tool calls permitted. Held as Arc<[ToolSpec]> so per-dispatch cloning of the request shape is an atomic refcount bump rather than a deep walk of every tool’s JSON schema. Codecs read through the Deref<Target = [ToolSpec]> coercion — every &request.tools site continues to see &[ToolSpec] unchanged.
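
To make the sharing behaviour concrete, here is a small sketch using only `std::sync::Arc`. The `ToolSpec` struct is a hypothetical stand-in with a single field; the real type also carries a JSON schema.

```rust
use std::sync::Arc;

// Hypothetical stand-in for the real ToolSpec.
#[derive(Clone, Debug, PartialEq)]
struct ToolSpec {
    name: String,
}

/// Takes a plain slice; a caller holding an Arc<[ToolSpec]> can pass `&tools`
/// and deref coercion yields `&[ToolSpec]`.
fn tool_names(tools: &[ToolSpec]) -> Vec<&str> {
    tools.iter().map(|t| t.name.as_str()).collect()
}

/// Returns two handles to the same tool list.
fn share_tools() -> (Arc<[ToolSpec]>, Arc<[ToolSpec]>) {
    let tools: Arc<[ToolSpec]> = Arc::from(vec![
        ToolSpec { name: "search".to_string() },
        ToolSpec { name: "fetch".to_string() },
    ]);
    // Cloning bumps the reference count; the ToolSpec values are not copied.
    let shared = Arc::clone(&tools);
    (tools, shared)
}
```

Both handles point at the same allocation, so cloning a request that holds the list does not walk the tool schemas.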

§tool_choice: ToolChoice

Constraint on tool selection. Defaults to ToolChoice::Auto.

§parallel_tool_calls: Option<bool>

Allow the model to emit more than one tool call in a single turn. Some(true) opts in to parallel tool use, Some(false) forces serial dispatch, None defers to the vendor default.

Codec mapping:

Anthropic, Bedrock Converse on Claude — translate to tool_choice.disable_parallel_tool_use (inverted polarity); the codec only emits when a tool_choice block is present.
OpenAI Chat / OpenAI Responses — pass-through to the parallel_tool_calls field.
Gemini — LossyEncode (no native parallel-tool toggle).
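
The inverted polarity on the Anthropic side is easy to get backwards, so a sketch may help. The function name and the `has_tool_choice_block` parameter are hypothetical; the logic follows the mapping described above.

```rust
/// Anthropic-style mapping sketch: returns the value to write to
/// `tool_choice.disable_parallel_tool_use`, or `None` when nothing is emitted.
fn disable_parallel_tool_use(
    parallel_tool_calls: Option<bool>,
    has_tool_choice_block: bool,
) -> Option<bool> {
    if !has_tool_choice_block {
        // The flag lives inside tool_choice, so without that block there is
        // nowhere to put it.
        return None;
    }
    // Inverted polarity: "allow parallel" becomes "disable = false".
    parallel_tool_calls.map(|allow| !allow)
}
```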

Promoted to IR per the rule “≥ 2 first-party vendors carry the concept natively → IR field” (CLAUDE.md §“Provider IR promotion”).

§response_format: Option<ResponseFormat>

Optional structured-output constraint. Codecs route to vendor-canonical channels (Anthropic output_config.format, OpenAI response_format / text.format, Gemini responseJsonSchema).

§end_user_id: Option<String>

Pseudonymous end-user identifier — abuse-monitoring, per-user rate-limit attribution, and audit trail. Vendor pseudonym, never PII (no email / IP / real name).

Codec mapping (native on Anthropic + OpenAI Chat + OpenAI Responses — two distinct vendors, criterion satisfied):

Anthropic — metadata.user_id.
OpenAI Chat / OpenAI Responses — top-level user.
Gemini, Bedrock Converse — LossyEncode (no native end-user attribution channel).

§seed: Option<i64>

Deterministic-generation seed. Same seed + same request → same output, best-effort (vendors document this as not strictly guaranteed across model versions).

Codec mapping (native on OpenAI Chat + OpenAI Responses + Gemini — two distinct vendors, criterion satisfied):

OpenAI Chat / OpenAI Responses — top-level seed.
Gemini — generationConfig.seed.
Anthropic, Bedrock Converse — LossyEncode (no native deterministic-sampling knob).

§reasoning_effort: Option<ReasoningEffort>

Cross-vendor reasoning-effort knob. When Some, codecs translate onto their native wire shape per the mapping in ReasoningEffort’s module doc: Off/Minimal/Low/Medium/High/Auto snap to vendor buckets, lossy approximations emit ModelWarning::LossyEncode, and VendorSpecific(s) passes through the literal vendor wire value. None ⇒ vendor default (codec emits no thinking / reasoning field).

§provider_extensions: ProviderExtensions

Per-vendor typed knobs that don’t generalise to a cross-provider IR field — e.g. Anthropic disable_parallel_tool_use, Gemini safetySettings, Bedrock guardrails. Codecs read their own ext when encoding and emit ModelWarning::ProviderExtensionIgnored when another vendor’s ext is present (the operator intended a knob this wire format cannot honour).
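
One way to picture the warning rule is sketched below. The `ProviderExtensions` layout (one optional slot per vendor, holding a string) is an assumption made for illustration only; the real type holds typed per-vendor knobs and may be organised differently.

```rust
#[derive(Clone, Copy, Debug, PartialEq)]
enum Vendor {
    Anthropic,
    Gemini,
    Bedrock,
}

#[derive(Debug, PartialEq)]
enum Warning {
    ProviderExtensionIgnored(Vendor),
}

// Hypothetical shape: one optional slot per vendor.
#[derive(Default)]
struct ProviderExtensions {
    anthropic: Option<String>,
    gemini: Option<String>,
    bedrock: Option<String>,
}

/// Report every populated extension that belongs to a vendor other than `me`.
fn foreign_extension_warnings(ext: &ProviderExtensions, me: Vendor) -> Vec<Warning> {
    let present = [
        (Vendor::Anthropic, ext.anthropic.is_some()),
        (Vendor::Gemini, ext.gemini.is_some()),
        (Vendor::Bedrock, ext.bedrock.is_some()),
    ];
    present
        .iter()
        .filter(|(vendor, set)| *set && *vendor != me)
        .map(|(vendor, _)| Warning::ProviderExtensionIgnored(*vendor))
        .collect()
}
```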

§continued_from: Vec<ProviderEchoSnapshot>

Vendor-keyed opaque round-trip tokens carrying state from a prior turn — OpenAI Responses previous_response_id is the canonical example. Codecs read entries matching their own Codec::name and translate to the vendor’s chain-pointer wire field; non-matching entries are ignored. Empty when the request does not chain from a prior turn.
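
The lookup a codec performs over `continued_from` could be sketched as follows. The two-field `ProviderEchoSnapshot` and the codec name strings are hypothetical; the real snapshot is opaque and its layout is not shown on this page.

```rust
// Hypothetical stand-in; the real ProviderEchoSnapshot is opaque.
#[derive(Clone, Debug, PartialEq)]
struct ProviderEchoSnapshot {
    codec: String, // compared against the codec's name
    token: String, // opaque vendor value, e.g. a previous_response_id
}

/// Pick the chain pointer addressed to this codec, ignoring other vendors' entries.
fn chain_pointer<'a>(
    continued_from: &'a [ProviderEchoSnapshot],
    codec_name: &str,
) -> Option<&'a str> {
    continued_from
        .iter()
        .find(|snap| snap.codec == codec_name)
        .map(|snap| snap.token.as_str())
}
```

An empty list, or a list with only other vendors' entries, yields `None`, which corresponds to a request that does not chain from a prior turn.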

Implementations§

impl ModelRequest

pub fn continue_turn( self, prior_response: &ModelResponse, next_message: Message, ) -> Self

§Tool round-trip

§Why a self-consuming method

Trait Implementations§

impl Clone for ModelRequest

fn clone(&self) -> ModelRequest

fn clone_from(&mut self, source: &Self)

impl Debug for ModelRequest

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for ModelRequest

fn default() -> ModelRequest

impl<'de> Deserialize<'de> for ModelRequest

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where __D: Deserializer<'de>,

impl PartialEq for ModelRequest

fn eq(&self, other: &ModelRequest) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Serialize for ModelRequest

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error> where __S: Serializer,

impl StructuralPartialEq for ModelRequest

Auto Trait Implementations§

impl Freeze for ModelRequest

impl RefUnwindSafe for ModelRequest

impl Send for ModelRequest

impl Sync for ModelRequest

impl Unpin for ModelRequest

impl UnsafeUnpin for ModelRequest

impl UnwindSafe for ModelRequest

Blanket Implementations§

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> DynClone for T where T: Clone,

fn __clone_box(&self, _: Private) -> *mut ()

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> PolicyExt for T where T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P> where T: Policy<B, E>, P: Policy<B, E>,

fn or<P, B, E>(self, other: P) -> Or<T, P> where T: Policy<B, E>, P: Policy<B, E>,

impl<T> Same for T

type Output = T

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self> where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>

impl<T> DeserializeOwned for T where T: for<'de> Deserialize<'de>,
