pub struct CompletionRequest<'a> {
pub messages: &'a [Message],
pub system_prompt: &'a str,
pub tools: &'a [ToolSchema],
pub max_tokens: Option<u32>,
pub tool_choice: Option<ToolChoice>,
pub thinking: Option<ThinkingMode>,
pub effort: Option<EffortLevel>,
pub output_schema: Option<Value>,
pub enable_caching: bool,
pub fallback_model: Option<String>,
pub temperature: Option<f64>,
}Expand description
A request to the LLM API.
Fields§
§messages: &'a [Message]§system_prompt: &'a str§tools: &'a [ToolSchema]§max_tokens: Option<u32>§tool_choice: Option<ToolChoice>Tool selection constraint.
thinking: Option<ThinkingMode>Thinking/reasoning configuration.
effort: Option<EffortLevel>Effort level for the response.
output_schema: Option<Value>JSON schema for structured output mode.
enable_caching: boolEnable prompt caching.
fallback_model: Option<String>Fallback model if primary is overloaded.
temperature: Option<f64>Temperature override.
Implementations§
Source§impl<'a> CompletionRequest<'a>
impl<'a> CompletionRequest<'a>
Auto Trait Implementations§
impl<'a> Freeze for CompletionRequest<'a>
impl<'a> RefUnwindSafe for CompletionRequest<'a>
impl<'a> Send for CompletionRequest<'a>
impl<'a> Sync for CompletionRequest<'a>
impl<'a> Unpin for CompletionRequest<'a>
impl<'a> UnsafeUnpin for CompletionRequest<'a>
impl<'a> UnwindSafe for CompletionRequest<'a>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more