ChatCompletionRequest

Struct ChatCompletionRequest 

Source
pub struct ChatCompletionRequest {
Show 18 fields pub model: String, pub messages: Vec<ChatMessage>, pub temperature: Option<f32>, pub max_tokens: Option<u32>, pub top_p: Option<f32>, pub frequency_penalty: Option<f32>, pub presence_penalty: Option<f32>, pub stop: Option<Vec<String>>, pub user: Option<String>, pub provider: Option<String>, pub stream: Option<bool>, pub logit_bias: Option<Value>, pub logprobs: Option<bool>, pub top_logprobs: Option<u32>, pub n: Option<u32>, pub response_format: Option<ResponseFormat>, pub tools: Option<Vec<Tool>>, pub tool_choice: Option<ToolChoice>,
}
Expand description

Represents a request to create a chat completion.

Fields§

§model: String

The identifier of the model to use for the completion (e.g., “gpt-4o”, “claude-sonnet-4”).

§messages: Vec<ChatMessage>

A list of messages that form the conversation history.

§temperature: Option<f32>

The sampling temperature to use, between 0.0 and 2.0. Higher values will make the output more random, while lower values will make it more focused and deterministic.

§max_tokens: Option<u32>

The maximum number of tokens to generate in the completion.

§top_p: Option<f32>

The nucleus sampling parameter. The model considers the results of the tokens with top_p probability mass. So, 0.1 means only the tokens comprising the top 10% probability mass are considered.

§frequency_penalty: Option<f32>

A penalty applied to new tokens based on their frequency in the text so far. It decreases the model’s likelihood to repeat the same line verbatim.

§presence_penalty: Option<f32>

A penalty applied to new tokens based on whether they appear in the text so far. It increases the model’s likelihood to talk about new topics.

§stop: Option<Vec<String>>

A list of sequences that will cause the model to stop generating further tokens.

§user: Option<String>

A unique identifier representing your end-user, which can help in monitoring and tracking conversations.

§provider: Option<String>

A hint to the router about which provider to use for the model.

§stream: Option<bool>

If set to true, the response will be streamed as a series of events.

§logit_bias: Option<Value>

Modify the likelihood of specified tokens appearing in the completion.

§logprobs: Option<bool>

Whether to return log probabilities of the output tokens.

§top_logprobs: Option<u32>

An integer between 0 and 20 specifying the number of most likely tokens to return at each token position.

§n: Option<u32>

How many chat completion choices to generate for each input message.

§response_format: Option<ResponseFormat>

An object specifying the format that the model must output.

§tools: Option<Vec<Tool>>

A list of tools the model may call.

§tool_choice: Option<ToolChoice>

Controls which (if any) tool is called by the model.

Implementations§

Source§

impl ChatCompletionRequest

Source

pub fn new(model: impl Into<String>, messages: Vec<ChatMessage>) -> Self

Creates a new ChatCompletionRequest with the given model and messages.

§Arguments
  • model - The identifier of the model to use.
  • messages - The list of messages for the conversation.
Source

pub fn with_temperature(self, temperature: f32) -> Self

Sets the temperature for the chat completion.

The temperature is clamped between 0.0 and 2.0.

§Arguments
  • temperature - The sampling temperature.
Source

pub fn with_max_tokens(self, max_tokens: u32) -> Self

Sets the maximum number of tokens to generate.

§Arguments
  • max_tokens - The maximum number of tokens.
Source

pub fn with_user(self, user: impl Into<String>) -> Self

Sets the user identifier for the chat completion.

§Arguments
  • user - A unique identifier for the end-user.
Source

pub fn with_provider(self, provider: impl Into<String>) -> Self

Sets a provider hint for the request.

§Arguments
  • provider - The name of the provider to use.
Source

pub fn with_stream(self, stream: bool) -> Self

Enables or disables streaming for the response.

§Arguments
  • stream - true to enable streaming, false to disable.
Source

pub fn with_logit_bias(self, logit_bias: Value) -> Self

Sets the logit bias for the chat completion.

§Arguments
  • logit_bias - A map of token IDs to bias values.
Source

pub fn with_logprobs(self, logprobs: bool) -> Self

Enables or disables log probabilities for the response.

§Arguments
  • logprobs - true to include log probabilities.
Source

pub fn with_top_logprobs(self, top_logprobs: u32) -> Self

Sets the number of most likely tokens to return at each position.

§Arguments
  • top_logprobs - The number of top log probabilities to return.
Source

pub fn with_n(self, n: u32) -> Self

Sets the number of chat completion choices to generate.

§Arguments
  • n - The number of completions to generate.
Source

pub fn with_response_format(self, response_format: ResponseFormat) -> Self

Sets the response format for the chat completion.

§Arguments
  • response_format - The format the model must output.
Source

pub fn with_tools(self, tools: Vec<Tool>) -> Self

Sets the tools available to the model.

§Arguments
  • tools - A list of tools the model can use.
Source

pub fn with_tool_choice(self, tool_choice: ToolChoice) -> Self

Sets the tool choice for the chat completion.

§Arguments
  • tool_choice - Controls which tool the model uses.
Source

pub fn validate_openai_compatibility(&self) -> Result<(), String>

Validates that the request parameters are compatible with OpenAI standards.

This method checks parameter ranges and values to ensure they match OpenAI’s API specifications.

§Returns

A Result indicating whether the request is valid for OpenAI compatibility.

Trait Implementations§

Source§

impl Clone for ChatCompletionRequest

Source§

fn clone(&self) -> ChatCompletionRequest

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for ChatCompletionRequest

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<'de> Deserialize<'de> for ChatCompletionRequest

Source§

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more
Source§

impl Serialize for ChatCompletionRequest

Source§

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> PolicyExt for T
where T: ?Sized,

Source§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow. Read more
Source§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow. Read more
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

Source§

impl<T> ErasedDestructor for T
where T: 'static,