Struct InferenceParameter

Source

#[non_exhaustive]pub struct InferenceParameter {
    pub max_output_tokens: Option<i32>,
    pub temperature: Option<f64>,
    pub top_k: Option<i32>,
    pub top_p: Option<f64>,
    /* private fields */
}

Available on crate features conversations or generator-evaluations or generators only.

Expand description

The parameters of inference.

Fields (Non-exhaustive)§

This struct is marked as non-exhaustive

Non-exhaustive structs could have additional fields added in future. Therefore, non-exhaustive structs cannot be constructed in external crates using the traditional Struct { .. } syntax; cannot be matched against without a wildcard ..; and struct update syntax will not work.

§max_output_tokens: Option<i32>

Optional. Maximum number of the output tokens for the generator.

§temperature: Option<f64>

Optional. Controls the randomness of LLM predictions. Low temperature = less random. High temperature = more random. If unset (or 0), uses a default value of 0.

§top_k: Option<i32>

Optional. Top-k changes how the model selects tokens for output. A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature). For each token selection step, the top K tokens with the highest probabilities are sampled. Then tokens are further filtered based on topP with the final token selected using temperature sampling. Specify a lower value for less random responses and a higher value for more random responses. Acceptable value is [1, 40], default to 40.

§top_p: Option<f64>

Optional. Top-p changes how the model selects tokens for output. Tokens are selected from most K (see topK parameter) probable to least until the sum of their probabilities equals the top-p value. For example, if tokens A, B, and C have a probability of 0.3, 0.2, and 0.1 and the top-p value is 0.5, then the model will select either A or B as the next token (using temperature) and doesn’t consider C. The default top-p value is 0.95. Specify a lower value for less random responses and a higher value for more random responses. Acceptable value is [0.0, 1.0], default to 0.95.

Struct InferenceParameter Copy item path

Fields (Non-exhaustive)§

Implementations§

impl InferenceParameter

pub fn new() -> Self

pub fn set_max_output_tokens<T>(self, v: T) -> Selfwhere T: Into<i32>,

§Example

pub fn set_or_clear_max_output_tokens<T>(self, v: Option<T>) -> Selfwhere T: Into<i32>,

§Example

pub fn set_temperature<T>(self, v: T) -> Selfwhere T: Into<f64>,

§Example

pub fn set_or_clear_temperature<T>(self, v: Option<T>) -> Selfwhere T: Into<f64>,

§Example

pub fn set_top_k<T>(self, v: T) -> Selfwhere T: Into<i32>,

§Example

pub fn set_or_clear_top_k<T>(self, v: Option<T>) -> Selfwhere T: Into<i32>,

§Example

pub fn set_top_p<T>(self, v: T) -> Selfwhere T: Into<f64>,

§Example

pub fn set_or_clear_top_p<T>(self, v: Option<T>) -> Selfwhere T: Into<f64>,

§Example

Trait Implementations§

impl Clone for InferenceParameter

fn clone(&self) -> InferenceParameter

fn clone_from(&mut self, source: &Self)

impl Debug for InferenceParameter

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for InferenceParameter

fn default() -> InferenceParameter

impl Message for InferenceParameter

fn typename() -> &'static str

impl PartialEq for InferenceParameter

fn eq(&self, other: &InferenceParameter) -> bool

fn ne(&self, other: &Rhs) -> bool

impl StructuralPartialEq for InferenceParameter

Auto Trait Implementations§

impl Freeze for InferenceParameter

impl RefUnwindSafe for InferenceParameter

impl Send for InferenceParameter

impl Sync for InferenceParameter

impl Unpin for InferenceParameter

impl UnwindSafe for InferenceParameter

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> PolicyExt for Twhere T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P>where T: Policy<B, E>, P: Policy<B, E>,

fn or<P, B, E>(self, other: P) -> Or<T, P>where T: Policy<B, E>, P: Policy<B, E>,

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for Twhere V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

Struct InferenceParameter

pub fn set_max_output_tokens<T>(self, v: T) -> Self
where T: Into<i32>,

pub fn set_or_clear_max_output_tokens<T>(self, v: Option<T>) -> Self
where T: Into<i32>,

pub fn set_temperature<T>(self, v: T) -> Self
where T: Into<f64>,

pub fn set_or_clear_temperature<T>(self, v: Option<T>) -> Self
where T: Into<f64>,

pub fn set_top_k<T>(self, v: T) -> Self
where T: Into<i32>,

pub fn set_or_clear_top_k<T>(self, v: Option<T>) -> Self
where T: Into<i32>,

pub fn set_top_p<T>(self, v: T) -> Self
where T: Into<f64>,

pub fn set_or_clear_top_p<T>(self, v: Option<T>) -> Self
where T: Into<f64>,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T> PolicyExt for T
where T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,