InferenceParameters

Struct InferenceParameters

pub struct InferenceParameters {
    pub n_threads: usize,
    pub n_batch: usize,
    pub top_k: usize,
    pub top_p: f32,
    pub repeat_penalty: f32,
    pub temperature: f32,
    pub bias_tokens: TokenBias,
    pub repetition_penalty_last_n: usize,
}

Expand description

The parameters for text generation.

This needs to be provided during all inference calls, but can be changed between calls.

Fields§

§n_threads: usize

The number of threads to use.

§n_batch: usize

Controls batch/chunk size for prompt ingestion in InferenceSession::feed_prompt.

§top_k: usize

The top K words by score are kept during sampling.

§top_p: f32

The cumulative probability after which no more words are kept for sampling.

§repeat_penalty: f32

The penalty for repeating tokens. Higher values make the generation less likely to get into a loop, but may harm results when repetitive outputs are desired.

§temperature: f32

Temperature (randomness) used for sampling. A higher number is more random.

§bias_tokens: TokenBias

A list of tokens to bias against in the process of generation.

§repetition_penalty_last_n: usize

The number of tokens to consider for the repetition penalty.

Trait Implementations§

impl Clone for InferenceParameters

fn clone(&self) -> InferenceParameters

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for InferenceParameters

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl Default for InferenceParameters

fn default() -> Self

Returns the “default value” for a type. Read more

impl PartialEq for InferenceParameters

fn eq(&self, other: &InferenceParameters) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl StructuralPartialEq for InferenceParameters

Auto Trait Implementations§

impl Freeze for InferenceParameters

impl RefUnwindSafe for InferenceParameters

impl Send for InferenceParameters

impl Sync for InferenceParameters

impl Unpin for InferenceParameters

impl UnwindSafe for InferenceParameters

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V