pub struct LlamaContext {
    pub backend: Arc<LlamaLib>,
    pub handle: *mut llama_context,
}
Inference context attached to a model.
Fields

backend: Arc<LlamaLib>
handle: *mut llama_context

Implementations
impl LlamaContext
pub fn new(model: &LlamaModel, params: llama_context_params) -> Result<Self, LlamaError>

Create a new inference context for model with the given params.

pub fn default_params(model: &LlamaModel) -> llama_context_params

Return a default set of context parameters for model.

pub fn decode(&mut self, batch: &LlamaBatch) -> Result<(), LlamaError>

Run the model on batch, updating the KV cache.
pub fn kv_cache_clear(&mut self)
Clear the KV cache for this context. Resets all cached key/value state, allowing the context to be reused for a fresh generation without reallocating.
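The reuse-without-reallocating behavior is analogous to Vec::clear, which drops the elements but keeps the underlying allocation. A minimal standalone sketch of that pattern (plain std Rust; the Vec here is only a stand-in for the context's KV buffers, not the real cache type):

```rust
fn main() {
    // Stand-in for a KV cache: clearing drops the entries but keeps
    // the underlying allocation, so the buffer can be refilled for a
    // fresh generation without reallocating.
    let mut cache: Vec<(i32, f32)> = Vec::with_capacity(1024);
    cache.extend((0..100).map(|i| (i, i as f32)));

    let cap_before = cache.capacity();
    cache.clear(); // cached entries gone...
    assert_eq!(cache.len(), 0);
    assert_eq!(cache.capacity(), cap_before); // ...allocation retained
}
```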
pub fn kv_cache_seq_rm(
    &mut self,
    seq_id: llama_seq_id,
    p0: llama_pos,
    p1: llama_pos,
) -> bool
Remove KV cache entries for sequence seq_id in position range [p0, p1).
If p0 < 0, removes from the beginning. If p1 < 0, removes to the end.
Returns true if the operation succeeded.
This is used for incremental prompt encoding: when the conversation diverges from the cached prefix, only the divergent suffix needs to be removed and re-decoded, avoiding a full KV cache clear.
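The token-comparison half of that incremental scheme is plain Rust and can be sketched standalone. The commented-out follow-up calls are assumptions based on the signatures above, not a tested usage of this crate:

```rust
/// Length of the common prefix between the cached token sequence and
/// the new prompt. Tokens past this point must be removed from the
/// KV cache and re-decoded.
fn common_prefix_len(cached: &[i32], prompt: &[i32]) -> usize {
    cached
        .iter()
        .zip(prompt.iter())
        .take_while(|(a, b)| a == b)
        .count()
}

fn main() {
    let cached = [1, 15043, 29892, 920, 526];
    let prompt = [1, 15043, 29892, 825, 338]; // diverges at index 3

    let keep = common_prefix_len(&cached, &prompt);
    println!("reusable prefix: {keep} tokens"); // prints "reusable prefix: 3 tokens"

    // With a real context, the follow-up would be roughly:
    // ctx.kv_cache_seq_rm(0, keep as llama_pos, -1); // drop divergent suffix
    // ctx.decode(&suffix_batch)?;                    // re-decode only new tokens
}
```

Only the divergent suffix is re-decoded, so appending one user turn to a long conversation costs a handful of tokens instead of the whole prompt.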
Trait Implementations

Auto Trait Implementations
impl Freeze for LlamaContext
impl RefUnwindSafe for LlamaContext
impl Unpin for LlamaContext
impl UnsafeUnpin for LlamaContext
impl UnwindSafe for LlamaContext
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.