pub enum ModelInstance<B: Backend> {
Baseline(Box<BaselineTransformer<B>>),
Ddl(Box<DdlTransformer<B>>),
}Variants§
Baseline(Box<BaselineTransformer<B>>)
Ddl(Box<DdlTransformer<B>>)
Implementations§
Source§impl<B: Backend> ModelInstance<B>
impl<B: Backend> ModelInstance<B>
pub fn num_params(&self) -> usize
pub fn forward_logits( &self, input_ids: Tensor<B, 2, Int>, mask: Option<&Tensor<B, 3>>, ) -> Tensor<B, 3>
pub fn forward_with_optional_diagnostics( &self, input_ids: Tensor<B, 2, Int>, mask: Option<&Tensor<B, 3>>, ) -> ModelOutput<B>
pub fn forward_with_diagnostics( &self, input_ids: Tensor<B, 2, Int>, mask: Option<&Tensor<B, 3>>, diagnostic_level: DiagnosticLevel, ) -> ModelOutput<B>
pub fn max_seq_len(&self) -> usize
pub fn generate( &self, prompt_tokens: &[usize], generation_config: &GenerationConfig, device: &B::Device, ) -> Result<GenerationResult, GenerationError>
Trait Implementations§
Source§impl<B: Backend> AutoregressiveModel<B> for ModelInstance<B>
impl<B: Backend> AutoregressiveModel<B> for ModelInstance<B>
Source§impl<B: Clone + Backend> Clone for ModelInstance<B>
impl<B: Clone + Backend> Clone for ModelInstance<B>
Source§fn clone(&self) -> ModelInstance<B>
fn clone(&self) -> ModelInstance<B>
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreAuto Trait Implementations§
impl<B> Freeze for ModelInstance<B>
impl<B> !RefUnwindSafe for ModelInstance<B>
impl<B> Send for ModelInstance<B>
impl<B> !Sync for ModelInstance<B>
impl<B> Unpin for ModelInstance<B>
impl<B> UnsafeUnpin for ModelInstance<B>
impl<B> !UnwindSafe for ModelInstance<B>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more