Skip to main content

MultiscreenModel

Struct MultiscreenModel 

Source
pub struct MultiscreenModel<B: Backend = DefaultAutodiffBackend> { /* private fields */ }
Expand description

Burn-backed neural Multiscreen language model ported from multiscreen-testing.

Implementations§

Source§

impl<B: Backend> MultiscreenModel<B>

Source

pub fn new(config: MultiscreenModelConfig, device: &B::Device) -> Result<Self>

Source

pub fn config(&self) -> &MultiscreenModelConfig

Source

pub fn parameter_count(&self) -> usize

Source

pub fn forward(&self, tokens: Tensor<B, 2, Int>) -> Tensor<B, 3>

Source

pub fn save_parameters(&self, path: impl AsRef<Path>) -> Result<()>

Source

pub fn load_parameters(&mut self, path: impl AsRef<Path>) -> Result<()>

Source

pub fn infer_tokens_stream( &self, prompt: &[u32], inference: &ModelInferenceConfig, device: &B::Device, on_token: impl FnMut(u32, usize) -> bool, ) -> Result<MultiscreenModelOutput>

Greedy token generation. Generate tokens one at a time, invoking a callback for each newly produced token. This enables streaming / word-by-word output similar to ChatGPT.

The callback receives (token_id, index) where index is the zero-based position of the new token (0 = first generated token). If the callback returns false, generation stops early.

Returns the full output (prompt + generated) token sequence.

Source

pub fn infer_tokens( &self, prompt: &[u32], inference: &ModelInferenceConfig, device: &B::Device, ) -> Result<MultiscreenModelOutput>

Generate tokens and return them all at once (non-streaming).

For streaming / token-by-token output, use Self::infer_tokens_stream.

Source

pub fn predict_next_token( &self, context: &[u32], pad_token_id: u32, device: &B::Device, ) -> Result<u32>

Source

pub fn forward_logits( &self, context: &[u32], pad_token_id: u32, device: &B::Device, ) -> Result<Tensor<B, 3>>

Run a forward pass and return the full logit tensor.

The returned tensor has shape [1, seq_len, vocab_size]. This is useful for sampling-based generation (top-k, temperature, etc.) where you need access to the raw logit values, not just the argmax.

The context is padded/truncated to seq_len automatically.

Source

pub fn evaluate_on_sequences( &self, sequences: &[Vec<u32>], seq_len: usize, batch_size: usize, pad_token_id: u32, device: &B::Device, ) -> Result<EvaluationResult>

Evaluates the model on token sequences, returning average loss, perplexity, and next-token prediction accuracy.

This method works on any Backend (including non-autodiff), which makes it safe to call on an inference-only model without VRAM growth.

Source§

impl<B> MultiscreenModel<B>
where B: AutodiffBackend,

Source

pub fn train_token_sequences( &mut self, sequences: &[Vec<u32>], training: &ModelTrainingConfig, device: &B::Device, on_step: impl FnMut(usize, f32), ) -> Result<ModelTrainingReport>

Trains this model directly on token sequences.

The optional on_step callback is invoked after each optimizer step with (step_index, loss_value). Use it for progress logging, CSV export, etc.

Source

pub fn train_chat_sequences( &mut self, chat_pairs: &[(Vec<u32>, Vec<u32>)], training: &ModelTrainingConfig, device: &B::Device, on_step: impl FnMut(usize, f32), ) -> Result<ModelTrainingReport>

Trains this model on chat-style (prompt, response) token-ID pairs.

This is the chat-aware counterpart of MultiscreenModel::train_token_sequences. The model sees the full context (prompt + response) but loss is computed only on the response tokens, preventing the model from learning to generate role labels like system:, user:, or assistant:.

Each element of chat_pairs is (prompt_token_ids, response_token_ids). The caller is responsible for appending an EOS token to the response IDs when desired — the EOS token will receive loss_mask = 1.0 like any other response token.

Trait Implementations§

Source§

impl<B> AutodiffModule<B> for MultiscreenModel<B>

Source§

type InnerModule = MultiscreenModel<<B as AutodiffBackend>::InnerBackend>

Inner module without auto-differentiation.
Source§

fn valid(&self) -> Self::InnerModule

Returns the same module, but on the inner backend without auto-differentiation.
Source§

fn from_inner(module: Self::InnerModule) -> Self

Wraps an inner module back into an auto-diff module.
Source§

impl<B: Backend> Clone for MultiscreenModel<B>

Source§

fn clone(&self) -> Self

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl<B: Debug + Backend> Debug for MultiscreenModel<B>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<B: Backend> Display for MultiscreenModel<B>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<B> HasAutodiffModule<B> for MultiscreenModel<B::InnerBackend>

Source§

type TrainModule = MultiscreenModel<B>

The module with auto-differentiation.
Source§

impl<B: Backend> Module<B> for MultiscreenModel<B>

Source§

type Record = MultiscreenModelRecord<B>

Type to save and load the module.
Source§

fn load_record(self, record: Self::Record) -> Self

Load the module state from a record.
Source§

fn into_record(self) -> Self::Record

Convert the module into a record containing the state.
Source§

fn num_params(&self) -> usize

Get the number of parameters the module has, including all of its sub-modules.
Source§

fn visit<Visitor: ModuleVisitor<B>>(&self, visitor: &mut Visitor)

Visit each tensor parameter in the module with a visitor.
Source§

fn map<Mapper: ModuleMapper<B>>(self, mapper: &mut Mapper) -> Self

Map each tensor parameter in the module with a mapper.
Source§

fn collect_devices(&self, devices: Devices<B>) -> Devices<B>

Return all the devices found in the underneath module tree added to the given vector without duplicates.
Source§

fn to_device(self, device: &B::Device) -> Self

Move the module and all of its sub-modules to the given device. Read more
Source§

fn fork(self, device: &B::Device) -> Self

Fork the module and all of its sub-modules to the given device. Read more
Source§

fn devices(&self) -> Vec<<B as BackendTypes>::Device>

Return all the devices found in the underneath module tree without duplicates.
Source§

fn no_grad(self) -> Self

Each tensor in the module tree will not require grad. Read more
Source§

fn train<AB>(self) -> Self::TrainModule
where AB: AutodiffBackend<InnerBackend = B>, Self: HasAutodiffModule<AB>,

Move the module and all of its sub-modules to the autodiff backend. Read more
Source§

fn save_file<FR, PB>( self, file_path: PB, recorder: &FR, ) -> Result<(), RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Save the module to a file using the provided file recorder. Read more
Source§

fn load_file<FR, PB>( self, file_path: PB, recorder: &FR, device: &<B as BackendTypes>::Device, ) -> Result<Self, RecorderError>
where FR: FileRecorder<B>, PB: Into<PathBuf>,

Load the module from a file using the provided file recorder. Read more
Source§

fn quantize_weights(self, quantizer: &mut Quantizer) -> Self

Quantize the weights of the module.
Source§

impl<B: Backend> ModuleDisplay for MultiscreenModel<B>

Source§

fn format(&self, passed_settings: DisplaySettings) -> String

Formats the module with provided display settings. Read more
Source§

fn custom_settings(&self) -> Option<DisplaySettings>

Custom display settings for the module. Read more
Source§

fn custom_content(&self, _content: Content) -> Option<Content>

Custom attributes for the module. Read more
Source§

impl<B: Backend> ModuleDisplayDefault for MultiscreenModel<B>

Source§

fn content(&self, content: Content) -> Option<Content>

Attributes of the module used for display purposes. Read more
Source§

fn num_params(&self) -> usize

Gets the number of the parameters of the module.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.