Struct rten::Model

pub struct Model { /* private fields */ }

The central type used to execute RTen machine learning models.

Models are loaded from .rten format model files using Model::load and executed using Model::run or one of the other run_* methods. They take a list of tensor views as inputs, perform a series of computations and return one or more output tensors. .rten models use FlatBuffers and are conceptually similar to the .ort format used by ONNX Runtime and .tflite used by TensorFlow Lite.

RTen models are logically graphs consisting of three types of nodes:

Values, which are supplied or generated at runtime.
Constants, which are the weights, biases and other parameters of the model. Their values are determined when the model is trained.
Operators, which combine the values and constants using operations such as matrix multiplication and convolution.

Some of these nodes are designated as inputs and outputs. The IDs of these nodes can be obtained using Model::input_ids and Model::output_ids. These IDs are then used when calling Model::run. Model execution consists of generating a plan which starts with the input nodes, and executes the necessary operators to generate the requested outputs.
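The graph structure and the output-driven execution described above can be illustrated with a toy evaluator. This is a minimal sketch using only the standard library, not RTen's implementation: `ToyGraph`, `Node`, `eval`, `example_graph` and the scalar `f32` values are all hypothetical stand-ins for RTen's tensors and internal types. It shows how requesting an output runs only the operators that output depends on.

```rust
use std::collections::HashMap;

/// Index of a node in the toy graph (a stand-in for an opaque node ID).
type NodeId = usize;

/// The three node kinds, reduced to scalars for brevity.
enum Node {
    /// Supplied by the caller or produced by an operator at runtime.
    Value,
    /// Fixed parameter, such as a trained weight.
    Constant(f32),
    /// Combines two other nodes and writes its result to `output`.
    Operator { op: fn(f32, f32) -> f32, inputs: [NodeId; 2], output: NodeId },
}

struct ToyGraph {
    nodes: Vec<Node>,
}

impl ToyGraph {
    /// Compute `target`, recursively running only the operators it needs.
    fn eval(&self, target: NodeId, values: &mut HashMap<NodeId, f32>) -> Option<f32> {
        if let Some(v) = values.get(&target) {
            return Some(*v); // already supplied as an input or computed earlier
        }
        match self.nodes.get(target)? {
            Node::Constant(c) => Some(*c),
            Node::Operator { .. } => None, // operators are not values themselves
            Node::Value => {
                // Find the operator that produces this value, if any.
                let (op, inputs) = self.nodes.iter().find_map(|n| match n {
                    Node::Operator { op, inputs, output } if *output == target => {
                        Some((*op, *inputs))
                    }
                    _ => None,
                })?;
                let a = self.eval(inputs[0], values)?;
                let b = self.eval(inputs[1], values)?;
                let result = op(a, b);
                values.insert(target, result);
                Some(result)
            }
        }
    }

    /// Toy analogue of a `run` call: (node, value) inputs and requested outputs.
    /// Returns `None` if a required input is missing.
    fn run(&self, inputs: &[(NodeId, f32)], outputs: &[NodeId]) -> Option<Vec<f32>> {
        let mut values: HashMap<NodeId, f32> = inputs.iter().copied().collect();
        outputs.iter().map(|&id| self.eval(id, &mut values)).collect()
    }
}

fn mul(a: f32, b: f32) -> f32 { a * b }
fn add(a: f32, b: f32) -> f32 { a + b }

/// Builds the graph for y = x * w + b with w = 2 and b = 1.
fn example_graph() -> ToyGraph {
    ToyGraph {
        nodes: vec![
            Node::Value,         // 0: x (input)
            Node::Constant(2.0), // 1: w
            Node::Constant(1.0), // 2: b
            Node::Value,         // 3: x * w (intermediate)
            Node::Value,         // 4: y (output)
            Node::Operator { op: mul, inputs: [0, 1], output: 3 },
            Node::Operator { op: add, inputs: [3, 2], output: 4 },
        ],
    }
}
```

In this sketch, `example_graph().run(&[(0, 3.0)], &[4])` evaluates both operators, whereas requesting node `3` runs only the multiplication. A real runtime would typically build the plan once up front rather than searching the node list recursively.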

§Partial evaluation

Some models, such as transformer decoders, are evaluated repeatedly in a loop. If such models have inputs which are constant in each iteration of the loop, execution can be sped up by using partial evaluation. This involves evaluating the part of the graph that depends only on the constant inputs once, outside the loop. To do this use Model::partial_run.
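The idea behind partial evaluation can be sketched independently of RTen. In the toy example below, `ToyDecoder`, `encode`, `step`, `run_naive` and `run_partial` are hypothetical names, and scalars stand in for tensors. `encode` represents the part of the graph that depends only on the loop-invariant input, and the call counter makes the saved work visible.

```rust
use std::cell::Cell;

/// Toy "model" whose output is step(encode(prompt), token).
struct ToyDecoder {
    /// Counts how often the expensive, loop-invariant part is evaluated.
    encode_calls: Cell<usize>,
}

impl ToyDecoder {
    fn new() -> Self {
        ToyDecoder { encode_calls: Cell::new(0) }
    }

    /// Depends only on the input that stays constant across iterations.
    fn encode(&self, prompt: &[f32]) -> f32 {
        self.encode_calls.set(self.encode_calls.get() + 1);
        prompt.iter().map(|x| x * x).sum()
    }

    /// Depends on the per-iteration input and the encoded prompt.
    fn step(&self, encoded: f32, token: f32) -> f32 {
        encoded + token
    }

    /// Full evaluation on every iteration: `encode` runs once per token.
    fn run_naive(&self, prompt: &[f32], tokens: &[f32]) -> Vec<f32> {
        tokens.iter().map(|&t| self.step(self.encode(prompt), t)).collect()
    }

    /// Partial evaluation: the invariant part is hoisted out of the loop.
    fn run_partial(&self, prompt: &[f32], tokens: &[f32]) -> Vec<f32> {
        let encoded = self.encode(prompt); // evaluated once, outside the loop
        tokens.iter().map(|&t| self.step(encoded, t)).collect()
    }
}
```

Both methods produce the same outputs, but `run_partial` calls `encode` once regardless of the number of tokens. Judging by its signature, `Model::partial_run` returns `(NodeId, Output)` pairs, which play a role similar to the precomputed `encoded` value here; how those results are passed to later runs is not covered by this sketch.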

§Custom operator registries

By default all supported ONNX operators are available for use by the model. You can reduce binary size and compilation time by loading a model with only a subset of operators enabled. See Model::load_with_ops.
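To illustrate the general idea of an operator registry, here is a minimal standard-library sketch. It is not RTen's `OpRegistry` API: `ToyRegistry`, `ToyLoadError` and this free-standing `load_with_ops` function are hypothetical, and the behavior on a missing operator (failing at load time) is an assumption of the toy. The point is that only operators explicitly registered are referenced by the program, so unused ones need not be compiled in.

```rust
use std::collections::HashMap;

/// Toy operator: a binary function on scalars.
type OpFn = fn(f32, f32) -> f32;

/// Maps operator names to implementations. Only registered operators exist.
#[derive(Default)]
struct ToyRegistry {
    ops: HashMap<&'static str, OpFn>,
}

impl ToyRegistry {
    fn register(&mut self, name: &'static str, op: OpFn) {
        self.ops.insert(name, op);
    }
}

#[derive(Debug, PartialEq)]
enum ToyLoadError {
    /// The model uses an operator that was not registered.
    UnsupportedOperator(String),
}

/// Resolve each operator name used by a "model" against the registry.
/// Fails on the first operator that is not available.
fn load_with_ops(op_names: &[&str], registry: &ToyRegistry) -> Result<Vec<OpFn>, ToyLoadError> {
    op_names
        .iter()
        .map(|name| {
            registry
                .ops
                .get(*name)
                .copied()
                .ok_or_else(|| ToyLoadError::UnsupportedOperator(name.to_string()))
        })
        .collect()
}

fn add(a: f32, b: f32) -> f32 { a + b }
fn mul(a: f32, b: f32) -> f32 { a * b }
```

A registry holding only `add` and `mul` can load a model that uses those two operators, while a model that needs anything else is rejected before execution starts.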

Implementations§

impl Model

pub fn load(data: &[u8]) -> Result<Model, ModelLoadError>

pub fn load_with_ops( data: &[u8], registry: &OpRegistry ) -> Result<Model, ModelLoadError>

pub fn find_node(&self, id: &str) -> Option<NodeId>

pub fn node_id(&self, id: &str) -> Result<NodeId, RunError>

pub fn node_info(&self, id: NodeId) -> Option<NodeInfo<'_>>

pub fn metadata(&self) -> &ModelMetadata

pub fn input_ids(&self) -> &[NodeId]

pub fn output_ids(&self) -> &[NodeId]

pub fn total_params(&self) -> usize

pub fn input_shape(&self, index: usize) -> Option<Vec<Dimension>>

pub fn run( &self, inputs: &[(NodeId, Input<'_>)], outputs: &[NodeId], opts: Option<RunOptions> ) -> Result<Vec<Output>, RunError>

pub fn run_n<const N: usize>( &self, inputs: &[(NodeId, Input<'_>)], outputs: [NodeId; N], opts: Option<RunOptions> ) -> Result<[Output; N], RunError>

pub fn run_one( &self, input: Input<'_>, opts: Option<RunOptions> ) -> Result<Output, RunError>

pub fn partial_run( &self, inputs: &[(NodeId, Input<'_>)], outputs: &[NodeId], opts: Option<RunOptions> ) -> Result<Vec<(NodeId, Output)>, RunError>

Auto Trait Implementations§

impl Freeze for Model

impl !RefUnwindSafe for Model

impl Send for Model

impl Sync for Model

impl Unpin for Model

impl !UnwindSafe for Model

Blanket Implementations§

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self> where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize = _

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>
