pub struct MemoizingDynamicInferer { /* private fields */ }
The dynamic memoizing batch inferer generates execution plans to fit each batch perfectly, achieving near-perfect performance no matter how much data you have - with a hefty up-front cost for each new batch size.
The dynamic batcher has the highest potential throughput when the amount of data isn’t known up front. By dynamically generating execution plans to fit the exact number of elements in each batch, it gives tract full knowledge of the batch shape for each execution. The downside is that setting up a new plan is fairly costly, so building one for a batch size that is only seen once wastes memory and compute resources.
While plans are cached, a batch size that varies greatly will still cause noticeable latency spikes each time a new plan is generated. If you know you’ll see one or a few batch sizes - but not their exact values - this batcher provides good value and can inform tuning for a fixed batcher later.
If you know some batch sizes but not all, you can preload the batcher with those plans to avoid having to build them at runtime.
§Pros
- Optimal amortized performance
- Requires no tuning for good results
§Cons
- For small amounts of data and large models the spikes can offset amortized gains significantly
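§Example
A minimal construction sketch, assuming tract_onnx is available for loading the model and that InferenceModel and TractResult are tract’s types; the import path for MemoizingDynamicInferer, the model path, and the preloaded sizes are placeholders.
use tract_onnx::prelude::*;
// Import path is an assumption; adjust to wherever this crate exposes the type.
use cervo_core::prelude::MemoizingDynamicInferer;

fn build_inferer() -> TractResult<MemoizingDynamicInferer> {
    // Load an ONNX model with tract (placeholder path).
    let model = tract_onnx::onnx().model_for_path("policy.onnx")?;

    // Preload plans for batch sizes we expect to see often (1 and 4 here),
    // so those sizes don't pay the plan-generation cost at runtime.
    MemoizingDynamicInferer::from_model(model, &[1, 4])
}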
Implementations§
impl MemoizingDynamicInferer
pub fn from_model(
    model: InferenceModel,
    preloaded_sizes: &[usize],
) -> TractResult<Self>
Create an inferer for the provided inference model.
§Errors
Only forwards errors from the tract_core::model::Graph optimization and graph-building steps.
pub fn from_typed(
    model: TypedModel,
    preloaded_sizes: &[usize],
) -> TractResult<Self>
Create an inferer for the provided typed model.
§Errors
Only forwards errors from the tract_core::model::Graph optimization and graph-building steps.
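§Example
A sketch of the typed-model path, under the same assumptions as above (tract_onnx available, placeholder path and sizes); this route is useful when you want to control how the tract graph is typed or optimized before handing it over.
use tract_onnx::prelude::*;
use cervo_core::prelude::MemoizingDynamicInferer; // assumed import path

fn build_from_typed() -> TractResult<MemoizingDynamicInferer> {
    // Load the ONNX graph and convert it to a TypedModel ourselves.
    let typed: TypedModel = tract_onnx::onnx()
        .model_for_path("policy.onnx")?
        .into_typed()?;

    // Preload plans for the batch sizes we expect (placeholders).
    MemoizingDynamicInferer::from_typed(typed, &[2, 8])
}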
Trait Implementations§
impl Inferer for MemoizingDynamicInferer
fn select_batch_size(&self, max_count: usize) -> usize
fn infer_raw(&self, pad: &mut ScratchPadView<'_>) -> Result<(), Error>
fn raw_input_shapes(&self) -> &[(String, Vec<usize>)]
fn raw_output_shapes(&self) -> &[(String, Vec<usize>)]
fn begin_agent(&self, _id: u64)
fn end_agent(&self, _id: u64)
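These Inferer methods are not documented here, so the following is a hypothetical sketch based only on the signatures above: it prints the per-element input/output shapes and asks the inferer how many of the pending elements it would batch. The interpretation of select_batch_size and the Inferer import path are assumptions.
use cervo_core::prelude::Inferer; // assumed import path

fn describe(inferer: &impl Inferer, pending: usize) {
    for (name, shape) in inferer.raw_input_shapes() {
        println!("input  {name}: {shape:?}");
    }
    for (name, shape) in inferer.raw_output_shapes() {
        println!("output {name}: {shape:?}");
    }
    // Assumed semantics: the batch size the inferer prefers to run next,
    // given that `pending` elements are available.
    let batch = inferer.select_batch_size(pending);
    println!("would batch {batch} of {pending} pending elements");
}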
impl IntoStateful for MemoizingDynamicInferer
fn into_stateful<WrapStack: InfererWrapper>(
    self,
    wrapper_stack: WrapStack,
) -> StatefulInferer<WrapStack, Self>
Builds a StatefulInferer by wrapping this concrete inferer with the given wrapper stack.
Auto Trait Implementations§
impl !Freeze for MemoizingDynamicInferer
impl !RefUnwindSafe for MemoizingDynamicInferer
impl Send for MemoizingDynamicInferer
impl Sync for MemoizingDynamicInferer
impl Unpin for MemoizingDynamicInferer
impl !UnwindSafe for MemoizingDynamicInferer
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
impl<T> Downcast for T
where
    T: Any,
fn into_any(self: Box<T>) -> Box<dyn Any>
Converts Box<dyn Trait> (where Trait: Downcast) to Box<dyn Any>. Box<dyn Any> can then be further downcast into Box<ConcreteType> where ConcreteType implements Trait.
fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
Converts Rc<Trait> (where Trait: Downcast) to Rc<Any>. Rc<Any> can then be further downcast into Rc<ConcreteType> where ConcreteType implements Trait.
fn as_any(&self) -> &(dyn Any + 'static)
Converts &Trait (where Trait: Downcast) to &Any. This is needed since Rust cannot generate &Any’s vtable from &Trait’s.
fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
Converts &mut Trait (where Trait: Downcast) to &mut Any. This is needed since Rust cannot generate &mut Any’s vtable from &mut Trait’s.
impl<T> DowncastSync for T
impl<T> InfererExt for T
where
    T: Inferer,
fn with_default_epsilon(self, key: &str) -> Result<EpsilonInjector<Self>>
fn with_epsilon<G: NoiseGenerator>(
    self,
    generator: G,
    key: &str,
) -> Result<EpsilonInjector<Self, G>>
fn into_batched(self) -> Batched<Self>
fn infer(
    &mut self,
    observations: HashMap<u64, State<'_>>,
) -> Result<HashMap<u64, Response<'_>>, Error>
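A hypothetical end-to-end sketch of the InfererExt helpers, assuming Error is anyhow’s error type, that EpsilonInjector itself implements Inferer, and the same assumed import paths as above; the "epsilon" key and the empty observation map are placeholders.
use std::collections::HashMap;
use tract_onnx::prelude::*;
use cervo_core::prelude::{InfererExt, MemoizingDynamicInferer}; // assumed import paths

fn run(model: InferenceModel) -> anyhow::Result<()> {
    // Wrap the batcher so the "epsilon" noise input is filled in for us.
    let mut inferer = MemoizingDynamicInferer::from_model(model, &[1])?
        .with_default_epsilon("epsilon")?;

    // `infer` maps agent ids to observations; an empty map keeps the sketch
    // self-contained. Real callers would insert one State per agent id.
    let observations = HashMap::new();
    let responses = inferer.infer(observations)?;
    assert!(responses.is_empty());
    Ok(())
}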
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true.
Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self> otherwise.