Skip to main content

TensorRtBackend

rave_tensorrt::tensorrt

Struct TensorRtBackend

pub struct TensorRtBackend {
    pub inference_metrics: InferenceMetrics,
    pub precision_policy: PrecisionPolicy,
    pub batch_config: BatchConfig,
    /* private fields */
}

Fields§

§inference_metrics: InferenceMetrics§precision_policy: PrecisionPolicy

Phase 8: precision policy for TRT EP.

§batch_config: BatchConfig

Phase 8: batch configuration.

Implementations§

impl TensorRtBackend

pub fn new( model_path: PathBuf, ctx: Arc<GpuContext>, device_id: i32, ring_size: usize, downstream_capacity: usize, ) -> Self

Create a new backend instance.

§Parameters

ring_size: number of output ring slots to pre-allocate.
downstream_capacity: the bounded channel capacity between inference and the encoder. Ring size is validated ≥ downstream_capacity + 2.

pub fn with_precision( model_path: PathBuf, ctx: Arc<GpuContext>, device_id: i32, ring_size: usize, downstream_capacity: usize, precision_policy: PrecisionPolicy, batch_config: BatchConfig, ) -> Self

Create with explicit precision policy and batch config.

pub async fn ring_metrics(&self) -> Option<(u64, u64, u64)>

Access ring metrics (if initialized).

Trait Implementations§

impl Drop for TensorRtBackend

fn drop(&mut self)

Executes the destructor for this type. Read more

impl UpscaleBackend for TensorRtBackend

fn initialize<'life0, 'async_trait>( &'life0 self, ) -> Pin<Box<dyn Future<Output = Result<()>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

fn process<'life0, 'async_trait>( &'life0 self, input: GpuTexture, ) -> Pin<Box<dyn Future<Output = Result<GpuTexture>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

fn shutdown<'life0, 'async_trait>( &'life0 self, ) -> Pin<Box<dyn Future<Output = Result<()>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

fn metadata(&self) -> Result<&ModelMetadata>

Auto Trait Implementations§

impl !Freeze for TensorRtBackend

impl !RefUnwindSafe for TensorRtBackend

impl Send for TensorRtBackend

impl Sync for TensorRtBackend

impl Unpin for TensorRtBackend

impl UnsafeUnpin for TensorRtBackend

impl !UnwindSafe for TensorRtBackend

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more