Skip to main content

TensorRtBackend

rave_tensorrt::tensorrt

Struct TensorRtBackend

pub struct TensorRtBackend {
    pub inference_metrics: InferenceMetrics,
    pub precision_policy: PrecisionPolicy,
    pub batch_config: BatchConfig,
    /* private fields */
}

Expand description

TensorRT/CUDA ORT inference backend.

Implements UpscaleBackend using ONNX Runtime with a TensorRT or CUDA execution provider. Output buffers are managed via a fixed-size OutputRing to avoid per-frame allocation.

Fields§

§inference_metrics: InferenceMetrics

Atomic inference latency and frame count metrics.

§precision_policy: PrecisionPolicy

Precision policy used when building the TensorRT EP session.

§batch_config: BatchConfig

Batch configuration (must have max_batch = 1 — batching not yet implemented).

Implementations§

impl TensorRtBackend

pub fn new( model_path: PathBuf, ctx: Arc<GpuContext>, device_id: i32, ring_size: usize, downstream_capacity: usize, ) -> Self

Create a new backend instance.

§Parameters

ring_size: number of output ring slots to pre-allocate.
downstream_capacity: the bounded channel capacity between inference and the encoder. Ring size is validated ≥ downstream_capacity + 2.

pub fn with_precision( model_path: PathBuf, ctx: Arc<GpuContext>, device_id: i32, ring_size: usize, downstream_capacity: usize, precision_policy: PrecisionPolicy, batch_config: BatchConfig, ) -> Self

Create with explicit precision policy and batch config.

pub async fn ring_metrics(&self) -> Option<RingMetricsSnapshot>

Access ring metrics (if initialized).

pub fn selected_provider(&self) -> Option<&str>

Active ORT execution provider selected during initialization.

Trait Implementations§

impl Drop for TensorRtBackend

fn drop(&mut self)

Executes the destructor for this type. Read more

impl UpscaleBackend for TensorRtBackend

fn initialize<'life0, 'async_trait>( &'life0 self, ) -> Pin<Box<dyn Future<Output = Result<()>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

Warm up the backend: load the model, allocate buffers, build engine plans.

fn process<'life0, 'async_trait>( &'life0 self, input: GpuTexture, ) -> Pin<Box<dyn Future<Output = Result<GpuTexture>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

Run a single upscale pass on the given GPU texture, returning the upscaled output.

fn shutdown<'life0, 'async_trait>( &'life0 self, ) -> Pin<Box<dyn Future<Output = Result<()>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait,

Flush any pending work, synchronize streams, and release GPU resources.

fn metadata(&self) -> Result<&ModelMetadata>

Return the model metadata extracted during initialize.

Auto Trait Implementations§

impl !Freeze for TensorRtBackend

impl !RefUnwindSafe for TensorRtBackend

impl Send for TensorRtBackend

impl Sync for TensorRtBackend

impl Unpin for TensorRtBackend

impl UnsafeUnpin for TensorRtBackend

impl !UnwindSafe for TensorRtBackend

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more