Re-exports§
pub use async_session::AsyncBatchProcessor;pub use async_session::AsyncSession;pub use batch::BatchConfig;pub use batch::BatchProcessor;pub use batch::BatchRequest;pub use batch::BatchStats;pub use batch::BatchStrategy;
Modules§
- async_
session - Async inference API for non-blocking, high-throughput inference.
- batch
- Batch processing for high-throughput inference
- prelude
- Prelude module for convenient imports
Structs§
- Inference
Session - An inference session for running a model
- Model
- Represents a loaded ML model
- Session
Builder - Builder for creating inference sessions
- Session
Options - Options for configuring an inference session
- Tensor
- Enhanced Tensor implementation with Candle backend.
Enums§
- Data
Type - Supported data types for tensors and operations.
- Error
- Optimization
Level - Optimization levels similar to compiler optimization levels
Traits§
- Execution
Provider - Core execution provider interface.
Type Aliases§
- Provider
Type - Alias for provider types (same as ProviderId)
- Result