Backend abstraction split into compute and weight loading concerns
This module separates the previous “fat” Backend trait into focused interfaces: ComputeBackend for tensor operations and WeightLoader for model weight management.
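The split described above can be sketched roughly as follows. This is a minimal illustration, not the module's real API: the trait names `ComputeBackend` and `WeightLoader` come from this page, but the methods `add` and `load`, the types `CpuCompute` and `InMemoryWeights`, and the helper `apply_bias` are all assumptions invented for the example.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the module's `ComputeBackend` trait.
/// The `add` method is an assumption made for illustration only.
trait ComputeBackend {
    fn add(&self, a: &[f32], b: &[f32]) -> Vec<f32>;
}

/// Hypothetical stand-in for the module's `WeightLoader` trait.
/// The `load` method is an assumption made for illustration only.
trait WeightLoader {
    fn load(&self, name: &str) -> Option<Vec<f32>>;
}

/// Illustrative compute implementation that runs on the CPU.
struct CpuCompute;

impl ComputeBackend for CpuCompute {
    fn add(&self, a: &[f32], b: &[f32]) -> Vec<f32> {
        // Element-wise sum; stops at the shorter of the two slices.
        a.iter().zip(b).map(|(x, y)| x + y).collect()
    }
}

/// Illustrative weight source that keeps tensors in a map.
struct InMemoryWeights {
    tensors: HashMap<String, Vec<f32>>,
}

impl WeightLoader for InMemoryWeights {
    fn load(&self, name: &str) -> Option<Vec<f32>> {
        self.tensors.get(name).cloned()
    }
}

/// Code that needs both concerns takes them as two separate parameters,
/// so either side can be swapped without touching the other.
fn apply_bias(
    compute: &dyn ComputeBackend,
    weights: &dyn WeightLoader,
    input: &[f32],
) -> Option<Vec<f32>> {
    let bias = weights.load("bias")?;
    Some(compute.add(input, &bias))
}
```

With a `"bias"` entry of `[0.5, 0.5]` in `InMemoryWeights`, calling `apply_bias(&CpuCompute, &weights, &[1.0, 2.0])` yields `Some(vec![1.5, 2.5])`. The point of the sketch is only that compute and weight loading no longer have to be implemented by the same type.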
Structs
- BackendCapabilities - Backend capabilities description
- BackendConfig - Backend configuration
- BackendMemoryConfig - Memory configuration for backends
- BackendRequirements - Requirements for backend selection
- BackendStatus - Backend status information
- KernelHandle - Handle for loaded kernel
- KernelInfo - Kernel information and metadata
- TensorSpec - Tensor specification for weight loading
- WeightLoaderCapabilities - Weight loader capabilities
- WeightLoaderConfig - Weight loader configuration
- WeightMetadata - Weight metadata information
Enums
- BackendType - Backend types
- KernelArg - Kernel argument types
- MemoryGrowthStrategy - Memory growth strategies
- QuantizationConfig - Quantization configuration for weights
- ScalarValue - Scalar values for kernel arguments
- TensorTransformation - Transformations that can be applied to loaded tensors
- TransformationType - Transformation types
- WeightFormat - Weight file formats
- WeightSource - Weight source specification
- WeightSourceType - Weight source types
Traits
- BackendFactory - Backend factory for creating backend instances
- BackendRegistry - Backend registry for managing multiple backends
- ComputeBackend - Compute backend for tensor operations and kernel execution
- KernelExecutor - Kernel executor for custom GPU kernels
- WeightLoader - Weight loading interface for model parameter management
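
One way the factory and registry traits listed above might fit together is sketched below. Only the trait names `BackendFactory`, `BackendRegistry`, and `ComputeBackend` are taken from this page; every method signature, along with `CpuBackend`, `CpuFactory`, and `MapRegistry`, is a hypothetical placeholder, and the real traits may look quite different.

```rust
use std::collections::HashMap;

/// Hypothetical minimal `ComputeBackend`; `name` is an assumed method.
trait ComputeBackend {
    fn name(&self) -> String;
}

/// Hypothetical `BackendFactory`: builds one kind of backend on demand.
trait BackendFactory {
    fn create(&self) -> Box<dyn ComputeBackend>;
}

/// Hypothetical `BackendRegistry`: maps a key to a factory.
trait BackendRegistry {
    fn register(&mut self, key: &str, factory: Box<dyn BackendFactory>);
    fn create(&self, key: &str) -> Option<Box<dyn ComputeBackend>>;
}

/// Illustrative backend and its factory.
struct CpuBackend;

impl ComputeBackend for CpuBackend {
    fn name(&self) -> String {
        "cpu".to_string()
    }
}

struct CpuFactory;

impl BackendFactory for CpuFactory {
    fn create(&self) -> Box<dyn ComputeBackend> {
        Box::new(CpuBackend)
    }
}

/// Illustrative registry backed by a `HashMap`.
#[derive(Default)]
struct MapRegistry {
    factories: HashMap<String, Box<dyn BackendFactory>>,
}

impl BackendRegistry for MapRegistry {
    fn register(&mut self, key: &str, factory: Box<dyn BackendFactory>) {
        self.factories.insert(key.to_string(), factory);
    }

    fn create(&self, key: &str) -> Option<Box<dyn ComputeBackend>> {
        // Look up the factory and ask it for a fresh backend instance.
        self.factories.get(key).map(|f| f.create())
    }
}
```

In this sketch, registering `CpuFactory` under `"cpu"` lets callers obtain a backend by key, while a lookup for an unregistered key such as `"cuda"` returns `None`.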