Expand description
§Ferrum Runtime
Device runtime and compute backend implementations for LLM inference.
Re-exports§
Modules§
- backends
- Backend implementations for different compute devices
- memory
- Device memory management implementations
Structs§
- Default
Backend Registry - Default backend registry
Enums§
Traits§
- Compute
Backend - Compute backend for tensor operations and kernel execution
- Device
Memory Manager - Device memory manager for raw memory operations
- Kernel
Executor - Kernel executor for custom GPU kernels
- Tensor
Factory - Tensor factory for creating tensors on specific backends
- Tensor
Like - Core tensor trait for zero-copy, device-aware operations
- Tensor
Ops - Basic tensor operations
Functions§
- global_
backend_ registry - Get the global backend registry