Ferrum unified compute kernels for high-performance inference.
Provides the Backend trait and implementations for CUDA, Metal, and CPU.
On CUDA builds, kernels are compiled to PTX during cargo build and loaded
on demand at runtime.
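As a rough illustration of the two ideas above (one trait with several implementations, and kernels loaded on first use), here is a minimal sketch. It is not the crate's actual API: `DemoBackend`, `CpuDemo`, `demo_module`, and `demo_load_count` are hypothetical names invented for this example, and the "module" is a placeholder string rather than real PTX.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Hypothetical stand-in for a backend trait. The methods of the real
/// `Backend` trait are not shown on this page, so these are assumptions.
pub trait DemoBackend {
    fn name(&self) -> &'static str;
    /// Computes y = W x, where `w` is a row-major `rows` x `cols` matrix.
    fn matvec(&self, w: &[f32], x: &[f32], rows: usize, cols: usize) -> Vec<f32>;
}

/// Plain CPU implementation of the hypothetical trait.
pub struct CpuDemo;

impl DemoBackend for CpuDemo {
    fn name(&self) -> &'static str {
        "cpu"
    }

    fn matvec(&self, w: &[f32], x: &[f32], rows: usize, cols: usize) -> Vec<f32> {
        assert_eq!(w.len(), rows * cols, "weight length must be rows * cols");
        assert_eq!(x.len(), cols, "input length must equal cols");
        if cols == 0 {
            // An empty dot product is zero for every row.
            return vec![0.0; rows];
        }
        w.chunks_exact(cols)
            .map(|row| row.iter().zip(x).map(|(a, b)| a * b).sum::<f32>())
            .collect()
    }
}

// Counts how many times the placeholder module was actually initialized.
static LOADS: AtomicUsize = AtomicUsize::new(0);
// Caches the placeholder module after the first request.
static MODULE: OnceLock<String> = OnceLock::new();

/// Returns the placeholder kernel module, initializing it on first call only.
/// A real CUDA backend would hand PTX to the driver here; this sketch does not.
pub fn demo_module() -> &'static str {
    MODULE.get_or_init(|| {
        LOADS.fetch_add(1, Ordering::SeqCst);
        String::from("// placeholder kernel text")
    })
}

/// Number of times the placeholder module has been initialized.
pub fn demo_load_count() -> usize {
    LOADS.load(Ordering::SeqCst)
}
```

Calling `demo_module()` repeatedly initializes the placeholder only once, which is the general shape of on-demand loading; code written against `&dyn DemoBackend` would not need to know which implementation it was given.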
Re-exports
pub use linear::Linear;