ferrum-quantization 0.7.5

Weight-format abstraction (Dense / GPTQ / AWQ / GGUF) for Ferrum models
Documentation
1
2
3
4
5
6
7
//! Re-export of `Linear` trait (canonical home: ferrum-kernels).
//!
//! The trait itself lives in `ferrum-kernels::linear` so that Backend-level
//! helpers (`layer_forward_fused`) can reference it without ferrum-kernels
//! depending on this crate (which would be circular).

pub use ferrum_kernels::Linear;