Expand description
Weight loading from SafeTensors files
Re-exports§
pub use gptq_loader::load_gptq_weights;pub use gptq_loader::GptqLayerWeights;pub use gptq_loader::QuantizeConfig;pub use safetensors_loader::SafeTensorsLoader;
Modules§
- gptq_
loader - GPTQ quantized model loader.
- runner_
weights - Generic CUDA decode runner weight loader.
- safetensors_
loader - SafeTensors weight loader with Candle integration