Expand description
Matrix-free operations for quantized tensors
This module provides matrix-free operations for quantized tensors, enabling efficient memory usage and computation for large models. It combines the benefits of quantization (reduced memory footprint) with matrix-free operations (no need to materialize large matrices).
Structs§
- Quantized
Matrix Free Op - A matrix-free operator that represents a quantized matrix
Functions§
- quantized_
to_ linear_ operator - Convert a QuantizedMatrixFreeOp to a generic LinearOperator
Type Aliases§
- MatVec
Fn - Type alias for the matrix-vector product function