Module quantized_matrixfree

Expand description

Matrix-free operations for quantized tensors

This module provides matrix-free operations for quantized tensors, enabling efficient memory usage and computation for large models. It combines the benefits of quantization (reduced memory footprint) with matrix-free operations (no need to materialize large matrices).