pub fn turboquant_quantize(
x: &[f32],
config: &TurboQuantConfig,
) -> Result<(Vec<u8>, f32)>Expand description
Quantize a single head vector using TurboQuant_mse.
Steps:
- Apply FWHT (Walsh-Hadamard rotation) for incoherence
- Extract L2 norm
- Normalize to unit vector
- Quantize each coordinate against the appropriate Lloyd-Max codebook
- Pack indices
§Arguments
x— input vector of lengthconfig.head_dimconfig— quantization configuration
§Returns
(packed_indices, norm) on success.