pub fn quantized_matmul(
a: &QuantizedMatrix,
a_params: &QuantizationParams,
b: &QuantizedMatrix,
b_params: &QuantizationParams,
) -> LinalgResult<Array2<f32>>
Expand description
Perform matrix multiplication with two quantized matrices
§Arguments
a
- The first quantized matrixa_params
- Quantization parameters for the first matrixb
- The second quantized matrixb_params
- Quantization parameters for the second matrix
§Returns
The result of the matrix multiplication in floating-point