quantized_matmul

Function quantized_matmul 

Source
pub fn quantized_matmul(
    a: &QuantizedMatrix,
    a_params: &QuantizationParams,
    b: &QuantizedMatrix,
    b_params: &QuantizationParams,
) -> LinalgResult<Array2<f32>>
Expand description

Perform matrix multiplication with two quantized matrices

§Arguments

  • a - The first quantized matrix
  • a_params - Quantization parameters for the first matrix
  • b - The second quantized matrix
  • b_params - Quantization parameters for the second matrix

§Returns

The result of the matrix multiplication in floating-point