pub fn quantize<R, F, I>(
    tensor: CubeTensor<R>,
    scheme: &QuantizationScheme,
    scale: CubeTensor<R>,
) -> CubeTensor<R>
Convert the tensor to a lower precision data type based on the quantization scheme and parameters.
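To illustrate what "lower precision based on a scale" can mean in practice, here is a minimal sketch of symmetric per-tensor quantization from `f32` to `i8` on plain slices. It does not use `CubeTensor` or `QuantizationScheme`, and it is not this crate's implementation: the helper names `symmetric_scale` and `quantize_symmetric_i8`, the `i8` target type, and the `[-127, 127]` range are all assumptions chosen for the example.

```rust
/// Hypothetical helper (not part of this crate): choose a scale so that the
/// largest absolute value in `values` maps to 127.
fn symmetric_scale(values: &[f32]) -> f32 {
    let max_abs = values.iter().fold(0.0_f32, |m, &x| m.max(x.abs()));
    // Fall back to 1.0 for an all-zero input to avoid dividing by zero later.
    if max_abs == 0.0 {
        1.0
    } else {
        max_abs / 127.0
    }
}

/// Hypothetical helper (not part of this crate): divide each value by `scale`,
/// round to the nearest integer, and clamp to the symmetric i8 range.
fn quantize_symmetric_i8(values: &[f32], scale: f32) -> Vec<i8> {
    values
        .iter()
        .map(|&x| (x / scale).round().clamp(-127.0, 127.0) as i8)
        .collect()
}
```

Under these assumptions, a caller would compute `scale` once from the input, pass it to `quantize_symmetric_i8`, and recover approximate values later by multiplying each `i8` by the same `scale`.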