Function quantize

pub fn quantize<R>(
    tensor: CubeTensor<R>,
    scheme: &QuantScheme,
    scale: CubeTensor<R>,
) -> CubeTensor<R>
where
    R: CubeRuntime,

Convert the tensor to a lower precision data type based on the quantization scheme and parameters.
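As a rough illustration of what scale-based quantization means, here is a minimal CPU sketch of symmetric per-tensor quantization to `i8`. It is not this crate's kernel and does not use `CubeTensor` or `QuantScheme`. The helper names (`quantize_symmetric_i8`, `dequantize_symmetric_i8`, `scale_for`), the `i8` target type, and the symmetric range of [-127, 127] are all assumptions made for the example. The actual behavior depends on the `scheme` passed to `quantize`.

```rust
/// Hypothetical helper (not part of the crate): pick a per-tensor scale so that
/// the largest absolute value maps to 127. Falls back to 1.0 for all-zero input.
fn scale_for(values: &[f32]) -> f32 {
    let max_abs = values.iter().fold(0.0f32, |m, &x| m.max(x.abs()));
    if max_abs == 0.0 {
        1.0
    } else {
        max_abs / 127.0
    }
}

/// Hypothetical helper: symmetric quantization, q = clamp(round(x / scale), -127, 127).
fn quantize_symmetric_i8(values: &[f32], scale: f32) -> Vec<i8> {
    values
        .iter()
        .map(|&x| (x / scale).round().clamp(-127.0, 127.0) as i8)
        .collect()
}

/// Hypothetical helper: approximate inverse, x ≈ q * scale.
fn dequantize_symmetric_i8(quantized: &[i8], scale: f32) -> Vec<f32> {
    quantized.iter().map(|&q| q as f32 * scale).collect()
}
```

In this sketch the scale is supplied separately from the values, mirroring how `quantize` takes `scale` as its own argument. Values outside the representable range saturate rather than wrap.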
