pub fn quantize_per_tensor(tensor: &Tensor, scale: f32, zero_point: i32, _dtype: DType) -> Result<Tensor>
Quantizes a tensor to INT8 using the specified scale and zero point.
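The arithmetic behind this signature can be sketched as follows. This is a minimal, self-contained illustration assuming the standard affine quantization formula `q = clamp(round(x / scale) + zero_point, -128, 127)`; it uses a plain `&[f32]` slice as a stand-in for the library's `Tensor` type, and the helper name is hypothetical:

```rust
/// Sketch of per-tensor INT8 quantization on a flat slice of f32 values.
/// Every element shares one (scale, zero_point) pair -- hence "per tensor".
fn quantize_per_tensor_sketch(values: &[f32], scale: f32, zero_point: i32) -> Vec<i8> {
    values
        .iter()
        .map(|&x| {
            // Affine quantization: divide by scale, round to nearest,
            // shift by the zero point, then clamp to the i8 range.
            let q = (x / scale).round() as i32 + zero_point;
            q.clamp(i8::MIN as i32, i8::MAX as i32) as i8
        })
        .collect()
}

fn main() {
    // With scale = 0.5 and zero_point = 0, each step of 0.5 maps to one
    // integer step, so 1.0 quantizes to 2 and -1.0 to -2.
    let data = [0.0_f32, 0.5, 1.0, -1.0];
    let q = quantize_per_tensor_sketch(&data, 0.5, 0);
    println!("{:?}", q); // [0, 1, 2, -2]
}
```

A nonzero `zero_point` shifts the representable range, which lets an asymmetric float range (e.g. activations after a ReLU) use all 256 INT8 codes instead of only the positive half.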