Expand description
Quantization operations
Structs§
- QParams
- Quantization parameters.
- QParams
Compilation Arg - QParams
Expand - QParams
Launch
Functions§
- dequantize
- Convert the tensor back to a higher precision data type.
- quantize
- Convert the tensor to a lower precision data type based on the quantization scheme and parameters.
Type Aliases§
- QTensor
- Quantized tensor representation.