Module quantization

Quantization data representation.

Structs

BlockSize
Copyable block size, specialized version of SmallVec.
QParams
The quantization tensor data parameters.
QuantScheme
Describes a quantization scheme/configuration.
QuantizedBytes
Quantized data bytes representation.

Enums

QuantAcc
The precision of accumulating elements.
QuantLevel
Level or granularity of quantization.
QuantMode
Strategy used to quantize values.
QuantParam
Quantization floating-point precision.
QuantPropagation
Specifies whether the output of an operation is quantized using the input's scheme or returned unquantized.
QuantStore
Data type used to store quantized values.
QuantValue
Data type used to represent quantized values.

Functions

pack_i8s_to_u32s
Pack signed 8-bit integer values into a sequence of unsigned 32-bit integers.
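To illustrate the idea of packing, here is a minimal sketch of how four signed 8-bit values can share one unsigned 32-bit word. The names `pack_i8s_sketch` and `unpack_u32s_sketch` are hypothetical, and the byte order (first value in the lowest byte) and zero padding of a short final chunk are assumptions; the crate's actual signature and layout may differ.

```rust
/// Hypothetical helper, not the crate's implementation.
/// Packs up to four i8 values into each u32, first value in the lowest byte
/// (assumed layout). A final chunk shorter than four is padded with zero bytes.
fn pack_i8s_sketch(values: &[i8]) -> Vec<u32> {
    values
        .chunks(4)
        .map(|chunk| {
            chunk.iter().enumerate().fold(0u32, |word, (i, &v)| {
                // `v as u8` keeps the two's-complement bit pattern of the i8.
                word | ((v as u8 as u32) << (8 * i))
            })
        })
        .collect()
}

/// Hypothetical inverse: recovers the first `len` i8 values from packed words.
fn unpack_u32s_sketch(words: &[u32], len: usize) -> Vec<i8> {
    words
        .iter()
        .flat_map(|&w| (0..4).map(move |i| ((w >> (8 * i)) & 0xFF) as u8 as i8))
        .take(len)
        .collect()
}
```

Under these assumptions, packing `[1, 2, 3, 4]` yields the single word `0x04030201`. The unpacking helper needs the original length because the padding bytes of the last word are otherwise indistinguishable from real zeros.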