Module quantization

Module quantization 

Source
Expand description

Quantization data representation.

Structs§

BlockSize
Copyable block size, specialized version of SmallVec.
QParams
The quantization tensor data parameters.
QuantScheme
Describes a quantization scheme/configuration.
QuantizedBytes
Quantized data bytes representation.

Enums§

QuantLevel
Level or granularity of quantization.
QuantMode
Strategy used to quantize values.
QuantParam
Quantization floating-point precision.
QuantStore
Data type used to stored quantized values.
QuantValue
Data type used to represent quantized values.

Functions§

pack_i8s_to_u32s
Pack signed 8-bit integer values into a sequence of unsigned 32-bit integers.