
Module types



Quantization Types

Defines quantization formats and data structures.

@version 0.1.0 @author AutomataNexus Development Team

Structs

Q4Block
A block of Q4_0 quantized data.
Q4_1Block
A block of Q4_1 quantized data (with min value).
Q8Block
A block of Q8_0 quantized data.
QuantizedTensor
A quantized tensor containing compressed weight data.
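
To make the block formats more concrete, here is a minimal sketch of how a Q8_0-style block might be built and decoded. It is not taken from this crate: the name `Q8BlockSketch`, its fields, the `quantize_q8` and `dequantize_q8` helpers, and the block size of 32 are all assumptions modeled on the common ggml convention (one scale per block of 32 signed 8-bit values). The actual `Q8Block` may differ, for example by storing the scale as a 16-bit float.

```rust
/// Weights per block. Assumed to be 32, following the ggml convention;
/// the crate's real block size is not stated on this page.
const BLOCK_SIZE: usize = 32;

/// Hypothetical Q8_0-style block: one scale plus 32 signed 8-bit values.
#[derive(Debug, Clone, PartialEq)]
struct Q8BlockSketch {
    scale: f32,
    values: [i8; BLOCK_SIZE],
}

/// Quantize one block: scale = max|x| / 127, q = round(x / scale).
fn quantize_q8(input: &[f32; BLOCK_SIZE]) -> Q8BlockSketch {
    let max_abs = input.iter().fold(0.0f32, |m, x| m.max(x.abs()));
    let scale = max_abs / 127.0;
    // Avoid dividing by zero when the whole block is zero.
    let inv = if scale > 0.0 { 1.0 / scale } else { 0.0 };

    let mut values = [0i8; BLOCK_SIZE];
    for (q, x) in values.iter_mut().zip(input.iter()) {
        *q = (x * inv).round().clamp(-127.0, 127.0) as i8;
    }
    Q8BlockSketch { scale, values }
}

/// Dequantize one block: x ≈ q * scale.
fn dequantize_q8(block: &Q8BlockSketch) -> [f32; BLOCK_SIZE] {
    let mut out = [0.0f32; BLOCK_SIZE];
    for (o, q) in out.iter_mut().zip(block.values.iter()) {
        *o = *q as f32 * block.scale;
    }
    out
}
```

Under this scheme each restored value differs from the original by at most about half the block's scale. A Q4_0-style block would follow the same idea with 4-bit values packed two per byte, and a Q4_1-style block would additionally store a per-block minimum.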

Enums

QuantType
Quantization type enumeration.
QuantizedBlock
Generic quantized block enum.
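
As a rough illustration of what a quantization type enumeration can be used for, the sketch below computes the storage cost of each format. `QuantTypeSketch`, its variant names, and both methods are hypothetical; the variants are guessed from the struct names above, and the byte counts assume the ggml layout (32 weights per block, 16-bit scale and minimum), which this page does not confirm for `QuantType`.

```rust
/// Hypothetical stand-in for a quantization type enum.
/// Variant names are guesses based on the block struct names.
#[allow(non_camel_case_types, dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum QuantTypeSketch {
    Q4_0,
    Q4_1,
    Q8_0,
}

impl QuantTypeSketch {
    /// Assumed number of weights per block (ggml convention).
    const WEIGHTS_PER_BLOCK: usize = 32;

    /// Bytes per block, assuming 16-bit scale/min fields as in ggml.
    fn bytes_per_block(self) -> usize {
        match self {
            // 2-byte scale + 32 four-bit values packed into 16 bytes
            QuantTypeSketch::Q4_0 => 2 + 16,
            // 2-byte scale + 2-byte min + 16 bytes of packed values
            QuantTypeSketch::Q4_1 => 2 + 2 + 16,
            // 2-byte scale + 32 one-byte values
            QuantTypeSketch::Q8_0 => 2 + 32,
        }
    }

    /// Effective bits stored per weight, including block metadata.
    fn bits_per_weight(self) -> f32 {
        (self.bytes_per_block() * 8) as f32 / Self::WEIGHTS_PER_BLOCK as f32
    }
}
```

With these assumed layouts, the per-block metadata adds half a bit per weight to the 4-bit and 8-bit formats, and a full bit to the format that also stores a minimum.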