QuantizedBrick Implementation (PMAT-013)
Implements quantized weight support for ComputeBricks per cbtop spec S17.
§Supported Formats
| Format | Bits/Weight | Memory (vs FP16) | Perplexity Delta |
|---|---|---|---|
| Q4_0 | 4.0 | 25% | ~0.5% |
| Q4_K | 4.5 | 28% | ~0.3% |
| Q5_K | 5.5 | 34% | ~0.1% |
| Q8_0 | 8.0 | 50% | ~0.01% |
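The memory column follows from bits per weight relative to a 16-bit FP16 baseline; a small sketch of that arithmetic (the helper name is illustrative, not part of this crate's API):

```rust
// Sketch: memory footprint relative to an FP16 baseline, assuming the table's
// "Memory" column is bits_per_weight / 16 (an assumption, not crate API).
fn memory_fraction(bits_per_weight: f64) -> f64 {
    bits_per_weight / 16.0
}

fn main() {
    for (name, bits) in [("Q4_0", 4.0), ("Q4_K", 4.5), ("Q5_K", 5.5), ("Q8_0", 8.0)] {
        println!("{name}: {:.0}% of FP16", memory_fraction(bits) * 100.0);
    }
}
```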
§Citations
- [Dettmers et al. 2022] “LLM.int8(): 8-bit Matrix Multiplication” NeurIPS
- [Frantar et al. 2023] “GPTQ: Accurate Post-Training Quantization” ICLR
- [Lin et al. 2023] “AWQ: Activation-aware Weight Quantization” MLSys
Structs§
- GgufHeader - GGUF file header (simplified parsing).
- GgufLoader - GGUF file loader (basic implementation).
- GgufTensorInfo - GGUF tensor info.
- LayerQuantStats - Per-layer quantization statistics.
- QuantStats - Quantization statistics for a model or layer.
- QuantizedBrick - QuantizedBrick wraps compute operations with quantized weights.
- QuantizedWeights - Quantized weight storage for a single layer.
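As background for what QuantizedWeights stores, here is a minimal sketch of dequantizing one Q4_0 block in the standard GGML layout (32 weights per block, one shared scale, two 4-bit quants packed per byte, weight = scale * (q - 8)). The function name and f32 scale are illustrative, not this crate's API (GGML stores the scale as f16):

```rust
// Illustrative Q4_0 block dequantization (assumption: GGML layout, where the
// low nibbles hold elements 0..16 and the high nibbles hold elements 16..32).
fn dequant_q4_0_block(scale: f32, packed: &[u8; 16]) -> [f32; 32] {
    let mut out = [0.0f32; 32];
    for (i, byte) in packed.iter().enumerate() {
        let lo = (byte & 0x0F) as i32 - 8; // low nibble, centered at zero
        let hi = (byte >> 4) as i32 - 8;   // high nibble, centered at zero
        out[i] = scale * lo as f32;
        out[i + 16] = scale * hi as f32;
    }
    out
}

fn main() {
    // All-zero quants decode to -8 * scale for every weight.
    let block = dequant_q4_0_block(2.0, &[0u8; 16]);
    println!("{} {}", block[0], block[16]);
}
```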
Enums§
- DequantStrategy - Dequantization strategy.
- GgufError - GGUF parsing errors.
- GgufValue - GGUF metadata value types.
- QuantFormat - Supported quantization formats for ComputeBricks.
Functions§
- ggml_type_to_format - GGML tensor type to QuantFormat mapping.
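A hypothetical sketch of what such a mapping might look like, assuming the standard GGML tensor type codes (Q4_0 = 2, Q8_0 = 8, Q4_K = 12, Q5_K = 13); the enum shape and return type here are assumptions, not this crate's actual signatures:

```rust
// Illustrative GGML-type-code-to-format mapping; codes follow the ggml enum,
// and unknown or unsupported codes map to None.
#[allow(non_camel_case_types)]
#[derive(Debug, PartialEq)]
enum QuantFormat {
    Q4_0,
    Q4_K,
    Q5_K,
    Q8_0,
}

fn ggml_type_to_format(ggml_type: u32) -> Option<QuantFormat> {
    match ggml_type {
        2 => Some(QuantFormat::Q4_0),
        8 => Some(QuantFormat::Q8_0),
        12 => Some(QuantFormat::Q4_K),
        13 => Some(QuantFormat::Q5_K),
        _ => None, // e.g. F32/F16 or unsupported quant types
    }
}

fn main() {
    println!("{:?}", ggml_type_to_format(2));
}
```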
Type Aliases§
- GgufResult - Result type for GGUF operations.
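To make the GGUF loading pieces above concrete, a minimal header check in the spirit of GgufHeader/GgufError, assuming the GGUF spec's little-endian layout (magic `GGUF`, u32 version, u64 tensor count, u64 metadata KV count); the struct fields and error variants here are illustrative, not the crate's actual definitions:

```rust
// Hedged sketch of simplified GGUF header parsing (v2+ layout assumed).
#[derive(Debug)]
struct GgufHeader {
    version: u32,
    tensor_count: u64,
    metadata_kv_count: u64,
}

#[derive(Debug)]
enum GgufError {
    BadMagic,
    Truncated,
}

fn parse_header(bytes: &[u8]) -> Result<GgufHeader, GgufError> {
    if bytes.len() < 24 {
        return Err(GgufError::Truncated);
    }
    if &bytes[0..4] != b"GGUF" {
        return Err(GgufError::BadMagic);
    }
    Ok(GgufHeader {
        version: u32::from_le_bytes(bytes[4..8].try_into().unwrap()),
        tensor_count: u64::from_le_bytes(bytes[8..16].try_into().unwrap()),
        metadata_kv_count: u64::from_le_bytes(bytes[16..24].try_into().unwrap()),
    })
}

fn main() {
    let mut buf = Vec::new();
    buf.extend_from_slice(b"GGUF");
    buf.extend_from_slice(&3u32.to_le_bytes());
    buf.extend_from_slice(&1u64.to_le_bytes());
    buf.extend_from_slice(&2u64.to_le_bytes());
    println!("{:?}", parse_header(&buf));
}
```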