Skip to main content

Module quant

Module quant 

Source
Expand description

Quantized kernel profiler (Q4K/Q6K CPU). Spec section 4.7. Profiles trueno’s fused dequantization + GEMV CPU kernels.

Structs§

QuantProfile
Quantized kernel profile output.

Enums§

QuantKernel
Supported quantized kernel types.

Functions§

profile_quant
Profile a quantized GEMV kernel.
profile_quant_all
Profile all standard Q4K LLM layer sizes with summary table.