Module dequant_cache

Cache dequantized GGUF weight bytes for static params.

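Qwen3.5 decode with --packed was re-dequantizing every K-quant weight on every matmul, hundreds of times per token. Caching the dequantized output means each static weight is dequantized once and reused. Keys are (k, n, scheme, bytes_hash). A key is stable for identical GGUF bytes regardless of arena offset; the offset itself cannot identify a weight, because multiple compiled graphs reuse offsets.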
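A minimal sketch of a content-keyed cache along these lines, to make the key design concrete. It is not this module's actual API: the names `Scheme`, `Key`, `DequantCache`, and `get_or_dequant` are hypothetical, as are the use of the standard library's `DefaultHasher` for `bytes_hash` and `Vec<f32>` as the cached value type.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};
use std::sync::Arc;

/// Hypothetical quantization scheme tag; the real module may use another type.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum Scheme {
    Q4K,
    Q6K,
}

/// Cache key (k, n, scheme, bytes_hash). It is derived from the packed bytes
/// themselves, never from the arena offset they happen to live at.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Key {
    k: usize,
    n: usize,
    scheme: Scheme,
    bytes_hash: u64,
}

impl Key {
    pub fn new(k: usize, n: usize, scheme: Scheme, packed: &[u8]) -> Self {
        // Assumption: any deterministic 64-bit content hash would do here.
        let mut hasher = DefaultHasher::new();
        packed.hash(&mut hasher);
        Key { k, n, scheme, bytes_hash: hasher.finish() }
    }
}

/// Maps a key to a shared dequantized buffer.
#[derive(Default)]
pub struct DequantCache {
    entries: HashMap<Key, Arc<Vec<f32>>>,
    /// Number of times the dequantizer actually ran.
    pub misses: usize,
}

impl DequantCache {
    /// Returns the cached buffer for `key`, running `dequant` only on a miss.
    pub fn get_or_dequant<F>(&mut self, key: Key, dequant: F) -> Arc<Vec<f32>>
    where
        F: FnOnce() -> Vec<f32>,
    {
        if let Some(hit) = self.entries.get(&key) {
            return Arc::clone(hit);
        }
        self.misses += 1;
        let fresh = Arc::new(dequant());
        self.entries.insert(key, Arc::clone(&fresh));
        fresh
    }
}
```

In this sketch, two copies of the same packed bytes at different addresses produce equal keys, so the second lookup returns the shared buffer without calling the dequantizer. Because only a 64-bit hash of the bytes is stored, distinct weights with the same shape and scheme could in principle collide; a real implementation might use a wider hash or compare the bytes on a hit.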