Module cpu_dequant

Expand description

Linear<CpuBackend> impl for GPTQ weights, dequantized at load time.

Phase 3e/2: replaces the old BackendQuantMarlin::gemm_gptq impl on CpuBackend. The kernel call (Self::gemm on dequantized weights) lives inside CpuGptqLinear::forward instead of the trait method body.