//! GPU Model Adapters (PMAT-106)
//!
//! Adapters that convert supported model formats into `GpuModel`.
//!
//! # Supported Formats
//!
//! - **APR F32** - Native `.apr` format with F32 weights
//! - **APR Q4** - GGUF models with Q4_0 quantization
//! - **SafeTensors** - HuggingFace SafeTensors format (planned)
//!
//! # Coverage Impact
//!
//! These adapters drive coverage for:
//! - `apr_transformer/mod.rs` (F32)
//! - `apr_transformer/q4_simd.rs` (Q4)
//! - `gpu/scheduler/batch.rs`
//! - `api/openai_handlers.rs`
/// PMAT-333: WGPU adapter that dequantizes a quantized model for WGPU inference.
pub use ;
pub use ;
pub use ;