//! Scalar Int8 Quantization for Embedding Retrieval
//!
//! Implements the scalar int8 rescoring retriever specification with:
//! - 4x memory reduction (f32 -> i8)
//! - 99% accuracy retention with rescoring
//! - 3.66x speedup via SIMD acceleration
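//!
//! # Example
//!
//! The memory figure above follows from storing one byte per dimension
//! instead of four. The sketch below illustrates symmetric per-vector scalar
//! quantization under stated assumptions; `quantize` and `dequantize` are
//! hypothetical helpers written for this example and are not this module's
//! API, and the calibration scheme actually used here may differ.
//!
//! ```rust
//! /// Hypothetical helper: maps [-max_abs, max_abs] onto [-127, 127].
//! fn quantize(v: &[f32]) -> (Vec<i8>, f32) {
//!     let max_abs = v.iter().fold(0.0f32, |m, x| m.max(x.abs()));
//!     // Guard against an all-zero vector, which would give a zero scale.
//!     let scale = if max_abs > 0.0 { max_abs / 127.0 } else { 1.0 };
//!     let q: Vec<i8> = v
//!         .iter()
//!         .map(|x| (x / scale).round().clamp(-127.0, 127.0) as i8)
//!         .collect();
//!     (q, scale)
//! }
//!
//! /// Hypothetical helper: recovers approximate f32 values from i8 codes.
//! fn dequantize(q: &[i8], scale: f32) -> Vec<f32> {
//!     q.iter().map(|&x| f32::from(x) * scale).collect()
//! }
//!
//! fn main() {
//!     let v = [0.6f32, -1.0, 0.25, 0.0];
//!     let (q, scale) = quantize(&v);
//!     let back = dequantize(&q, scale);
//!     // Rounding error per element is at most half a quantization step.
//!     for (a, b) in v.iter().zip(&back) {
//!         assert!((a - b).abs() <= scale / 2.0 + 1e-6);
//!     }
//!     // 4 bytes per f32 versus 1 byte per i8.
//!     assert_eq!(std::mem::size_of::<f32>() / std::mem::size_of::<i8>(), 4);
//! }
//! ```
//!
//! A symmetric scale keeps zero exactly representable, so no zero-point
//! offset has to be stored alongside the codes.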
//!
//! # References
//!
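//! - Jacob et al. (2018) - Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
//! - Gholami et al. (2022) - A Survey of Quantization Methods for Efficient Neural Network Inference
//! - Wu et al. (2020) - Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation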
//!
//! # Toyota Way Principles
//!
//! - **Jidoka**: Auto-stop on quantization error > threshold
//! - **Poka-Yoke**: Type-safe precision levels, compile-time checks
//! - **Heijunka**: Batched rescoring with backpressure
//! - **Kaizen**: Continuous calibration improvement
//! - **Genchi Genbutsu**: Hardware-specific benchmarks
//! - **Muda**: Eliminate memory waste (4x reduction via quantization)
// Library code - usage from examples and integration tests
// Re-export all public types
pub use CalibrationStats;
pub use ;
pub use ;
pub use QuantizationParams;
pub use ;
pub use ;