Diamond Maps: inference-time reward alignment via stochastic flow maps.
Implements host-side math from arXiv:2602.05993
(value functions, GLASS flows, weighted renoising). Model integration lives in
rlx-flux2::diamond.
No retraining of the base generative model is required for GLASS-based guidance; multi-step GLASS posterior sampling uses the frozen denoiser as a reference.