// SPDX-License-Identifier: MIT
//! Skeleton for an in-process bit-exact replay test against vLLM.
//!
//! As of v1.0.x this is **not** the validation path — the actual
//! vLLM bit-exact validation runs on Modal via
//! `scripts/gpu-validate-modal.py`, and its output lands in
//! `benchmarks/gpu-validation/*.json`. Latest result (Modal A10G,
//! TinyLlama-1.1B, V0 engine): `bit_exact: true` over 38 619 KV
//! pages with byte-identical regenerated text.
//!
//! This test still exists because it documents the shape we want
//! a self-contained local PF_HAS_GPU=1 path to take eventually
//! (subprocess vLLM → snapshot → SIGKILL → fresh worker →
//! checkout → assert bit-equal logits). When that path lands,
//! replace the explicit-skip body with the real subprocess flow.
//! The on-host proxy that DOES run everywhere is
//! `tests/cache_round_trip.rs`.