//! KV cache types for the Whisper decoder.
use Runtime;
use Tensor;
/// KV cache for the full decoder, one entry per layer.
/// Per-layer KV cache for the decoder.
///
/// - `self_k` / `self_v`: grow by one time-step each decoder step (`[B, H, T, D]`).
/// - `cross_k` / `cross_v`: computed once from the encoder output at decode
/// start and reused on every step (`[B, H, S, D]`).