pub fn sample_scalar(logits: &[f32]) -> usize
Full sampling pipeline: apply temperature, softmax, then greedy (scalar reference).
Returns the selected token index.