pub fn kinetic_energy_gpu(buf: &GpuMdBuffer) -> f32
Compute total kinetic energy: KE = Σ 0.5 * m_i * |v_i|².
KE = Σ 0.5 * m_i * |v_i|²