pub fn std_dev_f32(data: &[f32]) -> Result<f32>
Compute standard deviation using SIMD-accelerated variance
Returns an error if the slice is empty