pub fn quickselect_simd<F>(arr: &mut [F], k: usize) -> Fwhere F: Float + NumCast + SimdUnifiedOps + Display,
SIMD-optimized quickselect algorithm for finding the k-th smallest element
This implementation uses SIMD operations for partitioning when beneficial.