pub fn scale_simd(a: &[f32], s: f32) -> Result<Vec<f32>, TruenoError>
SIMD-accelerated scale using trueno.