Basic arithmetic operations with SIMD acceleration
This module provides optimized implementations of fundamental array operations including element-wise maximum, minimum, and addition with various optimization levels.
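As a point of reference, the element-wise semantics described above can be sketched in plain scalar Rust. This is only an illustration under stated assumptions: the names `add_ref`, `maximum_ref`, and `minimum_ref` are hypothetical, the slice-in, `Vec`-out signatures are assumed rather than taken from this module, and the NaN behaviour shown is that of `f32::max` and `f32::min`, which may differ from the SIMD versions.

```rust
/// Hypothetical scalar reference: element-wise sum of two equal-length slices.
/// Not part of this module; the real functions may use different types.
fn add_ref(a: &[f32], b: &[f32]) -> Vec<f32> {
    assert_eq!(a.len(), b.len(), "inputs must have equal length");
    a.iter().zip(b).map(|(x, y)| x + y).collect()
}

/// Hypothetical scalar reference: element-wise maximum.
/// `f32::max` returns the non-NaN operand when exactly one input is NaN.
fn maximum_ref(a: &[f32], b: &[f32]) -> Vec<f32> {
    assert_eq!(a.len(), b.len(), "inputs must have equal length");
    a.iter().zip(b).map(|(x, y)| x.max(*y)).collect()
}

/// Hypothetical scalar reference: element-wise minimum.
fn minimum_ref(a: &[f32], b: &[f32]) -> Vec<f32> {
    assert_eq!(a.len(), b.len(), "inputs must have equal length");
    a.iter().zip(b).map(|(x, y)| x.min(*y)).collect()
}
```

A scalar reference like this can be useful as an oracle when testing an accelerated variant against known-good output.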
Functions
- simd_add_aligned_ultra
- simd_add_f32 - Compute element-wise addition of two f32 arrays using unified SIMD operations
- simd_add_f64 - Compute element-wise addition of two f64 arrays using unified SIMD operations
- simd_add_f32_fast - PHASE 2: Advanced SIMD addition with CPU feature caching and loop unrolling
- simd_add_f32_optimized - OPTIMIZED: Compute element-wise addition of two f32 arrays using SIMD operations. This is the Phase 1 optimization version with direct memory access patterns
- simd_add_f32_ultra - PHASE 3: Memory-aligned SIMD with prefetching for maximum performance
- simd_maximum_f32 - Compute element-wise maximum of two f32 arrays using unified SIMD operations
- simd_maximum_f64 - Compute element-wise maximum of two f64 arrays using unified SIMD operations
- simd_minimum_f32 - Compute element-wise minimum of two f32 arrays using unified SIMD operations
- simd_minimum_f64 - Compute element-wise minimum of two f64 arrays using unified SIMD operations
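The list mentions loop unrolling for the faster addition variants. The general idea can be sketched as below: a minimal illustration, not this module's implementation. The function name `add_unrolled`, the block width `LANES = 8`, and the slice-based signature are all assumptions, and the sketch uses no SIMD intrinsics, CPU feature detection, alignment handling, or prefetching. It only processes fixed-width blocks so the compiler is free to unroll and auto-vectorize the inner loop.

```rust
/// Hypothetical sketch of a blocked addition loop (not this module's code).
/// Full blocks of LANES elements go through a fixed-trip-count inner loop;
/// leftover elements are handled one at a time.
fn add_unrolled(a: &[f32], b: &[f32]) -> Vec<f32> {
    assert_eq!(a.len(), b.len(), "inputs must have equal length");
    const LANES: usize = 8; // assumed block width, chosen for illustration

    let mut out = vec![0.0f32; a.len()];

    // Split every slice into a part divisible by LANES and a short tail.
    let split = a.len() - a.len() % LANES;
    let (a_main, a_tail) = a.split_at(split);
    let (b_main, b_tail) = b.split_at(split);
    let (o_main, o_tail) = out.split_at_mut(split);

    // Main loop: each iteration handles exactly LANES elements.
    for ((o, x), y) in o_main
        .chunks_exact_mut(LANES)
        .zip(a_main.chunks_exact(LANES))
        .zip(b_main.chunks_exact(LANES))
    {
        for i in 0..LANES {
            o[i] = x[i] + y[i];
        }
    }

    // Tail loop: fewer than LANES elements remain.
    for ((o, x), y) in o_tail.iter_mut().zip(a_tail).zip(b_tail) {
        *o = x + y;
    }

    out
}
```

Whether a blocked loop like this beats a plain iterator chain depends on the compiler and target, so any such variant is best benchmarked rather than assumed faster.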