Module basic

Module basic 

Source
Expand description

Basic arithmetic operations with SIMD acceleration

This module provides optimized implementations of fundamental array operations including element-wise maximum, minimum, and addition with various optimization levels.

Functionsยง

simd_add_aligned_ultra
simd_add_f32
Compute element-wise addition of two f32 arrays using unified SIMD operations
simd_add_f64
Compute element-wise addition of two f64 arrays using unified SIMD operations
simd_add_f32_fast
PHASE 2: Advanced SIMD addition with CPU feature caching and loop unrolling
simd_add_f32_optimized
OPTIMIZED: Compute element-wise addition of two f32 arrays using SIMD operations This is the Phase 1 optimization version with direct memory access patterns
simd_add_f32_ultra
PHASE 3: Memory-aligned SIMD with prefetching for maximum performance
simd_maximum_f32
Compute element-wise maximum of two f32 arrays using unified SIMD operations
simd_maximum_f64
Compute element-wise maximum of two f64 arrays using unified SIMD operations
simd_minimum_f32
Compute element-wise minimum of two f32 arrays using unified SIMD operations
simd_minimum_f64
Compute element-wise minimum of two f64 arrays using unified SIMD operations