yscv-tensor
A SIMD-accelerated tensor library: 115 Tensor operations (in ops.rs), f32/f16/bf16 support, NumPy-style broadcasting, and 32-byte aligned memory.
use Tensor;
let a = from_vec?;
let b = ones;
let c = ?;
Features
- Data types: f32, f16 (IEEE 754 half precision), bf16 (bfloat16, Brain Floating Point)
- Broadcasting: automatic shape expansion following NumPy rules
- SIMD: runtime dispatch for NEON (aarch64) and SSE2/AVX (x86_64)
- Aligned memory: 32-byte aligned allocations for AVX
- Operations: arithmetic, matmul, transpose, reshape, slice, gather, scatter, reduce, clamp, pad, concat, split, topk, sort
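To make the broadcasting rule in the list above concrete, here is a minimal standalone sketch of how NumPy-style shape expansion can be computed. It does not use or describe yscv-tensor's internal code: the function name `broadcast_shape` is hypothetical and exists only for illustration.

```rust
/// Hypothetical helper (not part of yscv-tensor's API): computes the
/// broadcast shape of two shapes under NumPy rules.
/// Returns `None` when the shapes are incompatible.
fn broadcast_shape(a: &[usize], b: &[usize]) -> Option<Vec<usize>> {
    let rank = a.len().max(b.len());
    let mut out = vec![0usize; rank];
    for i in 0..rank {
        // Align shapes at the trailing dimension; a missing leading
        // dimension is treated as size 1.
        let da = if i < a.len() { a[a.len() - 1 - i] } else { 1 };
        let db = if i < b.len() { b[b.len() - 1 - i] } else { 1 };
        out[rank - 1 - i] = match (da, db) {
            // Equal sizes are kept as-is.
            (x, y) if x == y => x,
            // A size-1 dimension stretches to match the other operand.
            (1, y) => y,
            (x, 1) => x,
            // Any other mismatch cannot be broadcast.
            _ => return None,
        };
    }
    Some(out)
}
```

Under these rules, shapes `[2, 3]` and `[3]` combine to `[2, 3]`, shapes `[4, 1, 3]` and `[5, 1]` combine to `[4, 5, 3]`, and shapes `[2, 3]` and `[4]` are rejected.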
Optional Features
[]
= [] # Intel MKL BLAS backend
= [] # Arm Performance Libraries backend
Tests
207 tests covering shapes, broadcasting, dtypes, and edge cases.