yscv-kernels 0.1.1

CPU and GPU compute backends with SIMD dispatch and BLAS integration

Coverage
35.74%
104 out of 291 items documented0 out of 0 items with examples
Size
Source code size: 620.36 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 16.33 MB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 34s Average build duration of successful builds.
all releases: 40s Average build duration of successful builds in releases after 2024-10-23.
Links
Repository
crates.io
Dependencies
Versions
Owners

Execution kernels and backend abstraction for yscv.

GPU Inference (Cross-Platform via wgpu)

The gpu feature enables compute shader acceleration via wgpu — Vulkan (Linux/Windows/Android), Metal (macOS/iOS), DX12 (Windows). No CUDA dependency. GPU-accelerated operations:

Matrix multiplication (tiled 16×16 workgroups)
Elementwise: add, sub, mul
Activations: relu, sigmoid
Normalization: batch_norm, layer_norm, group_norm, rms_norm, softmax
Convolution: conv2d, depthwise_conv2d, separable_conv2d, transpose_conv2d
Pooling: max_pool2d, avg_pool2d

GPU training (backward passes) is on the roadmap. CPU backend is fully optimized with NEON/AVX/SSE SIMD on all platforms.