GPU-accelerated optimizer operations
This module provides GPU acceleration for optimization using SciRS2’s GPU abstractions. It enables a 10-50x speedup for large models through GPU parallelism and tensor cores.
§Features
- GPU-accelerated parameter updates
- Tensor core support for mixed-precision training
- Multi-backend support (CUDA, Metal, OpenCL, WebGPU via SciRS2)
- Automatic host-device data transfer
- GPU memory tracking and management
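As a rough picture of what GPU memory tracking involves, the sketch below counts allocated bytes and remembers the peak. The `MemoryTracker` type and its methods are hypothetical and exist only for illustration. This page does not show the fields of `GpuMemoryStats`, so nothing here should be read as that struct's actual layout.

```rust
/// Hypothetical byte counter for device allocations (illustration only,
/// not part of this module's API).
#[derive(Debug, Default)]
struct MemoryTracker {
    current_bytes: usize,
    peak_bytes: usize,
}

impl MemoryTracker {
    /// Record an allocation and update the high-water mark.
    fn record_alloc(&mut self, bytes: usize) {
        self.current_bytes += bytes;
        self.peak_bytes = self.peak_bytes.max(self.current_bytes);
    }

    /// Record a deallocation; saturates at zero instead of underflowing.
    fn record_free(&mut self, bytes: usize) {
        self.current_bytes = self.current_bytes.saturating_sub(bytes);
    }
}

fn main() {
    let mut tracker = MemoryTracker::default();
    tracker.record_alloc(4_000); // e.g. a buffer of 1000 f32 parameters
    tracker.record_alloc(4_000); // e.g. a matching gradient buffer
    tracker.record_free(4_000);
    println!("{:?}", tracker);
}
```

Tracking the peak separately from the current total is useful because the peak determines whether a model fits on the device at all.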
§Performance
Achieves 10-50x speedup over CPU for models with millions of parameters.
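One reason parameter updates benefit from GPU parallelism is that they are typically element-wise: each parameter is updated from its own gradient, independently of the others. The CPU-only sketch below shows a plain SGD step to make that structure concrete. The function `sgd_step` is a hypothetical helper, not this module's API, and SGD is chosen here only as the simplest example.

```rust
/// Plain SGD step: p <- p - lr * g, applied independently to each element.
/// Hypothetical helper for illustration; not part of this module's API.
fn sgd_step(params: &mut [f32], grads: &[f32], lr: f32) {
    assert_eq!(
        params.len(),
        grads.len(),
        "params and grads must have the same length"
    );
    // Each iteration reads and writes a single element, so the iterations
    // have no dependencies on each other. On a GPU, this is the kind of
    // loop that can be mapped to one thread per element.
    for (p, g) in params.iter_mut().zip(grads) {
        *p -= lr * g;
    }
}

fn main() {
    let mut params = vec![1.0_f32, 2.0, 3.0];
    let grads = vec![0.5_f32, -1.0, 0.0];
    sgd_step(&mut params, &grads, 0.5);
    println!("{:?}", params);
}
```

With only three parameters the loop is trivial. The independence between iterations only pays off at the scale of millions of parameters mentioned above, where host-device transfer cost is amortized.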
§SciRS2 Integration
This module uses SciRS2-Core GPU abstractions exclusively:
- scirs2_core::gpu::GpuContext for GPU context management
- scirs2_core::gpu::GpuBuffer for GPU memory allocation
- scirs2_core::gpu::GpuKernel for GPU kernel execution
- scirs2_core::tensor_cores for mixed-precision optimization
- scirs2_core::array_protocol::GPUArray for GPU array interface
Structs§
- GpuConfig
- GPU optimizer configuration
- GpuMemoryStats
- GPU memory statistics
- GpuOptimizer
- GPU-accelerated optimizer wrapper
- GpuUtils
- GPU optimizer utilities