Expand description
GPU acceleration module (currently CPU fallback) GPU acceleration for neural network operations
This module provides GPU-accelerated implementations of neural network primitives. Includes CUDA support via safe wrappers, mixed precision operations, multi-GPU training, and comprehensive GPU memory management.
Structs§
- Cuda
Tensor - CUDA safe wrapper for tensor operations
- Device
Info - GPU device information
- GpuContext
- GPU context for managing multiple devices
- GpuMemory
Handle - GPU memory handle
- Memory
Stats - GPU memory statistics
- Mixed
Precision Config - Mixed precision configuration
- Multi
GpuTrainer - Multi-GPU training coordinator
- Neural
Ops - Enhanced neural operations accelerator with full GPU support
Enums§
- Reduction
Strategy - Gradient reduction strategy for multi-GPU training
Functions§
- create_
neural_ ops - Helper function to create neural operations with automatic backend detection
- create_
neural_ ops_ with_ backend - Helper function to create neural operations with preferred backend