Expand description
Pre-optimized CUDA Kernels for Neural Operations
This module contains hand-optimized CUDA kernels for common neural network operations, designed for maximum performance and efficiency.
Structsยง
- Kernel
Config - Kernel configuration parameters
- Launch
Params - Kernel launch parameters
- Optimized
Kernels - Collection of optimized CUDA kernels for neural operations