Module cuda_kernels

Source
Expand description

Pre-optimized CUDA Kernels for Neural Operations

This module contains hand-optimized CUDA kernels for common neural network operations, designed for maximum performance and efficiency.

Structsยง

KernelConfig
Kernel configuration parameters
LaunchParams
Kernel launch parameters
OptimizedKernels
Collection of optimized CUDA kernels for neural operations