Expand description
GPU kernels for BitNet operations.
This module provides CubeCL-based GPU kernels for efficient ternary weight x INT8 activation matrix multiplication.
Requires the cuda feature to be enabled.
Functionsยง
- cuda_
available - Check if CUDA kernels are available.