cubecl-cuda 0.2.0

CUDA runtime for CubeCL
Documentation

Cuda runtime

The runtime uses the lower level primitives from cudarc to compile generated CUDA code into a ptx and execute it at runtime.