Proc macro for marking GPU kernel functions.
#[warp_kernel] transforms a function into a proper PTX kernel entry point
when compiling for nvptx64, and generates a host-side launcher when compiling
for the host target.
Usage
In your kernel crate (compiled for nvptx64):
use *;
use warp_kernel;
The macro always emits #[no_mangle] pub unsafe extern "ptx-kernel" fn ...
regardless of target. Kernel crates should target nvptx64 exclusively —
the extern "ptx-kernel" ABI requires nightly abi_ptx and is only
meaningful on GPU targets.