Core IRO CUDA FFI crate.
This crate provides the ABI types, error handling, streams, events, and
device memory management for Rust code orchestrating nvcc-compiled kernels.
CUDA 12.0 or later is required.
See the workspace `README.md` for the full overview, kernel examples, and
benchmark guidance.
Docs: https://docs.rs/iro-cuda-ffi