Expand description
§KAIO
Rust-native GPU kernel authoring framework.
KAIO (καίω — to kindle, to ignite) lets developers write GPU compute kernels in Rust and lower them to PTX for execution on NVIDIA GPUs. A Rust alternative to OpenAI’s Triton, targeting Windows and Linux from day one, with automatic PTX generation and Rust’s type-safety guarantees.
§Crates
kaio_core— PTX IR types, instruction emitters, PtxWriterkaio_runtime— CUDA device management, buffers, PTX loading, kernel launchkaio_macros—#[gpu_kernel]proc macro (re-exported here)kaio_ops— pre-built GPU operations (matmul, more planned). Separate crate, not re-exported fromkaio. Add withcargo add kaio-ops.
§Status
Phase 4 complete. Tiled matmul (31% of cuBLAS sgemm), kaio-ops
crate, 2D thread blocks, FMA, PTX inspection tools. See the
repository README for the full feature table and roadmap.
Re-exports§
pub use kaio_core as core;pub use kaio_runtime as runtime;
Modules§
- gpu_
builtins - Stub functions for GPU built-in operations.
- prelude
- The KAIO prelude — everything needed to write and launch GPU kernels.
Macros§
- shared_
mem - Declare a shared memory buffer inside a
#[gpu_kernel]function.
Attribute Macros§
- gpu_
kernel - Re-export the
#[gpu_kernel]attribute macro. Marks a function as a GPU kernel compiled to PTX.