trueno-gpu 0.4.29

Pure Rust PTX generation for NVIDIA CUDA - no LLVM, no nvcc
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
//! Composite operations for GPU-resident tensors.
//!
//! Includes layer normalization, GELU activation, bias add, linear projection,
//! fused linear+GELU, and conv1d operations. Each synchronous operation has
//! a corresponding `_with_stream` variant for pipelined execution.
//!
//! ## Submodules
//!
//! - [`norm_activation`] - Layer normalization and GELU activation
//! - [`linear_bias`] - Bias add, linear projection, fused linear+GELU, and conv1d

#![allow(clippy::similar_names)]

mod linear_bias;
mod norm_activation;