Skip to main content

linear_ptx

Function linear_ptx 

Source
pub fn linear_ptx() -> &'static str
Expand description

PTX assembly for linear projection.

One thread per output element (batch_idx, out_feature). Each thread computes one dot product of x_row and w_row, then adds bias.