Skip to main content

sampling_ptx

Function sampling_ptx 

Source
pub fn sampling_ptx() -> &'static str
Expand description

PTX assembly for greedy sampling (argmax reduction).

Uses parallel reduction to find the maximum logit index.