pub struct ScaleMaskSoftmaxParams {
pub rows: u32,
pub cols: u32,
pub seq_q: u32,
pub scale: f32,
}Expand description
Host-side parameters for scale_mask_softmax_f32.
Fields§
§rows: u32Number of rows = n_heads * seq_q (one threadgroup per row).
cols: u32Length of the reduction axis = seq_k. Must match mask row
length.
seq_q: u32Number of query rows per head (= seq_q). Lets the kernel
derive q = row_idx % seq_q for the shared mask index.
scale: f32Pre-softmax multiplicative scale (e.g. 1.0 / sqrt(head_dim)).
Trait Implementations§
Source§impl Clone for ScaleMaskSoftmaxParams
impl Clone for ScaleMaskSoftmaxParams
Source§fn clone(&self) -> ScaleMaskSoftmaxParams
fn clone(&self) -> ScaleMaskSoftmaxParams
Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for ScaleMaskSoftmaxParams
impl Debug for ScaleMaskSoftmaxParams
impl Copy for ScaleMaskSoftmaxParams
Auto Trait Implementations§
impl Freeze for ScaleMaskSoftmaxParams
impl RefUnwindSafe for ScaleMaskSoftmaxParams
impl Send for ScaleMaskSoftmaxParams
impl Sync for ScaleMaskSoftmaxParams
impl Unpin for ScaleMaskSoftmaxParams
impl UnsafeUnpin for ScaleMaskSoftmaxParams
impl UnwindSafe for ScaleMaskSoftmaxParams
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more