pub enum MmaShape {
M16N8K16,
M16N8K32,
}Expand description
The shape of an mma.sync instruction.
Each variant corresponds to a distinct hardware tile geometry. Adding a
new shape requires a new enum variant and a new set of fragment types
because the per-thread register distribution is shape-dependent. See
crate::fragment.
Variants§
M16N8K16
16×16 × 16×8 → 16×8, fp16 / bf16 inputs with fp32 accumulate (Ampere+).
M16N8K32
16×32 × 32×8 → 16×8, signed int8 inputs with int32 accumulate (Ampere+).
Used by mma.sync.aligned.m16n8k32.row.col.s32.s8.s8.s32. K is 32,
not 16 — twice the K-tile of the fp16 path. Introduced in
Sprint 7.1 for INT8 dequantize-matmul.
Implementations§
Trait Implementations§
impl Copy for MmaShape
impl Eq for MmaShape
impl StructuralPartialEq for MmaShape
Auto Trait Implementations§
impl Freeze for MmaShape
impl RefUnwindSafe for MmaShape
impl Send for MmaShape
impl Sync for MmaShape
impl Unpin for MmaShape
impl UnsafeUnpin for MmaShape
impl UnwindSafe for MmaShape
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more