pub struct WmmaLoad<D: Dialect> { /* private fields */ }Implementations§
Source§impl<D: Dialect> WmmaLoad<D>
impl<D: Dialect> WmmaLoad<D>
Sourcepub fn new(frag: Fragment<D>, layout: Option<FragmentLayout<D>>) -> Self
pub fn new(frag: Fragment<D>, layout: Option<FragmentLayout<D>>) -> Self
Constructs a new WmmaLoad.
Source§impl<D: Dialect> WmmaLoad<D>
impl<D: Dialect> WmmaLoad<D>
pub fn fn_name(&self) -> String
Sourcepub fn format_extension(&self, f: &mut Formatter<'_>) -> Result
pub fn format_extension(&self, f: &mut Formatter<'_>) -> Result
Matrix A must be in column major layout (so fragments correspond to a row) Matrices B, C and D must be in row major layout (so fragments correspond to a column)
Each lane is a thread so each column get 8 VGPRs used to store fragments Here is the layout for C and D matrices and how they map to registers
§Lane index 0 1 2 3 … 13 14 15 … 17 18 … 30 31
§VGPR0 | 1,1 | 1,2 | 1,3 | 1,4 | … | 1,13 | 1,14 | 1,15 | … | 2,1 | 2,2 | … | 2,15 | 2,16 |
§VGPR1 | 3,1 | 3,2 | 3,3 | 3,4 | … | 3,13 | 3,14 | 3,15 | … | 4,1 | 4,2 | … | 4,15 | 4,16 |
§VGPR2 | 5,1 | 5,2 | 5,3 | 5,4 | … | 5,13 | 5,14 | 5,15 | … | 6,1 | 6,2 | … | 6,15 | 6,16 |
§VGPR3 | 7,1 | 7,2 | 7,3 | 7,4 | … | 7,13 | 7,14 | 7,15 | … | 8,1 | 8,2 | … | 8,15 | 8,16 |
§VGPR4 | 9,1 | 9,2 | 9,3 | 9,4 | … | 9,13 | 9,14 | 9,15 | … | 10,1 | 10,2 | … | 10,15| 10,16|
§VGPR5 | 11,1 | 11,2 | 11,3 | 11,4 | … | 11,13| 11,14| 11,15| … | 12,1 | 12,2 | … | 12,15| 12,16|
§VGPR6 | 13,1 | 13,2 | 13,3 | 13,4 | … | 13,13| 13,14| 13,15| … | 14,1 | 14,2 | … | 14,15| 14,16|
§VGPR7 | 15,1 | 15,2 | 15,3 | 15,4 | … | 15,13| 15,14| 15,15| … | 16,1 | 16,2 | … | 16,15| 16,16|
Trait Implementations§
impl<D: Dialect> StructuralPartialEq for WmmaLoad<D>
Auto Trait Implementations§
impl<D> Freeze for WmmaLoad<D>
impl<D> RefUnwindSafe for WmmaLoad<D>where
D: RefUnwindSafe,
impl<D> Send for WmmaLoad<D>
impl<D> Sync for WmmaLoad<D>
impl<D> Unpin for WmmaLoad<D>where
D: Unpin,
impl<D> UnwindSafe for WmmaLoad<D>where
D: UnwindSafe,
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more