pub struct TensorCoreEngine { /* private fields */ }Expand description
Tensor Core MMA engine.
Provides mma() (D = A·B + C) matching the semantics of CUDA’s
nvcuda::wmma::mma_sync and PTX mma.sync instructions.
Implementations§
Source§impl TensorCoreEngine
impl TensorCoreEngine
Sourcepub fn new(precision: MmaPrecision, shape: FragmentShape) -> Self
pub fn new(precision: MmaPrecision, shape: FragmentShape) -> Self
Create a new engine with specified precision and fragment shape.
Auto Trait Implementations§
impl Freeze for TensorCoreEngine
impl RefUnwindSafe for TensorCoreEngine
impl Send for TensorCoreEngine
impl Sync for TensorCoreEngine
impl Unpin for TensorCoreEngine
impl UnsafeUnpin for TensorCoreEngine
impl UnwindSafe for TensorCoreEngine
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more