pub enum SmArch {
Sm80,
Sm89,
Sm90a,
Sm100,
}Expand description
CUDA streaming-multiprocessor architecture target. The dispatch
table refuses to resolve any key whose arch is not in this list.
Variants§
Sm80
Ampere (A100, A30) — fa2 only.
Sm89
Ada (RTX 40xx, L4) — fa2 only, supports fp8 cuBLASLt but not fa3.
Sm90a
Hopper (H100, H200) — fa3, fp8, TMA, WGMMA, persistent kernels.
Sm100
Blackwell (B100, B200) — forward-compat target; fa3 with fifth-gen tensor cores. Falls back to Hopper kernels for now.
Implementations§
Source§impl SmArch
impl SmArch
Sourcepub fn nvrtc_flag(self) -> &'static str
pub fn nvrtc_flag(self) -> &'static str
CUDA --gpu-architecture string.
Sourcepub fn supports_fa3(self) -> bool
pub fn supports_fa3(self) -> bool
True if this arch supports FlashAttention v3 (Hopper+).
Sourcepub fn supports_fp8(self) -> bool
pub fn supports_fp8(self) -> bool
True if this arch supports fp8 e4m3 / e5m2 in FA3.
Trait Implementations§
impl Copy for SmArch
impl Eq for SmArch
impl StructuralPartialEq for SmArch
Auto Trait Implementations§
impl Freeze for SmArch
impl RefUnwindSafe for SmArch
impl Send for SmArch
impl Sync for SmArch
impl Unpin for SmArch
impl UnsafeUnpin for SmArch
impl UnwindSafe for SmArch
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.