pub enum DType {
F16,
Bf16,
F8E4m3,
F8E5m2,
}Expand description
Element type for Q / K / V tiles. Distinct from atomr-accel-cuda’s
future CudaDtype so the FlashAttn crate is self-contained.
Variants§
F16
IEEE 754 binary16 — fa2 + fa3.
Bf16
bfloat16 — fa2 + fa3.
F8E4m3
8-bit float, e4m3 — fa3 only, sm_90a+.
F8E5m2
8-bit float, e5m2 — fa3 only, sm_90a+ (used for V in DPA-mixed-precision).
Implementations§
Trait Implementations§
impl Copy for DType
impl Eq for DType
impl StructuralPartialEq for DType
Auto Trait Implementations§
impl Freeze for DType
impl RefUnwindSafe for DType
impl Send for DType
impl Sync for DType
impl Unpin for DType
impl UnsafeUnpin for DType
impl UnwindSafe for DType
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.