pub enum AttentionStrategy {
    FlashBlackboxAccelerated(BlackboxAcceleratedStrategy),
    FlashUnit,
    Fallback,
    Autotune,
}
Strategy used to select which attention implementation to run.
Variants
FlashBlackboxAccelerated(BlackboxAcceleratedStrategy)
Flash Attention using accelerated inner matmuls.
FlashUnit
Flash Attention using unit inner matmuls.
Fallback
Fallback implementation using multiple separate kernels.
Autotune
Available on crate feature autotune only.
Automatically benchmark and select the best strategy at runtime.
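The variants above can be handled with an exhaustive match. The following is a minimal sketch, not the crate's actual code: the enum and BlackboxAcceleratedStrategy are redeclared locally as stand-ins so the snippet compiles on its own, the payload struct is an empty placeholder because its real fields are not shown on this page, and describe is a hypothetical helper. In the real crate the Autotune variant exists only when the autotune feature is enabled, so a match there would need the same feature gate on that arm.

```rust
// Local stand-ins mirroring the documented shape. These are NOT the crate's
// definitions; they only make the sketch self-contained.

/// Hypothetical placeholder: the real payload's fields are not documented here.
#[derive(Debug)]
pub struct BlackboxAcceleratedStrategy;

#[derive(Debug)]
pub enum AttentionStrategy {
    FlashBlackboxAccelerated(BlackboxAcceleratedStrategy),
    FlashUnit,
    Fallback,
    // In the real crate this variant is gated behind the `autotune` feature.
    Autotune,
}

/// Hypothetical helper: returns a short description of each strategy,
/// paraphrasing the variant documentation.
pub fn describe(strategy: &AttentionStrategy) -> &'static str {
    match strategy {
        AttentionStrategy::FlashBlackboxAccelerated(_) => {
            "flash attention, accelerated inner matmuls"
        }
        AttentionStrategy::FlashUnit => "flash attention, unit inner matmuls",
        AttentionStrategy::Fallback => "fallback, multiple separate kernels",
        AttentionStrategy::Autotune => "benchmark and select at runtime",
    }
}
```

Because the match has no wildcard arm, adding a variant to the enum makes the compiler flag every place that needs updating. The derived Debug output for a unit variant is simply its name, for example FlashUnit.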
Trait Implementations
impl Debug for AttentionStrategy
Auto Trait Implementations
impl Freeze for AttentionStrategy
impl RefUnwindSafe for AttentionStrategy
impl Send for AttentionStrategy
impl Sync for AttentionStrategy
impl Unpin for AttentionStrategy
impl UnsafeUnpin for AttentionStrategy
impl UnwindSafe for AttentionStrategy
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.