pub enum AttentionType {
    MultiHead,
    MultiQuery,
    GroupedQuery,
    Flash,
    Paged,
    SlidingWindow,
}
Attention mechanism types
Variants
MultiHead
Standard multi-head attention
MultiQuery
Multi-query attention (MQA)
GroupedQuery
Grouped-query attention (GQA)
Flash
Flash attention
Paged
Paged attention
SlidingWindow
Sliding window attention
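Because the enum carries no data, callers will typically dispatch on it with an exhaustive `match`. The following is a minimal sketch under stated assumptions: the enum is mirrored locally so the snippet compiles on its own (real code would import `AttentionType` from the crate), and `describe` is a hypothetical helper, not part of the documented API. It simply returns the descriptions listed above.

```rust
// Local mirror of the documented enum so this sketch is self-contained.
// Only the derives that can be written without external crates are shown.
#[allow(dead_code)]
#[derive(Clone, Copy, Debug)]
pub enum AttentionType {
    MultiHead,
    MultiQuery,
    GroupedQuery,
    Flash,
    Paged,
    SlidingWindow,
}

/// Hypothetical helper (an assumption for illustration): maps each variant
/// to the short description given in the variant list.
pub fn describe(kind: AttentionType) -> &'static str {
    // No wildcard arm: if a variant is added later, the compiler
    // flags this match as non-exhaustive.
    match kind {
        AttentionType::MultiHead => "Standard multi-head attention",
        AttentionType::MultiQuery => "Multi-query attention (MQA)",
        AttentionType::GroupedQuery => "Grouped-query attention (GQA)",
        AttentionType::Flash => "Flash attention",
        AttentionType::Paged => "Paged attention",
        AttentionType::SlidingWindow => "Sliding window attention",
    }
}
```

Leaving out a `_` arm is a deliberate choice here: it turns a newly added variant into a compile-time error at every dispatch site instead of a silent fallthrough.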
Trait Implementations
impl Clone for AttentionType
fn clone(&self) -> AttentionType
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for AttentionType
impl<'de> Deserialize<'de> for AttentionType
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl Serialize for AttentionType
impl Copy for AttentionType
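The `Clone`, `Copy`, and `Debug` implementations listed above can be exercised as in the sketch below. It assumes a local mirror of the enum with derived implementations (the derived `Debug` output for a unit variant is the variant name); `variant_name` and `copy_demo` are hypothetical helpers introduced only for illustration. No `PartialEq` implementation is listed on this page, so the sketch compares values with `matches!` rather than `==`.

```rust
// Local mirror of the documented enum; real code would import it instead.
#[allow(dead_code)]
#[derive(Clone, Copy, Debug)]
pub enum AttentionType {
    MultiHead,
    MultiQuery,
    GroupedQuery,
    Flash,
    Paged,
    SlidingWindow,
}

/// Hypothetical helper: renders a variant through its `Debug` impl.
pub fn variant_name(kind: AttentionType) -> String {
    format!("{:?}", kind)
}

/// Hypothetical helper: shows that a value stays usable after being
/// assigned elsewhere, because the type is `Copy`.
pub fn copy_demo() -> (String, String) {
    let original = AttentionType::GroupedQuery;
    let copied = original; // implicit copy; `original` is not moved
    let cloned = original.clone(); // explicit clone, same result for a Copy type
    (variant_name(copied), variant_name(cloned))
}

/// Hypothetical helper: variant check without relying on `PartialEq`.
pub fn is_flash(kind: AttentionType) -> bool {
    matches!(kind, AttentionType::Flash)
}
```

Since the type is `Copy`, it can be passed by value to functions such as `variant_name` without borrowing or cloning at the call site.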
Auto Trait Implementations
impl Freeze for AttentionType
impl RefUnwindSafe for AttentionType
impl Send for AttentionType
impl Sync for AttentionType
impl Unpin for AttentionType
impl UnsafeUnpin for AttentionType
impl UnwindSafe for AttentionType
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.