Module based

Based, a linear attention language model from the Stanford Hazy Research group.

See “Simple linear attention language models balance the recall-throughput tradeoff” (Arora et al., 2024).

Structs

Config
LinearAttentionConfig
LinearAttentionFeatureMapConfig
Model
SlidingWindowAttentionConfig