Expand description
Based from the Stanford Hazy Research group.
See “Simple linear attention language models balance the recall-throughput tradeoff”, Arora et al. 2024
- Simple linear attention language models balance the recall-throughput tradeoff. Arxiv
- GitHub Rep
- Blogpost