
Module based

candle_transformers::models

Implementation of the Based language model from the Stanford Hazy Research group.

See “Simple linear attention language models balance the recall-throughput tradeoff”, Arora et al. 2024

arXiv paper
GitHub repo
Blog post
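
The struct names in this module suggest the model pairs linear attention (with a configurable feature map) with sliding-window attention. As a rough illustration of the linear-attention half only, here is a minimal, dependency-free sketch. It assumes a second-order Taylor feature map, and the names `taylor_feature_map`, `dot`, and `causal_linear_attention` are hypothetical helpers written for this example. None of this is the candle API, and the actual implementation may differ.

```rust
/// Hypothetical feature map (an assumption, not taken from candle):
/// phi(x) = [1, x_i, x_i * x_j / sqrt(2)], so that
/// phi(q) . phi(k) = 1 + q.k + (q.k)^2 / 2, a 2nd-order Taylor series of exp(q.k).
fn taylor_feature_map(x: &[f32]) -> Vec<f32> {
    let mut out = Vec::with_capacity(1 + x.len() + x.len() * x.len());
    out.push(1.0);
    out.extend_from_slice(x);
    let inv_sqrt2 = 1.0 / 2.0_f32.sqrt();
    for &a in x {
        for &b in x {
            out.push(a * b * inv_sqrt2);
        }
    }
    out
}

/// Plain dot product of two equally sized slices.
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Causal linear attention for one head, computed with running sums.
/// `q`, `k`, `v` each hold one vector per token; the state kept between
/// tokens has a fixed size that does not grow with sequence length.
fn causal_linear_attention(q: &[Vec<f32>], k: &[Vec<f32>], v: &[Vec<f32>]) -> Vec<Vec<f32>> {
    if q.is_empty() {
        return Vec::new();
    }
    let d_v = v[0].len();
    let d_f = taylor_feature_map(&k[0]).len();
    // kv accumulates sum_j phi(k_j) v_j^T, z accumulates sum_j phi(k_j).
    let mut kv = vec![vec![0.0f32; d_v]; d_f];
    let mut z = vec![0.0f32; d_f];
    let mut out = Vec::with_capacity(q.len());
    for t in 0..q.len() {
        // Fold token t into the running state before reading it (causal).
        let fk = taylor_feature_map(&k[t]);
        for (i, &f) in fk.iter().enumerate() {
            z[i] += f;
            for (j, &vj) in v[t].iter().enumerate() {
                kv[i][j] += f * vj;
            }
        }
        let fq = taylor_feature_map(&q[t]);
        // 1 + s + s^2/2 is at least 0.5 for every real s, so denom is positive.
        let denom = dot(&fq, &z);
        let row: Vec<f32> = (0..d_v)
            .map(|j| fq.iter().zip(&kv).map(|(f, r)| f * r[j]).sum::<f32>() / denom)
            .collect();
        out.push(row);
    }
    out
}
```

Because the weights are normalized over the tokens seen so far, the output at the first position equals the first value vector. Each later position is a weighted average of the value vectors up to and including that position.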

Structs

Config
LinearAttentionConfig
LinearAttentionFeatureMapConfig
Model
SlidingWindowAttentionConfig