Attention kernels
Enums
- AttentionStrategy - Strategy used to select which attention implementation to run.
Functions
- attention - Launch an attention kernel with the given strategy
- attention_autotune - Executes autotune on attention operations
- flash_attention - Launch a flash attention kernel
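
For orientation, the computation these kernels accelerate is scaled dot-product attention, softmax(Q Kᵀ / sqrt(d)) V. The sketch below is a plain CPU reference in standard Rust and does not use this module's API: the function name `naive_attention`, its parameters, and the row-major `f32` layout are all assumptions made for illustration, and the signatures of `attention`, `attention_autotune`, and `flash_attention` are not shown on this page.

```rust
/// Hypothetical CPU reference for scaled dot-product attention.
/// Not part of this module; names and layout are illustrative assumptions.
///
/// All matrices are row-major:
///   q: [seq_q x d], k: [seq_k x d], v: [seq_k x d_v]; returns [seq_q x d_v].
fn naive_attention(
    q: &[f32],
    k: &[f32],
    v: &[f32],
    seq_q: usize,
    seq_k: usize,
    d: usize,
    d_v: usize,
) -> Vec<f32> {
    assert_eq!(q.len(), seq_q * d);
    assert_eq!(k.len(), seq_k * d);
    assert_eq!(v.len(), seq_k * d_v);

    let scale = 1.0 / (d as f32).sqrt();
    let mut out = vec![0.0f32; seq_q * d_v];
    let mut scores = vec![0.0f32; seq_k];

    for i in 0..seq_q {
        // scores[j] = (q_i . k_j) / sqrt(d), tracking the row maximum.
        let mut max = f32::NEG_INFINITY;
        for j in 0..seq_k {
            let mut dot = 0.0f32;
            for c in 0..d {
                dot += q[i * d + c] * k[j * d + c];
            }
            scores[j] = dot * scale;
            max = max.max(scores[j]);
        }

        // Numerically stable softmax: subtract the row maximum before exp.
        let mut sum = 0.0f32;
        for s in scores.iter_mut() {
            *s = (*s - max).exp();
            sum += *s;
        }

        // out_i = sum_j softmax(scores)[j] * v_j
        for j in 0..seq_k {
            let w = scores[j] / sum;
            for c in 0..d_v {
                out[i * d_v + c] += w * v[j * d_v + c];
            }
        }
    }
    out
}
```

A reference like this can serve as a correctness baseline when comparing the output of different strategies. Flash attention produces the same result in principle, but processes keys and values in tiles so the full score matrix is never materialized.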