Expand description
Advanced Attention Mechanisms
This module provides various attention mechanisms including Flash Attention, multi-query attention, and other efficient attention variants.
Structsยง
- Flash
Attention - Flash Attention for memory-efficient computation