Module attention

Module attention 

Source
Expand description

Advanced Attention Mechanisms

This module provides various attention mechanisms including Flash Attention, multi-query attention, and other efficient attention variants.

Structsยง

FlashAttention
Flash Attention for memory-efficient computation