Module transformer

Module transformer 

Source
Expand description

Transformer models for advanced pattern recognition in sparse matrices

This module contains transformer-based architectures for learning complex patterns in sparse matrix operations and optimizing them adaptively.

Structsยง

AttentionGradients
FFGradients
HeadGradients
LayerGradients
TransformerGradients
Gradient structures for transformer training