Expand description
Pooling strategies for transformer hidden states.
Converts per-token hidden states [seq_len × dim] into a single fixed-size embedding vector [dim].
Functions§
- l2_norm
- Compute the L2 norm of a vector.
- mean_
pool - Mean pooling over token positions, weighted by attention mask.
- normalize_
l2 - L2-normalize a vector in-place.