Dense linear projection: the baseline implementation, which calls B::gemm directly.
Supports an optional learnable bias (used by BERT, CLIP, and many other encoder models).
When a bias is set, forward lowers to gemm + add_bias (one extra
dispatch on GPU backends, still part of the current command buffer).
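
The gemm + add_bias lowering described above can be illustrated with a minimal CPU-only sketch. Everything in it is an assumption made for illustration: the `Backend` trait, its method signatures, the `Cpu` type, the `DenseLinearSketch` struct, and the row-major `[in_features x out_features]` weight layout are hypothetical and may not match this crate's actual `B` trait or struct definition.

```rust
/// Hypothetical backend trait; the crate's real `B` may look different.
trait Backend {
    /// Row-major GEMM: returns out[m x n] = a[m x k] * b[k x n].
    fn gemm(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32>;
    /// Adds bias[n] to every row of out[m x n], in place.
    fn add_bias(out: &mut [f32], bias: &[f32], n: usize);
}

/// Naive reference backend, used only to make the sketch runnable.
struct Cpu;

impl Backend for Cpu {
    fn gemm(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32> {
        assert_eq!(a.len(), m * k);
        assert_eq!(b.len(), k * n);
        let mut out = vec![0.0f32; m * n];
        for i in 0..m {
            for p in 0..k {
                let a_ip = a[i * k + p];
                for j in 0..n {
                    out[i * n + j] += a_ip * b[p * n + j];
                }
            }
        }
        out
    }

    fn add_bias(out: &mut [f32], bias: &[f32], n: usize) {
        assert_eq!(bias.len(), n);
        for row in out.chunks_mut(n) {
            for (o, b) in row.iter_mut().zip(bias) {
                *o += *b;
            }
        }
    }
}

/// Illustrative stand-in for the dense layer (field names are assumptions).
struct DenseLinearSketch {
    weight: Vec<f32>,       // [in_features x out_features], row-major
    bias: Option<Vec<f32>>, // [out_features], optional
    in_features: usize,
    out_features: usize,
}

impl DenseLinearSketch {
    /// One gemm call, followed by add_bias only when a bias is present.
    fn forward<B: Backend>(&self, x: &[f32], batch: usize) -> Vec<f32> {
        let mut y = B::gemm(x, &self.weight, batch, self.in_features, self.out_features);
        if let Some(bias) = &self.bias {
            B::add_bias(&mut y, bias, self.out_features);
        }
        y
    }
}
```

In this sketch, a layer built with `bias: None` issues only the gemm call, while a layer with `Some(bias)` issues one additional call. That mirrors the single extra dispatch mentioned above, although a real GPU backend would record both operations into a command buffer instead of running them eagerly.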
Structs§
- DenseLinear - Dense linear projection.