Expand description
ADR-020 iter-11h-c2 — vector outer product forward + backward.
Forward: y[i, j] = lhs[i] · rhs[j]
Backward: dlhs[i] = Σ_j dy[i, j] · rhs[j]
drhs[j] = Σ_i dy[i, j] · lhs[i]
Distinct from matmul: matmul kernel has a 32-element floor on each
dim (M, N, K ≥ 32 for dW backward); outer products have
inner-dim = 1, falling below that floor.