/// GEGLU activation for REVE.
///
/// Python:
/// class GEGLU(nn.Module):
/// def forward(self, x):
/// x, gates = x.chunk(2, dim=-1)
/// return F.gelu(gates) * x
use *;
use gelu;
/// GEGLU: split input in half along last dim, GELU-gate one half by the other.
/// Input: [*, 2*D] → Output: [*, D]