pub fn grad_mean(a: &GradientTensor) -> CoreResult<GradientTensor>
Mean reduction with gradient tracking.