pub fn reduction_webgpu(a: &Array, axis: Option<usize>) -> Result<Array>
Reduction operations on WebGPU (public API for dispatch)