pub fn eval(op: &Op, out_shape: &Shape, inputs: &[(Shape, HostBuf)]) -> Vec<f32>
Run a single op on the CPU reference and return its f32 output. inputs[i] is (declared_shape, buffer).
inputs[i]
(declared_shape, buffer)