zeph-bench 0.21.4

Benchmark harness for evaluating Zeph agent performance on standardized datasets
Documentation