zeph-bench 0.21.3

Benchmark harness for evaluating Zeph agent performance on standardized datasets
Documentation