zeph-bench 0.20.2

Benchmark harness for evaluating Zeph agent performance on standardized datasets
Documentation