oxibonsai-eval 0.1.3

Model evaluation harness for OxiBonsai — perplexity, MMLU, benchmarks
Documentation