oxibonsai-eval 0.1.1

Model evaluation harness for OxiBonsai: perplexity, MMLU, benchmarks