oxibonsai

Pure Rust 1-bit LLM inference engine for PrismML Bonsai models — umbrella crate.

Re-exports the OxiBonsai subcrates, so a single dependency gives access to the entire OxiBonsai ecosystem. Add it to your Cargo.toml:

[dependencies]
oxibonsai = "0.1.0"

# Or, to enable the optional subsystems, use this line instead
# (a TOML table cannot contain the same key twice):
# oxibonsai = { version = "0.1.0", features = ["full"] }

Subcrates

Crate	Description
`oxibonsai-core`	GGUF loader, tensor types, configuration
`oxibonsai-kernels`	1-bit compute kernels (dequant, GEMV, GEMM, SIMD)
`oxibonsai-model`	Qwen3-8B transformer, KV cache, attention
`oxibonsai-runtime`	Inference engine, sampling, OpenAI-compatible server
`oxibonsai-tokenizer`	Pure Rust BPE tokenizer (optional)
`oxibonsai-rag`	Retrieval-augmented generation pipeline (optional)
`oxibonsai-eval`	Model evaluation framework (optional)
`oxibonsai-serve`	Standalone OpenAI-compatible server (optional)
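
As a rough illustration of how an umbrella crate can expose optional subcrates like those marked "(optional)" above, the sketch below re-exports one always-on module and one feature-gated module. It is not taken from the OxiBonsai source: the inline modules are stand-ins for the real subcrates, and the `umbrella` module, the `rag` feature name, the `CRATE_NAME` constants, and `reexported_subcrates` are all hypothetical names chosen for this example.

```rust
// Stand-in modules playing the role of the real subcrates so that this
// sketch compiles on its own; the real subcrates are separate packages.
pub mod oxibonsai_core {
    pub const CRATE_NAME: &str = "oxibonsai-core";
}

pub mod oxibonsai_rag {
    pub const CRATE_NAME: &str = "oxibonsai-rag";
}

// Hypothetical umbrella layout: the core module is re-exported
// unconditionally, the optional one only when its feature is enabled.
pub mod umbrella {
    pub use super::oxibonsai_core;

    // "rag" is an assumed feature name, not read from the real Cargo.toml.
    #[cfg(feature = "rag")]
    pub use super::oxibonsai_rag;
}

/// Lists the stand-in subcrates reachable through `umbrella` in this build.
#[allow(unused_mut)] // `names` is only mutated when the "rag" feature is on
pub fn reexported_subcrates() -> Vec<&'static str> {
    let mut names = vec![umbrella::oxibonsai_core::CRATE_NAME];

    #[cfg(feature = "rag")]
    names.push(umbrella::oxibonsai_rag::CRATE_NAME);

    names
}
```

Calling `reexported_subcrates()` in a build without the assumed `rag` feature returns only the core entry. With this pattern, a dependent that never enables the feature does not see the gated re-export at all, which is the usual reason for gating optional parts behind features.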

Apache-2.0 — COOLJAPAN OU