oxillama 0.1.3

Pure Rust LLM inference engine — the sovereign alternative to llama.cpp (meta crate)
Documentation

oxillama

There is very little structured metadata to build this page from currently. You should check the main library docs, readme, or Cargo.toml in case the author documented the features in them.

This version has 19 feature flags, 2 of them enabled by default.

default

bench (default)

server (default)

command-r

dbrx

deepseek

gemma

gpu

grok

jamba

llama

llava

mamba2

mistral

phi

qwen3

simd-avx2

simd-avx512

simd-neon

starcoder