forgellm-runtime 0.6.3

Minimal runtime for ForgeLLM (KV cache, sampling, tokenizer, API server)