forgellm-runtime 0.6.1

Minimal runtime for ForgeLLM (KV cache, sampling, tokenizer, API server)