forgellm-runtime 0.3.0

Minimal runtime for ForgeLLM (KV cache, sampling, tokenizer, API server)

ForgeLLM Runtime — minimal inference runtime.

Provides KV cache management, token sampling, and tokenizer integration for compiled models.
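
To illustrate what token sampling typically involves, here is a minimal sketch of temperature-scaled softmax followed by inverse-CDF sampling. It is an assumption about the general technique, not the crate's actual API: the function names `softmax_with_temperature` and `sample_token` are hypothetical, and the uniform random number `u` is passed in by the caller so that the example needs no external crates.

```rust
/// Convert raw logits into probabilities with temperature scaling.
/// Hypothetical helper for illustration; not part of forgellm-runtime.
fn softmax_with_temperature(logits: &[f32], temperature: f32) -> Vec<f32> {
    // Clamp the temperature to avoid division by zero; tiny values approach greedy decoding.
    let t = temperature.max(1e-6);
    // Subtract the maximum logit before exponentiating for numerical stability.
    let max = logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = logits.iter().map(|&l| ((l - max) / t).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|&e| e / sum).collect()
}

/// Pick a token index by walking the cumulative distribution.
/// `u` is a uniform random number in [0, 1) supplied by the caller (an assumption
/// made here to keep the sketch dependency-free).
fn sample_token(probs: &[f32], u: f32) -> usize {
    let mut cumulative = 0.0f32;
    for (i, &p) in probs.iter().enumerate() {
        cumulative += p;
        if u < cumulative {
            return i;
        }
    }
    // Fall back to the last index if rounding leaves the total slightly below 1.
    probs.len().saturating_sub(1)
}
```

Lower temperatures concentrate probability on the highest logit, so sampling behaves more like greedy decoding. Higher temperatures flatten the distribution. A real runtime would usually draw `u` from a seeded random number generator and may add top-k or top-p filtering before sampling.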