forgellm-runtime 0.1.0

Minimal runtime for ForgeLLM (KV cache, sampling, tokenizer, API server)

Coverage
100%
2 out of 2 items documented0 out of 1 items with examples
Size
Source code size: 7.32 kB This is the summed size of all the files inside the crates.io package for this release.
Documentation size: 1 MB This is the summed size of all files generated by rustdoc for all configured targets
Ø build duration
this release: 1m Average build duration of successful builds.
all releases: 1m 28s Average build duration of successful builds in releases after 2024-10-23.
Links
Homepage
sauravpanda/forge-llm
3 0 2
crates.io
Dependencies
Versions
Owners

Forge Runtime — minimal inference runtime.

Provides KV cache management, token sampling, tokenizer integration, and an OpenAI-compatible API server.