Crate wgml

Source

Modulesยง

gguf
Loading gguf files.
models
LLM transformer implementations for inference.
ops
Primitives for building LLM inferences.
quantization
Quantization and unquantization structures.