llama-gguf 0.2.0

A high-performance Rust implementation of llama.cpp - LLM inference engine with full GGUF support
Documentation
1
2
3
4
5
6
7
//! Quantization formats and dequantization functions

mod blocks;
mod dequant;

pub use blocks::*;
pub use dequant::*;