Docs.rs
llama-gguf-0.14.0
Platform
x86_64-unknown-linux-gnu
Feature flags
docs.rs
About docs.rs
Badges
Builds
Metadata
Shorthand URLs
Download
Rustdoc JSON
Build queue
Privacy policy
Rust
Rust website
The Book
Standard Library API Reference
Rust by Example
The Cargo Guide
Clippy Documentation
llama-gguf 0.14.0
A high-performance Rust implementation of llama.cpp - LLM inference engine with full GGUF support
Crate
Source
Builds
Feature flags
Documentation
..
add.comp
attention.comp
attention_cached.comp
dequant_q4_k.comp
dequant_q6_k.comp
dequant_q8_0.comp
gelu.comp
layer_norm.comp
matmul.comp
matvec.comp
mul.comp
rms_norm_scale.comp
rms_norm_sum.comp
rope.comp
scale.comp
silu.comp
softmax_div.comp
softmax_exp.comp
softmax_max.comp
vec_mat.comp