Docs.rs
llama-gguf-0.14.0
Platform
x86_64-unknown-linux-gnu
Feature flags
docs.rs
About docs.rs
Badges
Builds
Metadata
Shorthand URLs
Download
Rustdoc JSON
Build queue
Privacy policy
Rust
Rust website
The Book
Standard Library API Reference
Rust by Example
The Cargo Guide
Clippy Documentation
llama-gguf 0.14.0
A high-performance Rust implementation of llama.cpp - LLM inference engine with full GGUF support
Crate
Source
Builds
Feature flags
Documentation
..
add.metal
attention.metal
attention_cached.metal
dequant_q4_k.metal
dequant_q6_k.metal
dequant_q8_0.metal
gelu.metal
layer_norm.metal
matmul.metal
matvec.metal
mul.metal
rms_norm_scale.metal
rms_norm_sum.metal
rope.metal
scale.metal
silu.metal
softmax_div.metal
softmax_exp.metal
softmax_max.metal
vec_mat.metal