Enum GgufBlockFormat

#[non_exhaustive]
#[repr(u16)]
pub enum GgufBlockFormat {
    Q4_0 = 2,
    Q4_1 = 3,
    Q5_0 = 6,
    Q5_1 = 7,
    Q8_0 = 8,
    Q2K = 10,
    Q3K = 11,
    Q4K = 12,
    Q5K = 13,
    Q6K = 14,
    Q8K = 15,
}

GGUF block-format selector for QuantizeKind::GgufDequantize / QuantizeKind::GgufMmvq plans. Mirrors the discriminants used by llama.cpp / ggml so a descriptor can be round-tripped to a GGUF file header without translation.

Block sizes:

Type-0/1 variants (Q4_0, Q4_1, Q5_0, Q5_1, Q8_0) pack 32 quantized values per block plus a shared FP scale; the _1 variants also store an FP min.
K-quant variants (Q2K through Q8K, named Q2_K through Q8_K upstream) pack 256 values per super-block with a multi-level scale hierarchy: quantized sub-block scales plus an FP super-block scale.

Discriminant values match the GGML_TYPE_* enum in upstream ggml.h, ensuring binary compatibility with GGUF file headers.
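As a rough illustration of the round trip described above, the sketch below converts between the enum and a raw type id. `BlockFormat` is a local stand-in with only a subset of variants, and `to_ggml_type` / `from_ggml_type` are hypothetical helpers, not part of this crate's documented API.

```rust
/// Local stand-in for `GgufBlockFormat` (subset of variants) so the sketch
/// compiles on its own. Discriminants are copied from the listing above.
#[repr(u16)]
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum BlockFormat {
    Q4_0 = 2,
    Q8_0 = 8,
    Q4K = 12,
    Q8K = 15,
}

/// Hypothetical helper: enum -> raw type id, with no lookup table needed
/// because the discriminant already is the upstream value.
pub const fn to_ggml_type(fmt: BlockFormat) -> u16 {
    fmt as u16
}

/// Hypothetical helper: raw type id -> enum. Ids that have no variant in
/// this stand-in (for example 9) yield `None` instead of panicking.
pub const fn from_ggml_type(raw: u16) -> Option<BlockFormat> {
    match raw {
        2 => Some(BlockFormat::Q4_0),
        8 => Some(BlockFormat::Q8_0),
        12 => Some(BlockFormat::Q4K),
        15 => Some(BlockFormat::Q8K),
        _ => None,
    }
}
```

Calling `from_ggml_type(to_ggml_type(BlockFormat::Q4K))` returns `Some(BlockFormat::Q4K)`. The reverse direction returns an `Option` because a file may carry type ids this enum does not model.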

Variants (Non-exhaustive)

This enum is marked as non-exhaustive

Non-exhaustive enums may gain additional variants in the future. A match on this enum from another crate must therefore include a wildcard arm to cover any variants added later.
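A minimal sketch of such a match follows. `BlockFormat` is a local stand-in for the enum and `family` is a hypothetical helper. Because the stand-in is defined in the same file, the compiler can see that the wildcard arm is unreachable, so the lint is silenced here; in a downstream crate the arm is required.

```rust
/// Local stand-in for `GgufBlockFormat` so the sketch compiles on its own.
#[non_exhaustive]
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum BlockFormat {
    Q4_0, Q4_1, Q5_0, Q5_1, Q8_0,
    Q2K, Q3K, Q4K, Q5K, Q6K, Q8K,
}

/// Hypothetical helper: name of the quantization family a format belongs to.
// The `_` arm is mandatory when matching from another crate; it is only
// flagged as unreachable here because the enum lives in the same crate.
#[allow(unreachable_patterns)]
pub fn family(fmt: BlockFormat) -> &'static str {
    match fmt {
        BlockFormat::Q4_0
        | BlockFormat::Q4_1
        | BlockFormat::Q5_0
        | BlockFormat::Q5_1
        | BlockFormat::Q8_0 => "type-0/1",
        BlockFormat::Q2K
        | BlockFormat::Q3K
        | BlockFormat::Q4K
        | BlockFormat::Q5K
        | BlockFormat::Q6K
        | BlockFormat::Q8K => "k-quant",
        // Fallback for variants added in a later release.
        _ => "unknown",
    }
}
```

Returning a neutral fallback rather than panicking in the wildcard arm keeps callers working when the crate adds a format they do not yet handle.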

Q4_0 = 2

4-bit, 32-element block, single FP scale. block_q4_0.

Q4_1 = 3

4-bit, 32-element block, FP scale + FP min. block_q4_1.

Q5_0 = 6

5-bit, 32-element block, single FP scale. block_q5_0.

Q5_1 = 7

5-bit, 32-element block, FP scale + FP min. block_q5_1.

Q8_0 = 8

8-bit, 32-element block, single FP scale. block_q8_0.

Q2K = 10

2.6-bit (effective), 256-element super-block. block_q2_K.

Q3K = 11

3.4-bit (effective), 256-element super-block. block_q3_K.

Q4K = 12

4.5-bit (effective), 256-element super-block. block_q4_K.

Q5K = 13

5.5-bit (effective), 256-element super-block. block_q5_K.

Q6K = 14

6.6-bit (effective), 256-element super-block. block_q6_K.

Q8K = 15

8-bit, 256-element super-block (CPU-side intermediate). block_q8_K. Dequantization is supported; MMVQ is not, matching llama.cpp, which has no upstream MMVQ specialization for this type.
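The "effective" bit widths quoted for the variants can be checked by dividing a block's byte size by the number of values it holds. The sketch below does that arithmetic for a few formats. The byte counts are taken from the upstream ggml struct layouts, and it is an assumption that this crate uses identical layouts; `bits_per_weight` and the constants are illustrative names.

```rust
/// Effective bits per weight for a block of `block_bytes` bytes that
/// stores `block_elems` quantized values.
pub fn bits_per_weight(block_bytes: usize, block_elems: usize) -> f64 {
    (block_bytes * 8) as f64 / block_elems as f64
}

// Byte sizes follow the upstream ggml layouts (assumed to match this crate).

/// block_q4_0: FP16 scale (2) + 32 packed 4-bit values (16).
pub const Q4_0_BYTES: usize = 2 + 16;
/// block_q2_K: sub-block scales/mins (16) + 2-bit values (64) + two FP16 values (2 + 2).
pub const Q2_K_BYTES: usize = 16 + 64 + 2 + 2;
/// block_q4_K: two FP16 values (2 + 2) + packed 6-bit scales/mins (12) + 4-bit values (128).
pub const Q4_K_BYTES: usize = 2 + 2 + 12 + 128;
/// block_q6_K: low 4 bits (128) + high 2 bits (64) + int8 scales (16) + FP16 scale (2).
pub const Q6_K_BYTES: usize = 128 + 64 + 16 + 2;
```

With these sizes, `bits_per_weight(Q4_K_BYTES, 256)` evaluates to 4.5 and `bits_per_weight(Q2_K_BYTES, 256)` to 2.625. The same calculation gives 4.5 for Q4_0, since its "4-bit" label counts only the quantized values and not the per-block scale.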

Implementations

impl GgufBlockFormat

pub const fn block_size(self) -> usize

pub const fn type_size(self) -> usize

pub const fn is_type_01(self) -> bool

pub const fn has_mmvq(self) -> bool
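The four methods are listed without descriptions, so the sketch below shows one plausible reading of them on a local stand-in enum. It assumes `block_size` returns values per block and `type_size` returns bytes per block (the ggml convention), with byte sizes taken from the upstream layouts. The bodies and the `packed_bytes` helper are hypothetical and not taken from this crate.

```rust
/// Local stand-in for `GgufBlockFormat` (subset of variants).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum BlockFormat {
    Q4_0,
    Q8_0,
    Q4K,
    Q8K,
}

impl BlockFormat {
    /// Assumed meaning: quantized values per block.
    pub const fn block_size(self) -> usize {
        match self {
            Self::Q4_0 | Self::Q8_0 => 32,
            Self::Q4K | Self::Q8K => 256,
        }
    }

    /// Assumed meaning: bytes per block, using upstream ggml layout sizes.
    pub const fn type_size(self) -> usize {
        match self {
            Self::Q4_0 => 18,
            Self::Q8_0 => 34,
            Self::Q4K => 144,
            Self::Q8K => 292,
        }
    }

    /// True for the 32-element type-0/1 formats.
    pub const fn is_type_01(self) -> bool {
        matches!(self, Self::Q4_0 | Self::Q8_0)
    }

    /// Q8K is documented above as dequantize-only, so it reports no MMVQ.
    pub const fn has_mmvq(self) -> bool {
        !matches!(self, Self::Q8K)
    }
}

/// Hypothetical helper: bytes needed to store `n_elems` values, or `None`
/// when `n_elems` is not a whole number of blocks.
pub const fn packed_bytes(fmt: BlockFormat, n_elems: usize) -> Option<usize> {
    if n_elems % fmt.block_size() != 0 {
        return None;
    }
    Some(n_elems / fmt.block_size() * fmt.type_size())
}
```

Under these assumptions a 4096-element row takes 2304 bytes in either Q4_0 (128 blocks of 18 bytes) or Q4K (16 super-blocks of 144 bytes), and a caller would check `has_mmvq()` before building a `QuantizeKind::GgufMmvq` plan.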

Trait Implementations

impl Clone for GgufBlockFormat

fn clone(&self) -> GgufBlockFormat

fn clone_from(&mut self, source: &Self)

impl Copy for GgufBlockFormat

impl Debug for GgufBlockFormat

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

impl Eq for GgufBlockFormat

impl Hash for GgufBlockFormat

fn hash<__H>(&self, state: &mut __H) where __H: Hasher,

fn hash_slice<H>(data: &[Self], state: &mut H) where H: Hasher, Self: Sized,

impl PartialEq for GgufBlockFormat

fn eq(&self, other: &GgufBlockFormat) -> bool

fn ne(&self, other: &Rhs) -> bool

impl StructuralPartialEq for GgufBlockFormat

Auto Trait Implementations

impl Freeze for GgufBlockFormat

impl RefUnwindSafe for GgufBlockFormat

impl Send for GgufBlockFormat

impl Sync for GgufBlockFormat

impl Unpin for GgufBlockFormat

impl UnsafeUnpin for GgufBlockFormat

impl UnwindSafe for GgufBlockFormat

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>
