Skip to main content

ModelVariant

Enum ModelVariant 

Source
pub enum ModelVariant {
    Bonsai8B,
    Bonsai4B,
    Bonsai1_7B,
    TernaryBonsai8B,
    TernaryBonsai4B,
    TernaryBonsai1_7B,
    FP8Bonsai8B,
    FP8Bonsai4B,
    FP8Bonsai1_7B,
    Custom,
}
Expand description

Known Bonsai model variants.

Variants§

§

Bonsai8B

Bonsai-8B (Qwen3-8B architecture): 36 layers, hidden=4096

§

Bonsai4B

Bonsai-4B: 24 layers, hidden=2560

§

Bonsai1_7B

Bonsai-1.7B: 16 layers, hidden=1536

§

TernaryBonsai8B

Ternary-Bonsai-8B: same Qwen3-8B architecture, {-1,0,+1} weights (TQ2_0_g128).

§

TernaryBonsai4B

Ternary-Bonsai-4B: same Qwen3-4B architecture, {-1,0,+1} weights (TQ2_0_g128).

§

TernaryBonsai1_7B

Ternary-Bonsai-1.7B: same Qwen3-1.7B architecture, {-1,0,+1} weights (TQ2_0_g128).

§

FP8Bonsai8B

FP8-Bonsai-8B: same Qwen3-8B architecture, FP8 weights (F8_E4M3 or F8_E5M2).

§

FP8Bonsai4B

FP8-Bonsai-4B: same Qwen3-4B architecture, FP8 weights.

§

FP8Bonsai1_7B

FP8-Bonsai-1.7B: same Qwen3-1.7B architecture, FP8 weights.

§

Custom

Custom or unrecognized architecture

Implementations§

Source§

impl ModelVariant

Source

pub fn from_config(config: &Qwen3Config) -> Self

Auto-detect variant from model configuration.

Matches on the combination of num_layers and hidden_size to identify known architectures.

Source

pub fn from_config_and_sample_tensor_type( config: &Qwen3Config, sample_tensor_type: GgufTensorType, ) -> Self

Detect model variant from config + sample tensor type (for ternary vs 1-bit disambiguation).

Architecture match is identical to from_config, but if sample_tensor_type.is_ternary(), the result is upgraded to the ternary sibling variant.

Source

pub fn default_config(&self) -> Qwen3Config

Get the default configuration for this variant.

Returns the standard configuration for known variants. For Custom, returns the 8B configuration as a fallback.

Source

pub fn name(&self) -> &'static str

Human-readable display name for this variant.

Source

pub fn param_count(&self) -> u64

Approximate parameter count for this variant.

Computed as: embedding + attention + ffn + norms + output head. For 1-bit models, each “parameter” is 1 bit + per-group scale. Ternary variants share the same architecture (and thus the same parameter count) as their 1-bit siblings; only the storage format differs.

Source

pub fn expected_model_size_bytes(&self) -> u64

Expected model file size in bytes for the quantized GGUF file.

For 1-bit variants: ~1 bit per param + scale factors + FP16 embeddings. For ternary variants: TQ2_0_g128 uses 34 bytes per 128 weights ≈ 0.266 bytes/param. Embeddings and norms are typically stored in FP16 or FP32.

Source

pub fn known_variants() -> &'static [ModelVariant]

Return all known (non-Custom) variants.

Source

pub fn is_known(&self) -> bool

Whether this variant is a known (non-custom) architecture.

Trait Implementations§

Source§

impl Clone for ModelVariant

Source§

fn clone(&self) -> ModelVariant

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for ModelVariant

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Display for ModelVariant

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Hash for ModelVariant

Source§

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher. Read more
1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more
Source§

impl PartialEq for ModelVariant

Source§

fn eq(&self, other: &ModelVariant) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Copy for ModelVariant

Source§

impl Eq for ModelVariant

Source§

impl StructuralPartialEq for ModelVariant

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more