#[non_exhaustive]
pub enum MoeVariant {
    ScalarGguf,
    Wmma,
    WmmaGguf,
}
Selector for the MoE variant.
This enum is #[non_exhaustive]: additional MoE backend variants (FP8 expert
weights, BF16+WMMA on Hopper, multi-block routing) may land in
future phases. Match arms must include a _ => catch-all.
Variants (Non-exhaustive)
This enum is marked as non-exhaustive.
Non-exhaustive enums may gain additional variants in the future. When matching against the variants of a non-exhaustive enum, add a wildcard arm to account for any future variants.
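The wildcard requirement above can be sketched as follows. This is a minimal illustration, not code from the crate: the enum is redeclared locally so the snippet compiles on its own, and `variant_label` is a hypothetical helper. In a downstream crate you would import `MoeVariant` instead, and the compiler would reject the match without the final arm.

```rust
// Local stand-in for the real enum so this sketch is self-contained.
// Assumption: the derives mirror the trait impls listed on this page.
#[non_exhaustive]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum MoeVariant {
    ScalarGguf,
    Wmma,
    WmmaGguf,
}

/// Hypothetical helper: a short label for logging.
// Inside the defining crate the wildcard arm is redundant, so the lint is
// silenced here; from another crate the arm is mandatory.
#[allow(unreachable_patterns)]
pub fn variant_label(v: MoeVariant) -> &'static str {
    match v {
        MoeVariant::ScalarGguf => "scalar-gguf",
        MoeVariant::Wmma => "wmma",
        MoeVariant::WmmaGguf => "wmma-gguf",
        // Catch-all for variants added in later releases.
        _ => "unknown",
    }
}
```

Returning a neutral fallback from the wildcard arm keeps callers compiling when new variants appear; whether a fallback or an error is the better response depends on the call site.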
ScalarGguf
Scalar dispatch over GGUF-packed expert weights, f32 activations.
Wmma
WMMA tensor cores over FP expert weights, f16/bf16 activations.
WmmaGguf
WMMA tensor cores + GGUF-packed expert weights, f16/bf16 activations. The combined hot path for quantized LLM inference.
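The three variant descriptions differ along two axes: whether WMMA tensor cores are used and whether the expert weights are GGUF-packed. The sketch below makes that explicit. `VariantTraits` and `variant_traits` are hypothetical names introduced for illustration, and the enum is redeclared locally only so the snippet stands alone.

```rust
// Local stand-in for the real enum so this sketch is self-contained.
#[non_exhaustive]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum MoeVariant {
    ScalarGguf,
    Wmma,
    WmmaGguf,
}

/// Hypothetical summary of what each variant implies, read off the
/// variant descriptions on this page.
#[derive(Debug, PartialEq, Eq)]
pub struct VariantTraits {
    pub tensor_cores: bool,
    pub gguf_weights: bool,
}

/// Returns `None` for variants this code does not know about.
// The wildcard arm is only reachable from other crates, hence the allow.
#[allow(unreachable_patterns)]
pub fn variant_traits(v: MoeVariant) -> Option<VariantTraits> {
    match v {
        MoeVariant::ScalarGguf => Some(VariantTraits { tensor_cores: false, gguf_weights: true }),
        MoeVariant::Wmma => Some(VariantTraits { tensor_cores: true, gguf_weights: false }),
        MoeVariant::WmmaGguf => Some(VariantTraits { tensor_cores: true, gguf_weights: true }),
        _ => None,
    }
}
```

Using `Option` for the unknown case avoids guessing the properties of a future variant such as an FP8 backend.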
Trait Implementations
impl Clone for MoeVariant
fn clone(&self) -> MoeVariant
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Copy for MoeVariant
impl Debug for MoeVariant
impl Eq for MoeVariant
impl Hash for MoeVariant
impl PartialEq for MoeVariant
fn eq(&self, other: &MoeVariant) -> bool
Tests for self and other values to be equal, and is used by ==.
impl StructuralPartialEq for MoeVariant
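Because the enum implements Copy, Eq, and Hash, its values can be used directly as HashMap keys and passed by value. A minimal sketch, assuming a locally redeclared enum with matching derives; `count_variants` is a hypothetical helper, not part of the crate:

```rust
use std::collections::HashMap;

// Local stand-in for the real enum so this sketch is self-contained.
#[non_exhaustive]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum MoeVariant {
    ScalarGguf,
    Wmma,
    WmmaGguf,
}

/// Hypothetical helper: tally how often each variant appears in a slice,
/// for example one entry per MoE layer.
pub fn count_variants(layers: &[MoeVariant]) -> HashMap<MoeVariant, usize> {
    let mut counts = HashMap::new();
    // `Copy` lets us take each variant by value out of the slice.
    for &v in layers {
        *counts.entry(v).or_insert(0) += 1;
    }
    counts
}
```

The derived Debug impl prints the bare variant name, which is convenient for log lines.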
Auto Trait Implementations
impl Freeze for MoeVariant
impl RefUnwindSafe for MoeVariant
impl Send for MoeVariant
impl Sync for MoeVariant
impl Unpin for MoeVariant
impl UnsafeUnpin for MoeVariant
impl UnwindSafe for MoeVariant
Blanket Implementations
impl<T> BorrowMut<T> for T where T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.