pub enum ProviderQuantization {
Int4,
Int8,
Fp4,
Fp6,
Fp8,
Fp16,
Bf16,
Fp32,
Unknown,
}Expand description
Model quantization levels for provider filtering.
Quantization reduces model precision to decrease memory usage and increase inference speed, potentially at the cost of output quality.
Variants§
Int4
4-bit integer quantization.
Int8
8-bit integer quantization.
Fp4
4-bit floating point quantization.
Fp6
6-bit floating point quantization.
Fp8
8-bit floating point quantization.
Fp16
16-bit floating point (half precision).
Bf16
16-bit brain floating point.
Fp32
32-bit floating point (full precision).
Unknown
Unknown quantization level.
Trait Implementations§
Source§impl<'arbitrary> Arbitrary<'arbitrary> for ProviderQuantization
impl<'arbitrary> Arbitrary<'arbitrary> for ProviderQuantization
Source§fn arbitrary(u: &mut Unstructured<'arbitrary>) -> Result<Self>
fn arbitrary(u: &mut Unstructured<'arbitrary>) -> Result<Self>
Generate an arbitrary value of
Self from the given unstructured data. Read moreSource§fn arbitrary_take_rest(u: Unstructured<'arbitrary>) -> Result<Self>
fn arbitrary_take_rest(u: Unstructured<'arbitrary>) -> Result<Self>
Generate an arbitrary value of
Self from the entirety of the given
unstructured data. Read moreSource§fn size_hint(depth: usize) -> (usize, Option<usize>)
fn size_hint(depth: usize) -> (usize, Option<usize>)
Get a size hint for how many bytes out of an
Unstructured this type
needs to construct itself. Read moreSource§fn try_size_hint(
depth: usize,
) -> Result<(usize, Option<usize>), MaxRecursionReached>
fn try_size_hint( depth: usize, ) -> Result<(usize, Option<usize>), MaxRecursionReached>
Get a size hint for how many bytes out of an
Unstructured this type
needs to construct itself. Read moreSource§impl Clone for ProviderQuantization
impl Clone for ProviderQuantization
Source§fn clone(&self) -> ProviderQuantization
fn clone(&self) -> ProviderQuantization
Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for ProviderQuantization
impl Debug for ProviderQuantization
Source§impl<'de> Deserialize<'de> for ProviderQuantization
impl<'de> Deserialize<'de> for ProviderQuantization
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl Hash for ProviderQuantization
impl Hash for ProviderQuantization
Source§impl JsonSchema for ProviderQuantization
impl JsonSchema for ProviderQuantization
Source§fn schema_id() -> Cow<'static, str>
fn schema_id() -> Cow<'static, str>
Returns a string that uniquely identifies the schema produced by this type. Read more
Source§fn json_schema(generator: &mut SchemaGenerator) -> Schema
fn json_schema(generator: &mut SchemaGenerator) -> Schema
Generates a JSON Schema for this type. Read more
Source§fn inline_schema() -> bool
fn inline_schema() -> bool
Whether JSON Schemas generated for this type should be included directly in parent schemas,
rather than being re-used where possible using the
$ref keyword. Read moreSource§impl Ord for ProviderQuantization
impl Ord for ProviderQuantization
1.21.0 (const: unstable) · Source§fn max(self, other: Self) -> Selfwhere
Self: Sized,
fn max(self, other: Self) -> Selfwhere
Self: Sized,
Compares and returns the maximum of two values. Read more
Source§impl PartialEq for ProviderQuantization
impl PartialEq for ProviderQuantization
Source§fn eq(&self, other: &ProviderQuantization) -> bool
fn eq(&self, other: &ProviderQuantization) -> bool
Tests for
self and other values to be equal, and is used by ==.Source§impl PartialOrd for ProviderQuantization
impl PartialOrd for ProviderQuantization
Source§impl Serialize for ProviderQuantization
impl Serialize for ProviderQuantization
impl Copy for ProviderQuantization
impl Eq for ProviderQuantization
impl StructuralPartialEq for ProviderQuantization
Auto Trait Implementations§
impl Freeze for ProviderQuantization
impl RefUnwindSafe for ProviderQuantization
impl Send for ProviderQuantization
impl Sync for ProviderQuantization
impl Unpin for ProviderQuantization
impl UnsafeUnpin for ProviderQuantization
impl UnwindSafe for ProviderQuantization
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Comparable<K> for Q
impl<Q, K> Comparable<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more