QuantizedBytes

Struct QuantizedBytes 

Source
pub struct QuantizedBytes {
    pub bytes: Bytes,
    pub scheme: QuantScheme,
    pub num_elements: usize,
}
Expand description

Quantized data bytes representation.

§Notes

  1. The quantized values are packed into 32-bit unsigned integers. For example, int8 quantized values pack 4 grouped values into a single u32. When unpacking these values, we make sure to retrieve only the meaningful values (and ignore the alignment padding).
  2. Quantization parameters are appended to the tensor data. As such, the last bytes always correspond to the scale parameter. If the quantization scheme includes an offset (zero-point) parameter, it is next to last.

Fields§

§bytes: Bytes

The quantized values and quantization parameters represented as bytes.

§scheme: QuantScheme

The quantization scheme.

§num_elements: usize

The number of quantized elements.

Implementations§

Source§

impl QuantizedBytes

Source

pub fn new<E>( value: Vec<E>, scheme: QuantScheme, scales: &[f32], ) -> QuantizedBytes

Creates a new quantized bytes representation.

Source

pub fn into_vec_i8(self) -> (Vec<i8>, QParams<Vec<f32>>)

Returns the int8 quantized values with the quantization parameters.

Auto Trait Implementations§

Blanket Implementations§

§

impl<T> Any for T
where T: 'static + ?Sized,

§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
§

impl<T> Borrow<T> for T
where T: ?Sized,

§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
§

impl<T> BorrowMut<T> for T
where T: ?Sized,

§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
§

impl<T> From<T> for T

§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T, U> Into<U> for T
where U: From<T>,

§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

§

type Error = Infallible

The type returned in the event of a conversion error.
§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V