Enum Canonical

Source
pub enum Canonical {
    Null(NullArray),
    Bool(BoolArray),
    Primitive(PrimitiveArray),
    Struct(StructArray),
    List(ListArray),
    VarBinView(VarBinViewArray),
    Extension(ExtensionArray),
}

The set of canonical array encodings. These are also the encodings that can be transferred to Arrow zero-copy.

Note that a canonical form is not recursive, i.e. a StructArray may contain non-canonical child arrays, which may themselves need to be canonicalized.
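The non-recursive rule can be illustrated with a toy model. This is only a sketch: `Toy`, `into_canonical_deep` and `is_canonical_deep` are hypothetical names invented for the example and are not part of the Vortex API. It shows a struct that is canonical at the top level while still holding a non-canonical child, and a recursive helper that canonicalizes the whole tree.

```rust
// Toy model, not the Vortex API: every name here is hypothetical.
#[derive(Debug, Clone, PartialEq)]
enum Toy {
    /// A canonical primitive array.
    Primitive(Vec<i64>),
    /// A non-canonical encoding: `value` repeated `len` times.
    Constant { value: i64, len: usize },
    /// A canonical struct array whose fields may use any encoding.
    Struct(Vec<Toy>),
}

impl Toy {
    /// Only the top-level encoding is inspected.
    fn is_canonical(&self) -> bool {
        !matches!(self, Toy::Constant { .. })
    }

    /// Canonicalize only the outermost encoding; children are left untouched.
    fn into_canonical(self) -> Toy {
        match self {
            Toy::Constant { value, len } => Toy::Primitive(vec![value; len]),
            other => other,
        }
    }

    /// Canonicalize the whole tree by recursing into struct fields.
    fn into_canonical_deep(self) -> Toy {
        match self.into_canonical() {
            Toy::Struct(fields) => {
                Toy::Struct(fields.into_iter().map(Toy::into_canonical_deep).collect())
            }
            leaf => leaf,
        }
    }

    /// True when this array and all of its descendants are canonical.
    fn is_canonical_deep(&self) -> bool {
        match self {
            Toy::Struct(fields) => fields.iter().all(Toy::is_canonical_deep),
            other => other.is_canonical(),
        }
    }
}
```

In this model a `Toy::Struct` holding a `Toy::Constant` field reports `is_canonical() == true` but `is_canonical_deep() == false` until `into_canonical_deep` has been applied.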

§Logical vs. Physical encodings

Vortex separates logical and physical types, whereas Arrow makes no such separation, and this creates ambiguity when converting between the two. If you receive an Arrow array, compress it using Vortex, and later decompress it to pass to a compute kernel, there are multiple suitable Arrow array variants that could hold the data.

To disambiguate, we choose one canonical physical encoding for every Vortex DType, and each canonical encoding corresponds to an arrow-rs arrow_schema::DataType.
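As a rough illustration of "one canonical encoding per DType", the sketch below maps a simplified logical type to a canonical variant. `ToyDType`, `ToyCanonical` and `canonical_encoding` are hypothetical stand-ins, and the one-to-one pairing by name is an assumption drawn from the variant list; only the Utf8 and Binary to VarBinView rule is stated explicitly on this page.

```rust
// Hypothetical stand-ins for Vortex's `DType` and `Canonical`; the real
// types carry more information (nullability, field names, element types).
#[derive(Debug, Clone, Copy, PartialEq)]
enum ToyDType {
    Null,
    Bool,
    Primitive,
    Utf8,
    Binary,
    Struct,
    List,
    Extension,
}

#[derive(Debug, Clone, Copy, PartialEq)]
enum ToyCanonical {
    Null,
    Bool,
    Primitive,
    Struct,
    List,
    VarBinView,
    Extension,
}

/// Pick the single canonical encoding for a logical type.
/// Both string-like types share the view encoding; the remaining arms
/// are assumed to pair up by name.
fn canonical_encoding(dtype: ToyDType) -> ToyCanonical {
    match dtype {
        ToyDType::Null => ToyCanonical::Null,
        ToyDType::Bool => ToyCanonical::Bool,
        ToyDType::Primitive => ToyCanonical::Primitive,
        ToyDType::Utf8 | ToyDType::Binary => ToyCanonical::VarBinView,
        ToyDType::Struct => ToyCanonical::Struct,
        ToyDType::List => ToyCanonical::List,
        ToyDType::Extension => ToyCanonical::Extension,
    }
}
```

Because the mapping is a total function of the logical type, two arrays with the same DType always decompress to the same physical form, whatever encodings they used while compressed.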

§Views support

Binary and String views, also known as “German strings”, are a better encoding format for nearly all use cases. Variable-length binary views are part of the Apache Arrow spec and are fully supported by the DataFusion query engine. We use them as the canonical encoding for all Utf8 and Binary typed arrays in Vortex.
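For readers unfamiliar with the view layout, here is a minimal sketch of the idea as described in the Arrow columnar format: each value is represented by a fixed-size 16-byte view that either inlines values of up to 12 bytes or stores a 4-byte prefix plus a buffer index and offset. The `View` enum and the `make_view` and `read_view` helpers are hypothetical and model the layout logically; they are not how VarBinViewArray is implemented, and a Rust enum does not occupy exactly 16 bytes.

```rust
/// Logical model of one view. Arrow stores the length as a 32-bit integer;
/// `u32` is used here for simplicity.
#[derive(Debug, Clone, Copy, PartialEq)]
enum View {
    /// Values of at most 12 bytes are stored inside the view itself.
    Inlined { len: u32, data: [u8; 12] },
    /// Longer values keep a 4-byte prefix plus a pointer into a data buffer.
    Reference { len: u32, prefix: [u8; 4], buffer_index: u32, offset: u32 },
}

const MAX_INLINED: usize = 12;

/// Build a view for `value`, appending long values to `buffer`.
fn make_view(value: &[u8], buffer_index: u32, buffer: &mut Vec<u8>) -> View {
    let len = value.len() as u32;
    if value.len() <= MAX_INLINED {
        let mut data = [0u8; 12];
        data[..value.len()].copy_from_slice(value);
        View::Inlined { len, data }
    } else {
        let offset = buffer.len() as u32;
        buffer.extend_from_slice(value);
        let mut prefix = [0u8; 4];
        prefix.copy_from_slice(&value[..4]);
        View::Reference { len, prefix, buffer_index, offset }
    }
}

/// Resolve a view back to the bytes it represents.
fn read_view<'a>(view: &'a View, buffers: &'a [Vec<u8>]) -> &'a [u8] {
    match view {
        View::Inlined { len, data } => &data[..*len as usize],
        View::Reference { len, buffer_index, offset, .. } => {
            let start = *offset as usize;
            &buffers[*buffer_index as usize][start..start + *len as usize]
        }
    }
}
```

Short strings never touch the data buffer, and the stored prefix lets many comparisons on long strings be decided without following the reference.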

Variants§

Implementations§

Methods from Deref<Target = Array>§

Source

pub fn vtable(&self) -> &VTableRef

Return the array’s encoding VTable.

Source

pub fn encoding(&self) -> EncodingId

Return the array’s encoding ID.

Source

pub fn len(&self) -> usize

Returns the number of logical elements in the array.

Source

pub fn is_empty(&self) -> bool

Returns true if the array contains no elements.

Source

pub fn dtype(&self) -> &DType

Return the array’s dtype

Source

pub fn is_canonical(&self) -> bool

Whether the array is of a canonical encoding.

Source

pub fn is_arrow(&self) -> bool

Whether the array is fully zero-copy to Arrow (including children). This means that no nested types, such as Structs, Lists, and Extensions, are present.

Source

pub fn is_constant(&self) -> bool

Return whether the array is constant.

Source

pub fn as_constant(&self) -> Option<Scalar>

Return scalar value of this array if the array is constant

Source

pub fn child<'a>( &'a self, idx: usize, dtype: &'a DType, len: usize, ) -> VortexResult<Self>

Source

pub fn children(&self) -> Vec<Array>

Returns a Vec of Arrays with all the array’s child arrays.

Source

pub fn named_children(&self) -> Vec<(String, Array)>

Returns a Vec of (name, Array) pairs, one for each of the array’s child arrays.

Source

pub fn nchildren(&self) -> usize

Returns the number of child arrays

Source

pub fn depth_first_traversal(&self) -> ArrayChildrenIterator

Source

pub fn cumulative_nbuffers(&self) -> usize

Count the number of cumulative buffers encoded by self.

Source

pub fn all_buffer_offsets(&self, alignment: usize) -> Vec<u64>

Return the buffer offsets and the total length of all buffers, assuming the given alignment. This includes all child buffers.
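One way to picture this computation is shown below. It is a sketch under assumptions: the helper names `align_up` and `buffer_offsets` are hypothetical, and the choices to pad before each buffer and to report the total length as the final entry are guesses based on the description, not a reading of the Vortex source.

```rust
/// Round `n` up to the next multiple of `alignment` (assumed non-zero).
fn align_up(n: u64, alignment: u64) -> u64 {
    (n + alignment - 1) / alignment * alignment
}

/// Lay buffers out back to back, aligning the start of each one.
/// Returns one offset per buffer followed by the total length.
fn buffer_offsets(buffer_lens: &[u64], alignment: u64) -> Vec<u64> {
    let mut offsets = Vec::with_capacity(buffer_lens.len() + 1);
    let mut cursor = 0u64;
    for &len in buffer_lens {
        // Insert padding so that this buffer starts on an aligned boundary.
        cursor = align_up(cursor, alignment);
        offsets.push(cursor);
        cursor += len;
    }
    // Final entry: total number of bytes spanned by all buffers.
    offsets.push(cursor);
    offsets
}
```

With buffer lengths of 10, 3 and 8 bytes and an alignment of 8, this sketch yields the offsets 0, 16 and 24 followed by a total length of 32.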

Source

pub fn metadata_bytes(&self) -> Option<&[u8]>

Source

pub fn nbuffers(&self) -> usize

Source

pub fn byte_buffer(&self, index: usize) -> Option<&ByteBuffer>

Source

pub fn byte_buffers(&self) -> impl Iterator<Item = ByteBuffer> + '_

Source

pub fn is_encoding(&self, id: EncodingId) -> bool

Checks whether array is of a given encoding.

Source

pub fn try_downcast_ref<E: Encoding>(&self) -> VortexResult<(&E::Array, &E)>
where for<'a> &'a E::Array: TryFrom<&'a Array, Error = VortexError>,

Source

pub fn nbytes(&self) -> usize

Total size of the array in bytes, including all children and buffers.

Source

pub fn statistics(&self) -> &(dyn Statistics + '_)

Source

pub fn inherit_statistics(&self, parent: &dyn Statistics)

Source

pub fn tree_display(&self) -> TreeDisplayWrapper<'_>

Source

pub fn is_valid(&self, index: usize) -> VortexResult<bool>

Return whether the element at the given index is valid (true) or null (false).

Source

pub fn null_count(&self) -> VortexResult<usize>

Return the number of null elements in the array.

Source

pub fn logical_validity(&self) -> VortexResult<Mask>

Return the logical validity of the array as a Mask.
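The three validity methods above describe the same information from different angles. The sketch below uses a hypothetical `ToyMask` (not the Vortex `Mask` type) to show how per-element validity and the null count relate; it returns `Option` for out-of-range indices where the real methods return a `VortexResult`.

```rust
/// Hypothetical validity mask: `true` marks a valid element, `false` a null.
struct ToyMask(Vec<bool>);

impl ToyMask {
    /// Validity of one element, or `None` if `index` is out of bounds.
    fn is_valid(&self, index: usize) -> Option<bool> {
        self.0.get(index).copied()
    }

    /// Number of null (invalid) elements.
    fn null_count(&self) -> usize {
        self.0.iter().filter(|valid| !**valid).count()
    }

    /// Number of valid elements; always `len - null_count`.
    fn valid_count(&self) -> usize {
        self.0.len() - self.null_count()
    }
}
```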

Source

pub fn as_null_array(&self) -> Option<&dyn NullArrayTrait>

Source

pub fn as_bool_array(&self) -> Option<&dyn BoolArrayTrait>

Source

pub fn as_primitive_array(&self) -> Option<&dyn PrimitiveArrayTrait>

Source

pub fn as_utf8_array(&self) -> Option<&dyn Utf8ArrayTrait>

Source

pub fn as_binary_array(&self) -> Option<&dyn BinaryArrayTrait>

Source

pub fn as_struct_array(&self) -> Option<&dyn StructArrayTrait>

Source

pub fn as_list_array(&self) -> Option<&dyn ListArrayTrait>

Source

pub fn as_extension_array(&self) -> Option<&dyn ExtensionArrayTrait>

Trait Implementations§

Source§

impl AsRef<Array> for Canonical

Source§

fn as_ref(&self) -> &Array

Converts this type into a shared reference of the (usually inferred) input type.
Source§

impl Clone for Canonical

Source§

fn clone(&self) -> Canonical

Returns a copy of the value.
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.
Source§

impl Debug for Canonical

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
Source§

impl Deref for Canonical

Source§

type Target = Array

The resulting type after dereferencing.
Source§

fn deref(&self) -> &Self::Target

Dereferences the value.
Source§

impl From<Canonical> for Array

This conversion is always “free” and should not touch underlying data. All it does is create an owned pointer to the underlying concrete array type.

Combined with the IntoCanonical impl for Array, this allows simple two-way conversions between arbitrary Vortex encodings and canonical Arrow-compatible encodings.

Source§

fn from(value: Canonical) -> Self

Converts to this type from the input type.
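The Deref and From impls can be pictured with a small model. This is a sketch only: `ToyArray` and `ToyCanonical` are hypothetical, with an `Arc` standing in for whatever shared storage the real `Array` uses. It shows why methods of the target type can be called directly on the enum, and why converting back to the array type only moves a pointer.

```rust
use std::ops::Deref;
use std::sync::Arc;

// Hypothetical model: the real `Array` and `Canonical` are richer types.
#[derive(Debug, Clone)]
struct ToyArray(Arc<Vec<i64>>);

impl ToyArray {
    fn len(&self) -> usize {
        self.0.len()
    }
}

#[derive(Debug, Clone)]
enum ToyCanonical {
    Primitive(ToyArray),
    Bool(ToyArray),
}

impl Deref for ToyCanonical {
    type Target = ToyArray;

    /// Every variant wraps an array, so borrowing it is always possible.
    fn deref(&self) -> &ToyArray {
        match self {
            ToyCanonical::Primitive(a) | ToyCanonical::Bool(a) => a,
        }
    }
}

impl From<ToyCanonical> for ToyArray {
    /// Unwraps the variant; the underlying data is moved, not copied.
    fn from(value: ToyCanonical) -> Self {
        match value {
            ToyCanonical::Primitive(a) | ToyCanonical::Bool(a) => a,
        }
    }
}
```

In this model, `canonical.len()` resolves through `Deref`, and `ToyArray::from(canonical)` returns an array that still points at the original allocation.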
Source§

impl IntoArray for Canonical

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self.
Source§

impl<T> ArchivePointee for T

Source§

type ArchivedMetadata = ()

The archived version of the pointer metadata for this type.
Source§

fn pointer_metadata( _: &<T as ArchivePointee>::ArchivedMetadata, ) -> <T as Pointee>::Metadata

Converts some archived metadata to the pointer metadata for itself.
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value.
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dst: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst.
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoArrayVariant for T
where T: IntoCanonical,

Source§

impl<A> IntoCanonical for A
where A: IntoArray,

Source§

fn into_canonical(self) -> Result<Canonical, VortexError>

Canonicalize the array.
Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

impl<T> LayoutRaw for T

Source§

fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError>

Returns the layout of the type.
Source§

impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
where T: SharedNiching<N1, N2>, N1: Niching<T>, N2: Niching<T>,

Source§

unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool

Returns whether the given value has been niched.
Source§

fn resolve_niched(out: Place<NichedOption<T, N1>>)

Writes data to out indicating that a T is niched.
Source§

impl<T> Pointee for T

Source§

type Metadata = ()

The metadata type for pointers and references to this type.
Source§

impl<P, T> Receiver for P
where P: Deref<Target = T> + ?Sized, T: ?Sized,

Source§

type Target = T

🔬This is a nightly-only experimental API. (arbitrary_self_types)
The target type on which the method may be called.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> ErasedDestructor for T
where T: 'static,

Source§

impl<T> MaybeSendSync for T