pub struct ExtensionArray { /* private fields */ }
Expand description
An extension array that wraps another array with additional type information.
⚠️ Unstable API: This is an experimental feature that may change significantly in future versions. The extension type system is still evolving.
Unlike Apache Arrow’s extension arrays, Vortex extension arrays provide a more flexible mechanism for adding semantic meaning to existing array types without requiring changes to the core type system.
§Design Philosophy
Extension arrays serve as a type-safe wrapper that:
- Preserves the underlying storage format and operations
- Adds semantic type information via
ExtDType
- Enables custom serialization and deserialization logic
- Allows domain-specific interpretations of generic data
§Storage and Type Relationship
The extension array maintains a strict contract:
- Storage array: Contains the actual data in a standard Vortex encoding
- Extension type: Defines how to interpret the storage data semantically
- Type safety: The storage array’s dtype must match the extension type’s storage dtype
§Use Cases
Extension arrays are ideal for:
- Custom numeric types: Units of measurement, currencies
- Temporal types: Custom date/time formats, time zones, calendars
- Domain-specific types: UUIDs, IP addresses, geographic coordinates
- Encoded types: Base64 strings, compressed data, encrypted values
§Validity and Operations
Extension arrays delegate validity and most operations to their storage array:
- Validity is inherited from the underlying storage
- Slicing preserves the extension type
- Scalar access wraps storage scalars with extension metadata
§Examples
use std::sync::Arc;
use vortex_array::arrays::{ExtensionArray, PrimitiveArray};
use vortex_dtype::{ExtDType, ExtID, DType, Nullability, PType};
use vortex_array::validity::Validity;
use vortex_array::IntoArray;
use vortex_buffer::buffer;
// Define a custom extension type for representing currency values
let currency_id = ExtID::from("example.currency");
let currency_dtype = Arc::new(ExtDType::new(
currency_id,
Arc::new(DType::Primitive(PType::I64, Nullability::NonNullable)), // Storage as i64 cents
None, // No additional metadata needed
));
// Create storage array with currency values in cents
let cents_storage = PrimitiveArray::new(
buffer![12345i64, 67890, 99999], // $123.45, $678.90, $999.99
Validity::NonNullable
);
// Wrap with extension type
let currency_array = ExtensionArray::new(
currency_dtype.clone(),
cents_storage.into_array()
);
assert_eq!(currency_array.len(), 3);
assert_eq!(currency_array.id().as_ref(), "example.currency");
// Access maintains extension type information
let first_value = currency_array.scalar_at(0);
assert!(first_value.as_extension_opt().is_some());
Implementations§
Methods from Deref<Target = dyn Array>§
Sourcepub fn display_values(&self) -> impl Display
pub fn display_values(&self) -> impl Display
Display logical values of the array
For example, an i16
typed array containing the first five non-negative integers is displayed
as: [0i16, 1i16, 2i16, 3i16, 4i16]
.
§Examples
let array = buffer![0_i16, 1, 2, 3, 4].into_array();
assert_eq!(
format!("{}", array.display_values()),
"[0i16, 1i16, 2i16, 3i16, 4i16]",
)
See also: Array::display_as, DisplayArrayAs, and DisplayOptions.
Sourcepub fn display_as(&self, options: DisplayOptions) -> impl Display
pub fn display_as(&self, options: DisplayOptions) -> impl Display
Display the array as specified by the options.
See DisplayOptions for examples.
Sourcepub fn display_tree(&self) -> impl Display
pub fn display_tree(&self) -> impl Display
Display the tree of encodings of this array as an indented lists.
While some metadata (such as length, bytes and validity-rate) are included, the logical values of the array are not displayed. To view the logical values see Array::display_as and DisplayOptions.
§Examples
let array = buffer![0_i16, 1, 2, 3, 4].into_array();
let expected = "root: vortex.primitive(i16, len=5) nbytes=10 B (100.00%)
metadata: EmptyMetadata
buffer (align=2): 10 B (100.00%)
";
assert_eq!(format!("{}", array.display_tree()), expected);
Sourcepub fn as_opt<V: VTable>(&self) -> Option<&V::Array>
pub fn as_opt<V: VTable>(&self) -> Option<&V::Array>
Returns the array downcast to the given A
.
pub fn is_constant(&self) -> bool
pub fn is_constant_opts(&self, cost: Cost) -> bool
pub fn as_constant(&self) -> Option<Scalar>
Sourcepub fn nbytes(&self) -> u64
pub fn nbytes(&self) -> u64
Total size of the array in bytes, including all children and buffers.
Sourcepub fn to_array_iterator(&self) -> impl ArrayIterator + 'static
pub fn to_array_iterator(&self) -> impl ArrayIterator + 'static
Create an ArrayIterator
over the array.
Sourcepub fn serialize(
&self,
ctx: &ArrayContext,
options: &SerializeOptions,
) -> VortexResult<Vec<ByteBuffer>>
pub fn serialize( &self, ctx: &ArrayContext, options: &SerializeOptions, ) -> VortexResult<Vec<ByteBuffer>>
Serialize the array into a sequence of byte buffers that should be written contiguously. This function returns a vec to avoid copying data buffers.
Optionally, padding can be included to guarantee buffer alignment and ensure zero-copy reads within the context of an external file or stream. In this case, the alignment of the first byte buffer should be respected when writing the buffers to the stream or file.
The format of this blob is a sequence of data buffers, possible with prefixed padding,
followed by a flatbuffer containing an fba::Array
message, and ending with a
little-endian u32 describing the length of the flatbuffer message.
Sourcepub fn to_array_stream(&self) -> impl ArrayStream + 'static
pub fn to_array_stream(&self) -> impl ArrayStream + 'static
Create an ArrayStream
over the array.
Sourcepub fn as_null_typed(&self) -> NullTyped<'_>
pub fn as_null_typed(&self) -> NullTyped<'_>
Downcasts the array for null-specific behavior.
Sourcepub fn as_bool_typed(&self) -> BoolTyped<'_>
pub fn as_bool_typed(&self) -> BoolTyped<'_>
Downcasts the array for bool-specific behavior.
Sourcepub fn as_primitive_typed(&self) -> PrimitiveTyped<'_>
pub fn as_primitive_typed(&self) -> PrimitiveTyped<'_>
Downcasts the array for primitive-specific behavior.
Sourcepub fn as_decimal_typed(&self) -> DecimalTyped<'_>
pub fn as_decimal_typed(&self) -> DecimalTyped<'_>
Downcasts the array for decimal-specific behavior.
Sourcepub fn as_utf8_typed(&self) -> Utf8Typed<'_>
pub fn as_utf8_typed(&self) -> Utf8Typed<'_>
Downcasts the array for utf8-specific behavior.
Sourcepub fn as_binary_typed(&self) -> BinaryTyped<'_>
pub fn as_binary_typed(&self) -> BinaryTyped<'_>
Downcasts the array for binary-specific behavior.
Sourcepub fn as_struct_typed(&self) -> StructTyped<'_>
pub fn as_struct_typed(&self) -> StructTyped<'_>
Downcasts the array for struct-specific behavior.
Sourcepub fn as_list_typed(&self) -> ListTyped<'_>
pub fn as_list_typed(&self) -> ListTyped<'_>
Downcasts the array for list-specific behavior.
Sourcepub fn as_extension_typed(&self) -> ExtensionTyped<'_>
pub fn as_extension_typed(&self) -> ExtensionTyped<'_>
Downcasts the array for extension-specific behavior.
Trait Implementations§
Source§impl AsRef<dyn Array> for ExtensionArray
impl AsRef<dyn Array> for ExtensionArray
Source§impl Clone for ExtensionArray
impl Clone for ExtensionArray
Source§fn clone(&self) -> ExtensionArray
fn clone(&self) -> ExtensionArray
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
source
. Read moreSource§impl Debug for ExtensionArray
impl Debug for ExtensionArray
Source§impl Deref for ExtensionArray
impl Deref for ExtensionArray
Source§impl From<&TemporalArray> for ExtensionArray
impl From<&TemporalArray> for ExtensionArray
Source§fn from(value: &TemporalArray) -> Self
fn from(value: &TemporalArray) -> Self
Source§impl From<ExtensionArray> for ArrayRef
impl From<ExtensionArray> for ArrayRef
Source§fn from(value: ExtensionArray) -> ArrayRef
fn from(value: ExtensionArray) -> ArrayRef
Source§impl From<TemporalArray> for ExtensionArray
impl From<TemporalArray> for ExtensionArray
Source§fn from(value: TemporalArray) -> Self
fn from(value: TemporalArray) -> Self
Source§impl IntoArray for ExtensionArray
impl IntoArray for ExtensionArray
fn into_array(self) -> ArrayRef
Source§impl TryFrom<ExtensionArray> for TemporalArray
impl TryFrom<ExtensionArray> for TemporalArray
Source§type Error = VortexError
type Error = VortexError
Auto Trait Implementations§
impl Freeze for ExtensionArray
impl !RefUnwindSafe for ExtensionArray
impl Send for ExtensionArray
impl Sync for ExtensionArray
impl Unpin for ExtensionArray
impl !UnwindSafe for ExtensionArray
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left
is true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left(&self)
returns true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read more