Skip to main content

DictionaryEncoding

Struct DictionaryEncoding 

Source
pub struct DictionaryEncoding { /* private fields */ }
Expand description

Stores repeated strings efficiently by referencing them with integer codes.

Each unique string appears once in the dictionary. Values are stored as u32 indices pointing into that dictionary. Great for labels, categories, and other low-cardinality string columns.

Implementations§

Source§

impl DictionaryEncoding

Source

pub fn new(dictionary: Arc<[Arc<str>]>, codes: Vec<u32>) -> Self

Creates a new dictionary encoding from a dictionary and codes.

Source

pub fn with_nulls(self, null_bitmap: Vec<u64>) -> Self

Creates a dictionary encoding with a null bitmap.

Source

pub fn len(&self) -> usize

Returns the number of values.

Source

pub fn is_empty(&self) -> bool

Returns whether the encoding is empty.

Source

pub fn dictionary_size(&self) -> usize

Returns the number of unique strings in the dictionary.

Source

pub fn dictionary(&self) -> &Arc<[Arc<str>]>

Returns the dictionary.

Source

pub fn codes(&self) -> &[u32]

Returns the encoded values.

Source

pub fn is_null(&self, index: usize) -> bool

Returns whether the value at index is null.

Source

pub fn get(&self, index: usize) -> Option<&str>

Returns the string value at the given index.

Returns None if the value is null.

Source

pub fn get_code(&self, index: usize) -> Option<u32>

Returns the code at the given index.

Source

pub fn iter(&self) -> impl Iterator<Item = Option<&str>>

Iterates over all values, yielding Option<&str>.

Source

pub fn compression_ratio(&self) -> f64

Returns the compression ratio (original size / compressed size).

A ratio > 1.0 means compression is effective.

Source

pub fn encode(&self, value: &str) -> Option<u32>

Encodes a lookup value into a code, if it exists in the dictionary.

Source

pub fn filter_by_code(&self, predicate: impl Fn(u32) -> bool) -> Vec<usize>

Filters the encoding to only include rows matching a predicate code.

Trait Implementations§

Source§

impl Clone for DictionaryEncoding

Source§

fn clone(&self) -> DictionaryEncoding

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for DictionaryEncoding

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.