Enum Mask

pub enum Mask {
    AllTrue(usize),
    AllFalse(usize),
    Values(Arc<MaskValues>),
}

Represents a set of sorted, unique, non-negative integers (the indices of the true values).

A Mask can be constructed from various representations, and converted to various others. Internally, these are cached.
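As a rough illustration of how the representations relate, the following sketch converts a boolean slice into sorted indices and then into contiguous runs. It uses only the standard library; `bools_to_indices` and `indices_to_slices` are hypothetical helpers, not part of this crate, and treating the end of each run as exclusive is an assumption of the sketch.

```rust
/// Hypothetical helper: positions of the `true` values in a boolean slice.
fn bools_to_indices(bools: &[bool]) -> Vec<usize> {
    bools
        .iter()
        .enumerate()
        .filter_map(|(i, &b)| b.then_some(i))
        .collect()
}

/// Hypothetical helper: collapse sorted, unique indices into `(start, end)`
/// runs. The end is exclusive here, which is an assumption of this sketch.
fn indices_to_slices(indices: &[usize]) -> Vec<(usize, usize)> {
    let mut slices: Vec<(usize, usize)> = Vec::new();
    for &idx in indices {
        match slices.last_mut() {
            // Extend the current run when the index is adjacent to it.
            Some((_, end)) if *end == idx => *end += 1,
            // Otherwise start a new run of length one.
            _ => slices.push((idx, idx + 1)),
        }
    }
    slices
}
```

All three forms describe the same set of positions; which one is cheapest to work with depends on how dense the mask is.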

Variants§

§

AllTrue(usize)

All values are included.

§

AllFalse(usize)

No values are included.

§

Values(Arc<MaskValues>)

Some values are included, represented as a BooleanBuffer.

Implementations§

Source§

impl Mask

Source

pub fn intersect_by_rank(&self, mask: &Mask) -> Mask

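Intersects mask with the true values of self: mask has one entry per true value in self, and the result keeps only those true values of self whose corresponding entry in mask is true.

The case of most interest is a low-selectivity self (held as indices) combined with a boolean-buffer mask, so other cases are not yet optimized.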

Note: we might be able to accelerate this function on x86 with BMI, see: https://www.microsoft.com/en-us/research/uploads/prod/2023/06/parquet-select-sigmod23.pdf

§Examples

Keep the third and fifth set values from mask m1:

use vortex_mask::Mask;

let m1 = Mask::from_iter([true, false, false, true, true, true, false, true]);
let m2 = Mask::from_iter([false, false, true, false, true]);
assert_eq!(
    m1.intersect_by_rank(&m2),
    Mask::from_iter([false, false, false, false, true, false, false, true])
);
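
The rank-based selection can be sketched on plain boolean slices as follows. This is a standard-library illustration of the semantics shown in the example, not the crate's implementation; the free function `intersect_by_rank` and its length check are assumptions of the sketch.

```rust
/// Sketch of rank-based intersection over plain boolean slices.
/// `rank_mask` is assumed to hold one entry per `true` value of `base`.
fn intersect_by_rank(base: &[bool], rank_mask: &[bool]) -> Vec<bool> {
    let true_count = base.iter().filter(|&&b| b).count();
    assert_eq!(
        true_count,
        rank_mask.len(),
        "rank mask must have one entry per set value"
    );

    // `rank` counts how many set values of `base` have been seen so far.
    let mut rank = 0;
    base.iter()
        .map(|&set| {
            if !set {
                return false;
            }
            let keep = rank_mask[rank];
            rank += 1;
            keep
        })
        .collect()
}
```

The output has the same length as `base`, and only positions that are set in `base` can remain set.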
Source§

impl Mask

Source

pub fn iter_bools<F, T>(&self, f: F) -> T
where F: FnMut(&mut dyn Iterator<Item = bool>) -> T,

Provides a closure with an iterator over the boolean values of the mask.

This allows us to provide different implementations of the iterator based on the underlying representation of the mask, while avoiding a heap allocation to return a boxed iterator.

Note that bool iteration might not be the fastest way to achieve whatever it is you’re trying to do!
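
The closure-based pattern can be illustrated with a simplified stand-in type. `SimpleMask` below is hypothetical and backed by a `Vec<bool>` rather than a BooleanBuffer; the point of the sketch is only that each variant builds a different concrete iterator on the stack and lends it to the closure as a trait object, so nothing is boxed.

```rust
/// Hypothetical stand-in for a mask with three representations.
enum SimpleMask {
    AllTrue(usize),
    AllFalse(usize),
    Values(Vec<bool>),
}

impl SimpleMask {
    /// Hands the caller's closure a borrowed iterator trait object, so each
    /// variant can use its own concrete iterator type without a heap allocation.
    fn iter_bools<F, T>(&self, mut f: F) -> T
    where
        F: FnMut(&mut dyn Iterator<Item = bool>) -> T,
    {
        match self {
            SimpleMask::AllTrue(len) => f(&mut std::iter::repeat(true).take(*len)),
            SimpleMask::AllFalse(len) => f(&mut std::iter::repeat(false).take(*len)),
            SimpleMask::Values(values) => f(&mut values.iter().copied()),
        }
    }
}
```

A caller might count the set values with `mask.iter_bools(|it| it.filter(|&b| b).count())`; the iterator cannot escape the closure, which is what lets it live on the stack.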

Source§

impl Mask

Source

pub fn new_true(length: usize) -> Self

Create a new Mask where all values are set.

Source

pub fn new_false(length: usize) -> Self

Create a new Mask where no values are set.

Source

pub fn from_buffer(buffer: BooleanBuffer) -> Self

Create a new Mask from a BooleanBuffer.

Source

pub fn from_indices(len: usize, indices: Vec<usize>) -> Self

Create a new Mask of length len from a Vec<usize> of indices to be included.

Source

pub fn from_excluded_indices( len: usize, indices: impl IntoIterator<Item = usize>, ) -> Self

Create a new Mask of length len from an IntoIterator<Item = usize> of indices to be excluded.

Source

pub fn from_slices(len: usize, vec: Vec<(usize, usize)>) -> Self

Create a new Mask from a Vec<(usize, usize)> where each pair represents a contiguous range of true values.
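
To make the range representation concrete, here is a minimal standard-library sketch that expands a list of ranges into one boolean per position. `bools_from_slices` is a hypothetical helper, and reading each pair as `(start, end)` with an exclusive end is an assumption of the sketch rather than something this page states.

```rust
/// Expands `(start, end)` ranges into one boolean per position.
/// Treating `end` as exclusive is an assumption of this sketch.
fn bools_from_slices(len: usize, slices: &[(usize, usize)]) -> Vec<bool> {
    let mut bools = vec![false; len];
    for &(start, end) in slices {
        assert!(start <= end && end <= len, "range out of bounds");
        // Mark every position covered by this range as true.
        bools[start..end].fill(true);
    }
    bools
}
```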

Source

pub fn from_intersection_indices( len: usize, lhs: impl Iterator<Item = usize>, rhs: impl Iterator<Item = usize>, ) -> Self

Create a new Mask of length len from the intersection of two iterators of indices.
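
One common way to intersect two index streams is a linear merge, sketched below with the standard library only. `intersect_sorted` is a hypothetical function, and the sketch assumes both inputs are sorted ascending and free of duplicates; whether this crate imposes the same requirement is not stated on this page.

```rust
use std::cmp::Ordering;

/// Merge-style intersection of two index iterators.
/// Assumes both inputs are sorted ascending with no duplicates.
fn intersect_sorted(
    lhs: impl Iterator<Item = usize>,
    rhs: impl Iterator<Item = usize>,
) -> Vec<usize> {
    let mut lhs = lhs.peekable();
    let mut rhs = rhs.peekable();
    let mut out = Vec::new();

    // Advance whichever side is behind; emit an index when both sides agree.
    while let (Some(&l), Some(&r)) = (lhs.peek(), rhs.peek()) {
        match l.cmp(&r) {
            Ordering::Less => {
                lhs.next();
            }
            Ordering::Greater => {
                rhs.next();
            }
            Ordering::Equal => {
                out.push(l);
                lhs.next();
                rhs.next();
            }
        }
    }
    out
}
```

The merge visits each input element at most once, so the cost is linear in the combined length of the two inputs.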

Source

pub fn len(&self) -> usize

Returns the length of the mask (not the number of true values).

Source

pub fn true_count(&self) -> usize

Returns the number of true values in the mask.

Source

pub fn false_count(&self) -> usize

Returns the number of false values in the mask.

Source

pub fn all_true(&self) -> bool

Returns true if all values in the mask are true.

Source

pub fn all_false(&self) -> bool

Returns true if all values in the mask are false.

Source

pub fn density(&self) -> f64

Return the density of the full mask, that is, the fraction of its values that are true.

Source

pub fn value(&self, idx: usize) -> bool

Returns the boolean value at a given index.

§Panics

Panics if the index is out of bounds.

Source

pub fn first(&self) -> Option<usize>

Returns the first true index in the mask.

Source

pub fn slice(&self, offset: usize, length: usize) -> Self

Returns a new Mask covering length values of this mask, starting at offset.

Source

pub fn boolean_buffer(&self) -> AllOr<&BooleanBuffer>

Return the boolean buffer representation of the mask.

Source

pub fn to_boolean_buffer(&self) -> BooleanBuffer

Return a boolean buffer representation of the mask, allocating new buffers for all-true and all-false variants.

Source

pub fn to_null_buffer(&self) -> Option<NullBuffer>

Returns an Arrow null buffer representation of the mask.

Source

pub fn indices(&self) -> AllOr<&[usize]>

Return the indices representation of the mask.

Source

pub fn slices(&self) -> AllOr<&[(usize, usize)]>

Return the slices representation of the mask.

Source

pub fn threshold_iter(&self, threshold: f64) -> AllOr<MaskIter<'_>>

Return an iterator over either indices or slices of the mask based on a density threshold.
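
The idea of picking a representation from a density threshold can be sketched as below. Everything here is hypothetical: `Strategy`, `density` and `plan` are illustrative names, the rule "slices when density is at or above the threshold, indices otherwise" is an assumption about the direction of the comparison, and slice ends are treated as exclusive.

```rust
/// Hypothetical iteration strategy chosen from a density threshold.
#[derive(Debug, PartialEq)]
enum Strategy {
    Indices(Vec<usize>),
    Slices(Vec<(usize, usize)>),
}

/// Fraction of values that are true; an empty input is treated as density 0.
fn density(bools: &[bool]) -> f64 {
    if bools.is_empty() {
        return 0.0;
    }
    bools.iter().filter(|&&b| b).count() as f64 / bools.len() as f64
}

/// Assumed rule: dense masks iterate as runs, sparse masks as indices.
fn plan(bools: &[bool], threshold: f64) -> Strategy {
    if density(bools) >= threshold {
        let mut slices = Vec::new();
        let mut start = None;
        for (i, &b) in bools.iter().enumerate() {
            match (b, start) {
                // A run of true values begins here.
                (true, None) => start = Some(i),
                // The current run ends just before this position.
                (false, Some(s)) => {
                    slices.push((s, i));
                    start = None;
                }
                _ => {}
            }
        }
        // Close a run that extends to the end of the input.
        if let Some(s) = start {
            slices.push((s, bools.len()));
        }
        Strategy::Slices(slices)
    } else {
        let indices = bools
            .iter()
            .enumerate()
            .filter_map(|(i, &b)| b.then_some(i))
            .collect();
        Strategy::Indices(indices)
    }
}
```

Runs amortize per-element overhead when most values are set, while a short index list avoids scanning long stretches of false values when few are.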

Source

pub fn values(&self) -> Option<&MaskValues>

Return MaskValues if the mask is not all true or all false.

Trait Implementations§

Source§

impl BitAnd for &Mask

Source§

type Output = Mask

The resulting type after applying the & operator.
Source§

fn bitand(self, rhs: Self) -> Self::Output

Performs the & operation.
Source§

impl Clone for Mask

Source§

fn clone(&self) -> Mask

Returns a copy of the value.
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.
Source§

impl Debug for Mask

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
Source§

impl From<BooleanBuffer> for Mask

Source§

fn from(value: BooleanBuffer) -> Self

Converts to this type from the input type.
Source§

impl FromIterator<Mask> for Mask

Source§

fn from_iter<T: IntoIterator<Item = Mask>>(iter: T) -> Self

Creates a value from an iterator.
Source§

impl FromIterator<bool> for Mask

Source§

fn from_iter<T: IntoIterator<Item = bool>>(iter: T) -> Self

Creates a value from an iterator.
Source§

impl Not for &Mask

Source§

type Output = Mask

The resulting type after applying the ! operator.
Source§

fn not(self) -> Self::Output

Performs the unary ! operation.
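
Because the `&` and `!` operators are implemented for references, both operands can be reused after the operation. The sketch below shows the same pattern on a hypothetical `BoolMask` newtype over `Vec<bool>`; it illustrates the trait shape only and says nothing about how this crate computes the result. The equal-length check is an assumption of the sketch.

```rust
use std::ops::{BitAnd, Not};

/// Hypothetical mask type used only to illustrate by-reference operators.
#[derive(Clone, Debug, PartialEq)]
struct BoolMask(Vec<bool>);

impl BitAnd for &BoolMask {
    type Output = BoolMask;

    /// Element-wise AND; the operands are borrowed, not consumed.
    fn bitand(self, rhs: Self) -> BoolMask {
        assert_eq!(self.0.len(), rhs.0.len(), "masks must have equal length");
        BoolMask(self.0.iter().zip(&rhs.0).map(|(&a, &b)| a && b).collect())
    }
}

impl Not for &BoolMask {
    type Output = BoolMask;

    /// Element-wise negation producing a new mask.
    fn not(self) -> BoolMask {
        BoolMask(self.0.iter().map(|&b| !b).collect())
    }
}
```

With these impls, an expression such as `&a & &!&b` keeps the values of `a` that are not set in `b`, and `a` and `b` remain usable afterwards.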
Source§

impl PartialEq for Mask

Source§

fn eq(&self, other: &Self) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Eq for Mask

Auto Trait Implementations§

§

impl Freeze for Mask

§

impl RefUnwindSafe for Mask

§

impl Send for Mask

§

impl Sync for Mask

§

impl Unpin for Mask

§

impl UnwindSafe for Mask

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self.
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value.
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

Source§

impl<T> ErasedDestructor for T
where T: 'static,

Source§

impl<T> MaybeSendSync for T