Struct idlset::v1::IDLBitRange

source · [−]

pub struct IDLBitRange { /* private fields */ }

Expand description

An ID List of u64 values, that uses a compressed representation of u64 to speed up set operations, improve cpu cache behaviour and consume less memory.

This is essentially a Vec<u64>, but requires less storage with large values and natively supports logical operations for set manipulation. Today this supports And, Or, AndNot. Future types may be added such as xor.

How does it work?

The IDLBitRange stores a series of tuples (IDRange) that represents a range prefix u64 and a u64 mask of bits representing the presence of that integer in the set. For example, the number 1 when inserted would create an idl range of: IDRange { range: 0, mask: 2 }. The mask has the “second” bit set, so we add range and recieve 1. (if mask was 1, this means the value 0 is present!)

Other examples would be IDRange { range: 0, mask: 3 }. Because 3 means “the first and second bit is set” this would extract to [0, 1] IDRange { range: 0, mask: 38} represents the set [1, 2, 5] as the. second, third and sixth bits are set. Finally, a value of IDRange { range: 64, mask: 4096 } represents the set [76, ].

Using this, we can store up to 64 integers in an IDRange. Once there are at least 3 bits set in mask, the compression is now saving memory space compared to raw unpacked Vec<u64>.

The set operations can now be performed by applying u64 bitwise operations on the mask components for a given matching range prefix. If the range prefix is not present in the partner set, we choose a correct course of action (Or copies the range to the result, And skips the range entirely)

As an example, if we had the values IDRange { range: 0, mask: 38 } ([1, 2, 5]) and IDRange { range: 0, mask: 7 } ([0, 1, 2]), and we were to perform an & operation on these sets, the result would be 7 & 38 == 6. The result now is IDRange { range: 0, mask: 6 }, which decompresses to [1, 2] - the correct result of our set And operation.

The important note here is that with a single cpu & operation, we were able to intersect up to 64 values at once. Contrast to a Vec<u64> where we would need to perform cpu equality on each value. For our above example this would have taken at most 4 cpu operations with the Vec<u64>, where as the IDLBitRange performed 2 (range eq and mask &).

Worst case behaviour is sparse u64 sets where each IDRange only has a single populated value. This yields a slow down of approx 20% compared to the Vec<u64>. However, as soon as the IDRange contains at least 2 values they are equal in performance, and three values begins to exceed. This applies to all operation types and data sizes.

Examples

use idlset::v1::IDLBitRange;
use std::iter::FromIterator;

let idl_a = IDLBitRange::from_iter(vec![1, 2, 3]);
let idl_b = IDLBitRange::from_iter(vec![2]);

// Conduct an and (intersection) of the two lists to find commont members.
let idl_result = idl_a & idl_b;

let idl_expect = IDLBitRange::from_iter(vec![2]);
assert_eq!(idl_result, idl_expect);

Struct idlset::v1::IDLBitRange

Implementations

impl IDLBitRange

pub fn new() -> Self

pub fn from_u64(id: u64) -> Self

pub fn contains(&self, id: u64) -> bool

pub fn insert_id(&mut self, value: u64)

pub fn remove_id(&mut self, value: u64)

pub unsafe fn push_id(&mut self, value: u64)

pub fn len(&self) -> usize

pub fn below_threshold(&self, threshold: usize) -> bool

pub fn is_empty(&self) -> bool

pub fn len_range(&self) -> usize

pub fn sum(&self) -> u64

Trait Implementations

impl AndNot<&IDLBitRange> for &IDLBitRange

fn andnot(self, rhs: &IDLBitRange) -> IDLBitRange

type Output = IDLBitRange

impl AndNot<IDLBitRange> for IDLBitRange

fn andnot(self, rhs: Self) -> IDLBitRange

type Output = IDLBitRange

impl BitAnd<&IDLBitRange> for &IDLBitRange

fn bitand(self, rhs: &IDLBitRange) -> IDLBitRange

type Output = IDLBitRange

impl BitAnd<IDLBitRange> for IDLBitRange

fn bitand(self, rhs: IDLBitRange) -> IDLBitRange

type Output = IDLBitRange

impl BitOr<&IDLBitRange> for &IDLBitRange

fn bitor(self, rhs: &IDLBitRange) -> IDLBitRange

type Output = IDLBitRange

impl BitOr<IDLBitRange> for IDLBitRange

fn bitor(self, rhs: Self) -> IDLBitRange

type Output = IDLBitRange

impl Clone for IDLBitRange

fn clone(&self) -> IDLBitRange

fn clone_from(&mut self, source: &Self)

impl Debug for IDLBitRange

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for IDLBitRange

fn default() -> Self

impl<'de> Deserialize<'de> for IDLBitRange

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where __D: Deserializer<'de>,

impl Display for IDLBitRange

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl FromIterator<u64> for IDLBitRange

fn from_iter<I: IntoIterator<Item = u64>>(iter: I) -> Self

impl<'a> IntoIterator for &'a IDLBitRange

type Item = u64

type IntoIter = IDLBitRangeIter<'a>

fn into_iter(self) -> IDLBitRangeIter<'a>ⓘNotable traits for IDLBitRangeIter<'a>impl<'a> Iterator for IDLBitRangeIter<'a> type Item = u64;

impl PartialEq<IDLBitRange> for IDLBitRange

fn eq(&self, other: &IDLBitRange) -> bool

fn ne(&self, other: &IDLBitRange) -> bool

impl Serialize for IDLBitRange

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error> where __S: Serializer,

impl StructuralPartialEq for IDLBitRange

Auto Trait Implementations

impl RefUnwindSafe for IDLBitRange

impl Send for IDLBitRange

impl Sync for IDLBitRange

impl Unpin for IDLBitRange

impl UnwindSafe for IDLBitRange

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T> ToString for T where T: Display + ?Sized,

default fn to_string(&self) -> String

impl<T, U> TryFrom<U> for T where U: Into<T>,

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error> where
__D: Deserializer<'de>,

fn into_iter(self) -> IDLBitRangeIter<'a>ⓘNotable traits for IDLBitRangeIter<'a>`impl<'a> Iterator for IDLBitRangeIter<'a> type Item = u64;`

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error> where
S: Serializer,

impl<T> Any for T where
T: 'static + ?Sized,

impl<T> Borrow<T> for T where
T: ?Sized,

impl<T> BorrowMut<T> for T where
T: ?Sized,

impl<T, U> Into<U> for T where
U: From<T>,

impl<T> ToOwned for T where
T: Clone,

impl<T> ToString for T where
T: Display + ?Sized,

impl<T, U> TryFrom<U> for T where
U: Into<T>,

impl<T, U> TryInto<U> for T where
U: TryFrom<T>,

impl<T> DeserializeOwned for T where
T: for<'de> Deserialize<'de>,