Struct iset::IntervalMap

source ·

pub struct IntervalMap<T, V, Ix: IndexType = DefaultIx> { /* private fields */ }

Expand description

Map with interval keys (x..y).

Range bounds should implement PartialOrd and Copy, for example any integer or float types. However, you cannot use values that cannot be used in comparison (such as NAN), although infinity is allowed. There are no restrictions on values.

Example

 let mut map = iset::interval_map!{ 20..30 => 'a', 15..25 => 'b', 10..20 => 'c' };
 assert_eq!(map.insert(10..20, 'd'), Some('c'));
 assert_eq!(map.insert(5..15, 'e'), None);

 // Iterator over all pairs (range, value). Output is sorted.
 let a: Vec<_> = map.iter(..).collect();
 assert_eq!(a, &[(5..15, &'e'), (10..20, &'d'), (15..25, &'b'), (20..30, &'a')]);

 // Iterate over intervals that overlap query (..20 here). Output is sorted.
 let b: Vec<_> = map.intervals(..20).collect();
 assert_eq!(b, &[5..15, 10..20, 15..25]);

 assert_eq!(map[15..25], 'b');
 // Replace 15..25 => 'b' into 'z'.
 *map.get_mut(15..25).unwrap() = 'z';

 // Iterate over values that overlap query (20.. here). Output is sorted by intervals.
 let c: Vec<_> = map.values(20..).collect();
 assert_eq!(c, &[&'z', &'a']);

 // Remove 10..20 => 'd'.
 assert_eq!(map.remove(10..20), Some('d'));

Insertion, search and removal

All three operations take O(log N). By default, this crate does not allow duplicate keys, insert replaces and returns the old value if the interval was already present in the map. Note, that the key is not updated even if the value is replaced. This matters for types that can be == without being identical.

Search operations contains, get and get_mut is usually faster than insertion or removal, as the tree does not need to be rebalanced.

You can remove nodes from the tree using remove method given the interval key. Currently, it is not feasible to have a method that removes multiple nodes at once (for example based on a predicate).

It is possible to store entries with equal intervals by calling force_insert. This method should be used with care, as methods get, get_mut and remove only return/remove a single entry (see force_insert for more details). Nevertheless, functions values_at and values_mut_at allow to iterate over all values with exactly matching query, and remove_where allows to remove an entry with matching interval based on a predicate.

Additionally, it is possible to get or remove the entry with the smallest/largest interval in the map (in lexicographical order), see smallest, largest, etc. These methods take O(log N) as well.

Method range allows to extract interval range (min_start, max_end) in O(1). Method covered_len is designed to calculate the total length of a query that is covered by the intervals in the map. Method has_overlap allows to quickly find if the query overlaps any intervals in the map.

Iteration

Interval map allows to quickly find all intervals that overlap a query interval in O(log N + K) where K is the size of the output. All iterators traverse entries in a sorted order (sorted lexicographically by intervals). Iteration methods include:

iter: iterate over pairs (x..y, &value),
intervals: iterate over interval keys x..y,
values: iterate over values &value,
Mutable iterators iter_mut and values_mut,
Into iterators into_iter, into_intervals and into_values,
Iterators over values with exactly matching intervals values_at and values_mut_at.

Additionally, most methods have their unsorted_ counterparts (for example unsorted_iter). These iterators traverse the whole map in an arbitrary unsorted order. Although both map.iter(..) and map.unsorted_iter() output all entries in the map and both take O(N), unsorted iterator is slightly faster as it reads the memory consecutively instead of traversing the tree.

Methods iter, intervals, values, iter_mut and values_mut have alternatives overlap, overlap_intervals, …, that allow to iterate over all entries that cover a single point x (same as x..=x).

Index types

Every node in the tree stores three indices (to the parent and two children), and as a result, memory usage can be reduced by reducing index sizes. In most cases, number of items in the map does not exceed u32::MAX, therefore we store indices as u32 numbers by default (iset::DefaultIx = u32). You can use four integer types (u8, u16, u32 or u64) as index types. Number of elements in the interval map cannot exceed IndexType::MAX - 1: for example a map with u8 indices can store up to 255 items.

Using smaller index types saves memory and may reduce running time.

Interval map creation

An interval map can be created using the following methods:

use iset::{interval_map, IntervalMap};

// Creates an empty interval map with the default index type (u32):
let mut map = IntervalMap::new();
map.insert(10..20, 'a');

// Creates an empty interval map and specifies index type (u16 here):
let mut map = IntervalMap::<_, _, u16>::default();
map.insert(10..20, 'a');

let mut map = IntervalMap::<_, _, u16>::with_capacity(10);
map.insert(10..20, 'a');

// Creates an interval map with the default index type:
let map = interval_map!{ 0..10 => 'a', 5..15 => 'b' };

// Creates an interval map and specifies index type:
let map = interval_map!{ [u16] 0..10 => 'a', 5..15 => 'b' };

// Creates an interval map from a sorted iterator, takes O(N):
let vec = vec![(0..10, 'b'), (5..15, 'a')];
let map = IntervalMap::<_, _, u32>::from_sorted(vec.into_iter());

// Alternatively, you can use `.collect()` method that creates an interval map
// with the default index size. `Collect` does not require sorted intervals,
// but takes O(N log N).
let vec = vec![(5..15, 'a'), (0..10, 'b')];
let map: IntervalMap<_, _> = vec.into_iter().collect();

Entry API

IntervalMap implements Entry, for updating and inserting values directly after search was made.

let mut map = iset::IntervalMap::new();
map.entry(0..100).or_insert("abc".to_string());
map.entry(100..200).or_insert_with(|| "def".to_string());
let val = map.entry(200..300).or_insert(String::new());
*val += "ghi";
map.entry(200..300).and_modify(|s| *s += "jkl").or_insert("xyz".to_string());

assert_eq!(map[0..100], "abc");
assert_eq!(map[100..200], "def");
assert_eq!(map[200..300], "ghijkl");

Implementation, merge and split

To allow for fast retrieval of all intervals overlapping a query, we store the range of the subtree in each node of the tree. Additionally, each node stores indices to the parent and to two children. As a result, size of the map is approximately n * (4 * sizeof(T) + sizeof(V) + 3 * sizeof(Ix)), where n is the number of elements.

In order to reduce number of heap allocations and access memory consecutively, we store tree nodes in a vector. This does not impact time complexity of all methods except for merge and split. In a heap-allocated tree, merge takes O(M log (N / M + 1)) where M is the size of the smaller tree. Here, we are required to merge sorted iterators and construct a tree using the sorted iterator as input, which takes O(N + M).

Because of that, this crate does not implement merge or split, however, these procedures can be emulated using from_sorted, itertools::merge and Iterator::partition in linear time.

Struct iset::IntervalMap

Implementations§

impl<T: PartialOrd + Copy, V> IntervalMap<T, V>

pub fn new() -> Self

impl<T: PartialOrd + Copy, V, Ix: IndexType> IntervalMap<T, V, Ix>

pub fn with_capacity(capacity: usize) -> Self

pub fn from_sorted<I>(iter: I) -> Selfwhere I: Iterator<Item = (Range<T>, V)>,

pub fn len(&self) -> usize

pub fn is_empty(&self) -> bool

pub fn clear(&mut self)

pub fn shrink_to_fit(&mut self)

pub fn entry<'a>(&'a mut self, interval: Range<T>) -> Entry<'a, T, V, Ix>

pub fn insert(&mut self, interval: Range<T>, value: V) -> Option<V>

pub fn force_insert(&mut self, interval: Range<T>, value: V)

pub fn contains(&self, interval: Range<T>) -> bool

pub fn get(&self, interval: Range<T>) -> Option<&V>

pub fn get_mut(&mut self, interval: Range<T>) -> Option<&mut V>

pub fn remove(&mut self, interval: Range<T>) -> Option<V>

pub fn remove_where( &mut self, interval: Range<T>, predicate: impl FnMut(&V) -> bool ) -> Option<V>

pub fn range(&self) -> Option<Range<T>>

pub fn smallest(&self) -> Option<(Range<T>, &V)>

pub fn smallest_mut(&mut self) -> Option<(Range<T>, &mut V)>

pub fn remove_smallest(&mut self) -> Option<(Range<T>, V)>

pub fn largest(&self) -> Option<(Range<T>, &V)>

pub fn largest_mut(&mut self) -> Option<(Range<T>, &mut V)>

pub fn remove_largest(&mut self) -> Option<(Range<T>, V)>

pub fn has_overlap<R>(&self, query: R) -> boolwhere R: RangeBounds<T>,

pub fn iter<'a, R>(&'a self, query: R) -> Iter<'a, T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn intervals<'a, R>(&'a self, query: R) -> Intervals<'a, T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn values<'a, R>(&'a self, query: R) -> Values<'a, T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn iter_mut<'a, R>(&'a mut self, query: R) -> IterMut<'a, T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn values_mut<'a, R>(&'a mut self, query: R) -> ValuesMut<'a, T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn into_iter<R>(self, query: R) -> IntoIter<T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn into_intervals<R>(self, query: R) -> IntoIntervals<T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn into_values<R>(self, query: R) -> IntoValues<T, V, R, Ix> ⓘwhere R: RangeBounds<T>,

pub fn overlap<'a>(&'a self, point: T) -> Iter<'a, T, V, RangeInclusive<T>, Ix> ⓘ

pub fn intervals_overlap<'a>( &'a self, point: T ) -> Intervals<'a, T, V, RangeInclusive<T>, Ix> ⓘ

pub fn values_overlap<'a>( &'a self, point: T ) -> Values<'a, T, V, RangeInclusive<T>, Ix> ⓘ

pub fn overlap_mut<'a>( &'a mut self, point: T ) -> IterMut<'a, T, V, RangeInclusive<T>, Ix> ⓘ

pub fn values_overlap_mut<'a>( &'a mut self, point: T ) -> ValuesMut<'a, T, V, RangeInclusive<T>, Ix> ⓘ

pub fn values_at<'a>(&'a self, query: Range<T>) -> ValuesExact<'a, T, V, Ix> ⓘ

pub fn values_mut_at<'a>( &'a mut self, query: Range<T> ) -> ValuesExactMut<'a, T, V, Ix> ⓘ

pub fn unsorted_iter<'a>(&'a self) -> UnsIter<'a, T, V, Ix> ⓘ

pub fn unsorted_intervals<'a>(&'a self) -> UnsIntervals<'a, T, V, Ix> ⓘ

pub fn unsorted_values<'a>(&'a self) -> UnsValues<'a, T, V, Ix> ⓘ

pub fn unsorted_iter_mut<'a>(&'a mut self) -> UnsIterMut<'a, T, V, Ix> ⓘ

pub fn unsorted_values_mut<'a>(&'a mut self) -> UnsValuesMut<'a, T, V, Ix> ⓘ

pub fn unsorted_into_iter(self) -> UnsIntoIter<T, V, Ix> ⓘ

pub fn unsorted_into_intervals(self) -> UnsIntoIntervals<T, V, Ix> ⓘ

pub fn unsorted_into_values(self) -> UnsIntoValues<T, V, Ix> ⓘ

impl<T, V, Ix> IntervalMap<T, V, Ix>where T: PartialOrd + Copy + Default + AddAssign + Sub<Output = T>, Ix: IndexType,

pub fn covered_len<R>(&self, query: R) -> Twhere R: RangeBounds<T>,

Trait Implementations§

impl<T: Clone, V: Clone, Ix: Clone + IndexType> Clone for IntervalMap<T, V, Ix>

fn clone(&self) -> IntervalMap<T, V, Ix>

fn clone_from(&mut self, source: &Self)

impl<T: PartialOrd + Copy + Debug, V: Debug, Ix: IndexType> Debug for IntervalMap<T, V, Ix>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl<T: PartialOrd + Copy, V, Ix: IndexType> Default for IntervalMap<T, V, Ix>

fn default() -> Self

impl<T: PartialOrd + Copy, V> FromIterator<(Range<T>, V)> for IntervalMap<T, V>

fn from_iter<I>(iter: I) -> Selfwhere I: IntoIterator<Item = (Range<T>, V)>,

impl<T: PartialOrd + Copy, V, Ix: IndexType> Index<Range<T>> for IntervalMap<T, V, Ix>

type Output = V

fn index(&self, range: Range<T>) -> &Self::Output

impl<T: PartialOrd + Copy, V, Ix: IndexType> IntoIterator for IntervalMap<T, V, Ix>

type IntoIter = IntoIter<T, V, RangeFull, Ix>

type Item = (Range<T>, V)

fn into_iter(self) -> Self::IntoIter

Auto Trait Implementations§

impl<T, V, Ix> RefUnwindSafe for IntervalMap<T, V, Ix>where Ix: RefUnwindSafe, T: RefUnwindSafe, V: RefUnwindSafe,

impl<T, V, Ix> Send for IntervalMap<T, V, Ix>where Ix: Send, T: Send, V: Send,

impl<T, V, Ix> Sync for IntervalMap<T, V, Ix>where Ix: Sync, T: Sync, V: Sync,

impl<T, V, Ix> Unpin for IntervalMap<T, V, Ix>where Ix: Unpin, T: Unpin, V: Unpin,

impl<T, V, Ix> UnwindSafe for IntervalMap<T, V, Ix>where Ix: UnwindSafe, T: UnwindSafe, V: UnwindSafe,

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T