
Crate prefix_trie


This crate provides prefix-map and prefix-set collections for IP prefixes and other fixed-width prefix types. PrefixMap is backed by a compact TreeBitMap-style trie and supports exact, longest-prefix, and shortest-prefix matches. The crate supports both IPv4 and IPv6 (from either ipnet, ipnetwork, or cidr). It also supports any tuple (R, u8), where R is any unsigned primitive integer (u8, u16, u32, u64, u128, or usize).
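The three match kinds can be pinned down with a naive reference implementation. The sketch below is an illustration over `(u32, u8)` tuples using a linear scan; it is not how the crate performs lookups, and the names `contains`, `longest_match`, and `shortest_match` are hypothetical helpers, not part of the crate's API.

```rust
/// A prefix as (address bits, prefix length) over 32-bit keys.
/// The length is assumed to be at most 32.
type Prefix = (u32, u8);

/// Returns true if `outer` contains `inner` (a prefix contains itself).
fn contains(outer: Prefix, inner: Prefix) -> bool {
    let (o_addr, o_len) = outer;
    let (i_addr, i_len) = inner;
    if o_len > i_len {
        return false;
    }
    // Mask with the top `o_len` bits set; a zero-length prefix matches everything.
    let mask = if o_len == 0 { 0 } else { u32::MAX << (32 - u32::from(o_len)) };
    (o_addr & mask) == (i_addr & mask)
}

/// Longest stored prefix that contains `query` (linear scan, illustration only).
fn longest_match(table: &[Prefix], query: Prefix) -> Option<Prefix> {
    table.iter().copied().filter(|p| contains(*p, query)).max_by_key(|p| p.1)
}

/// Shortest stored prefix that contains `query` (linear scan, illustration only).
fn shortest_match(table: &[Prefix], query: Prefix) -> Option<Prefix> {
    table.iter().copied().filter(|p| contains(*p, query)).min_by_key(|p| p.1)
}
```

An exact match is simply equality of the canonical `(address, length)` pair. Note that the query is itself a prefix, so it may be shorter than a host address.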

Prefixes are not stored verbatim. They are reconstructed from their trie position when returned from map and set operations, so host bits outside the prefix length are not preserved.
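To make the loss of host bits concrete, here is a minimal sketch of the canonical form for a `(u32, u8)` prefix. The function name `canonicalize` is hypothetical and only illustrates the masking; the crate derives the same result from the trie position rather than by calling such a helper.

```rust
/// Clears every bit of `addr` beyond the first `len` bits.
/// `len` is assumed to be at most 32.
fn canonicalize(addr: u32, len: u8) -> (u32, u8) {
    // A zero-length prefix keeps no address bits at all.
    let mask = if len == 0 { 0 } else { u32::MAX << (32 - u32::from(len)) };
    (addr & mask, len)
}
```

Under this sketch, inserting 192.168.1.23/24 and reading the key back would yield 192.168.1.0/24.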

This crate also provides joint::JointPrefixMap and joint::JointPrefixSet, each of which contains two tables: one for IPv4 and one for IPv6.
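The two-table idea can be sketched with standard-library types only. `JointTable` below is a hypothetical stand-in that uses two `HashMap`s and dispatches on the address family; it says nothing about the actual fields or methods of the joint types.

```rust
use std::collections::HashMap;
use std::net::{IpAddr, Ipv4Addr, Ipv6Addr};

/// Two independent tables behind one interface, selected by address family.
/// (Hypothetical illustration; not the crate's joint map.)
#[derive(Default)]
struct JointTable<V> {
    ipv4: HashMap<(Ipv4Addr, u8), V>,
    ipv6: HashMap<(Ipv6Addr, u8), V>,
}

impl<V> JointTable<V> {
    /// Inserts into the table that matches the address family.
    fn insert(&mut self, addr: IpAddr, len: u8, value: V) -> Option<V> {
        match addr {
            IpAddr::V4(a) => self.ipv4.insert((a, len), value),
            IpAddr::V6(a) => self.ipv6.insert((a, len), value),
        }
    }

    /// Exact-match lookup in the table that matches the address family.
    fn get(&self, addr: IpAddr, len: u8) -> Option<&V> {
        match addr {
            IpAddr::V4(a) => self.ipv4.get(&(a, len)),
            IpAddr::V6(a) => self.ipv6.get(&(a, len)),
        }
    }

    /// Total number of entries across both tables.
    fn len(&self) -> usize {
        self.ipv4.len() + self.ipv6.len()
    }
}
```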

The library ip_network_table-deps-treebitmap provides the same data structure (called TreeBitMap below) and uses a similar algorithm. The following table compares the performance and memory usage of the two libraries and relates them to the HashMap and BTreeMap of the standard library. Throughput is reported relative to HashMap (1.00x = HashMap speed), with absolute throughput in parentheses. Bold marks the fastest implementation per row.

All benchmarks use prefixes from a RIPE RIS peer snapshot (1,042,024 IPv4 prefixes or 246,174 IPv6 prefixes). See benches/benchmark.rs and benches/memory.rs for details. The results below were obtained on an AMD EPYC server.

| Benchmark | HashMap | PrefixMap | TreeBitMap | BTreeMap |
|---|---|---|---|---|
| Lookup, random access, IPv4 | 1.00x (7.4 Mops) | **1.92x (14.2 Mops)** | 1.15x (8.5 Mops) | 0.46x (3.4 Mops) |
| Lookup, random access, IPv6 | **1.00x (11.0 Mops)** | 0.97x (10.7 Mops) | 0.58x (6.4 Mops) | 0.45x (4.9 Mops) |
| Lookup, RIS updates, IPv4 | 1.00x (17.5 Mops) | **1.69x (29.5 Mops)** | 0.78x (13.7 Mops) | 0.47x (8.2 Mops) |
| Lookup, RIS updates, IPv6 | **1.00x (24.8 Mops)** | 0.63x (15.7 Mops) | 0.33x (8.2 Mops) | 0.32x (7.9 Mops) |
| Insert & Remove, random access, IPv4 | 1.00x (7.4 Mops) | **1.04x (7.7 Mops)** | 0.89x (6.6 Mops) | 0.43x (3.2 Mops) |
| Insert & Remove, random access, IPv6 | **1.00x (10.8 Mops)** | 0.48x (5.2 Mops) | 0.44x (4.7 Mops) | 0.39x (4.3 Mops) |
| Insert & Remove, RIS updates, IPv4 | **1.00x (17.1 Mops)** | 0.88x (15.0 Mops) | 0.71x (12.2 Mops) | 0.47x (8.0 Mops) |
| Insert & Remove, RIS updates, IPv6 | **1.00x (25.0 Mops)** | 0.33x (8.3 Mops) | 0.29x (7.3 Mops) | 0.31x (7.7 Mops) |
| Create, random order, IPv4 | 1.00x (7.8 Mops) | **1.13x (8.8 Mops)** | 0.95x (7.4 Mops) | 0.55x (4.3 Mops) |
| Create, random order, IPv6 | **1.00x (11.4 Mops)** | 0.52x (5.9 Mops) | 0.43x (4.9 Mops) | 0.42x (4.8 Mops) |
| Create, sorted order, IPv4 | 1.00x (10.3 Mops) | **1.45x (14.9 Mops)** | 1.04x (10.7 Mops) | 0.85x (8.8 Mops) |
| Create, sorted order, IPv6 | **1.00x (11.7 Mops)** | 0.70x (8.2 Mops) | 0.55x (6.5 Mops) | 0.51x (6.0 Mops) |
| Create, scattered order, IPv4 | 1.00x (10.3 Mops) | **1.02x (10.5 Mops)** | 0.76x (7.8 Mops) | 0.34x (3.4 Mops) |
| Create, scattered order, IPv6 | **1.00x (11.6 Mops)** | 0.59x (6.9 Mops) | 0.47x (5.5 Mops) | 0.46x (5.4 Mops) |
| Memory, IPv4 | 26.0 MB | 12.0 MB (set: 4.0 MB) | 11.0 MB | 16.4 MB |
| Memory, IPv6 | 12.5 MB | 6.0 MB (set: 4.0 MB) | 4.3 MB | 8.1 MB |

In addition to outperforming TreeBitMap in the benchmarks above, prefix-trie includes a PrefixSet analogous to std::collections::HashSet. Set operations are exposed through composable trie views, so operations such as union, intersection, difference, covering union, and covering difference can be combined without building temporary maps. The interface of prefix-trie resembles that of std::collections, and its longest-prefix matching accepts arbitrary prefixes as queries, not only individual host addresses.

§Description of the Tree

PrefixMap stores the logical binary prefix trie in multi-bit nodes. Each internal node covers five consecutive binary-trie levels. A node at depth d can hold values for prefixes with lengths d..=d+4, and it has up to 32 child slots for subtries rooted at depth d+5.
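As a rough sketch of this layout, the function below splits a `(u32, u8)` prefix into the child slots taken on the way down, the depth of the final node, and a value slot inside it. The slot numbering `(1 << r) - 1 + bits` for a remainder of `r` bits is an assumption chosen for illustration (it enumerates the 31 value positions of a five-level node level by level); the crate's internal numbering may differ. The names `bits_at` and `locate` are hypothetical.

```rust
/// Number of binary-trie levels covered by one multi-bit node.
const STRIDE: u32 = 5;

/// Extracts `count` bits (1..=5) of `addr`, starting `offset` bits from the
/// most significant bit. Returns 0 when `count` is 0.
fn bits_at(addr: u32, offset: u32, count: u32) -> u32 {
    if count == 0 {
        return 0;
    }
    (addr << offset) >> (32 - count)
}

/// Returns (child slots on the path, depth of the final node, value slot).
/// `len` is assumed to be at most 32.
fn locate(addr: u32, len: u8) -> (Vec<u32>, u32, u32) {
    let len = u32::from(len);
    // Every full group of five bits selects one of 32 child slots.
    let full = len / STRIDE;
    let path = (0..full).map(|i| bits_at(addr, i * STRIDE, STRIDE)).collect();
    // The remaining 0..=4 bits select a value slot inside the final node.
    let depth = full * STRIDE;
    let rem = len - depth;
    let slot = (1 << rem) - 1 + bits_at(addr, depth, rem);
    (path, depth, slot)
}
```

For 192.168.1.0/24, this sketch descends through four child slots and ends in a node at depth 20, where the last four bits pick one of the 16 slots on the deepest value level.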

Each node stores two bitmaps: one for the value slots that are present in the node, and one for the child slots that are present below it. The allocators store multi-bit nodes and value cells in compact, linearized arrays, which improves cache locality and keeps lookup and traversal decisions local to a node. Physical slots are derived from the bitmaps with a popcount, avoiding one pointer per possible branch.
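The popcount step can be sketched in a few lines. `packed_index` is a hypothetical helper, not a function of the crate: given a presence bitmap and a logical slot, it returns the position of that slot in an array that stores only the present slots.

```rust
/// Position of `slot` in a packed array that holds only the present slots,
/// or `None` if the bit for `slot` is not set in `bitmap`.
fn packed_index(bitmap: u32, slot: u32) -> Option<usize> {
    debug_assert!(slot < 32);
    let bit = 1u32 << slot;
    if (bitmap & bit) == 0 {
        return None;
    }
    // The rank of the slot is the number of present slots before it.
    Some((bitmap & (bit - 1)).count_ones() as usize)
}
```

With this scheme, a node with three present children needs storage for three entries rather than 32 pointers.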

A stored entry is identified by its path through the trie and by a value bit inside the final multi-bit node. The prefix object passed to insert is not stored alongside the value. Returned prefixes are therefore reconstructed and canonicalized to the prefix length.
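Reconstruction can be sketched as the inverse of such a position encoding. The function below assumes, purely for illustration, that value slots inside a node are numbered level by level (slot 0 for the node's own prefix, slots 1..=2 for one extra bit, and so on up to slots 15..=30 for four extra bits); `reconstruct` is a hypothetical name and the crate's real encoding may differ.

```rust
/// Rebuilds a canonical `(addr, len)` from the child slots on the path and
/// the value slot in the final node. Assumes `slot < 31`, each child slot
/// below 32, and a resulting length of at most 32 bits.
fn reconstruct(path: &[u32], slot: u32) -> (u32, u8) {
    // Level inside the node: floor(log2(slot + 1)), in 0..=4.
    let rem = 31 - (slot + 1).leading_zeros();
    // Bits of the prefix at that level.
    let bits = slot + 1 - (1 << rem);
    let len = 5 * path.len() as u32 + rem;
    // Concatenate five bits per child slot, then the remaining bits.
    let mut acc: u64 = 0;
    for &child in path {
        acc = (acc << 5) | u64::from(child);
    }
    acc = (acc << rem) | u64::from(bits);
    // Left-align the collected bits; everything below stays zero.
    let addr = if len == 0 { 0 } else { (acc << (32 - len)) as u32 };
    (addr, len as u8)
}
```

Because only `len` bits are ever collected, the low bits of the result are always zero, which is why the returned prefix is canonical by construction.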

§Traversals

Iterators traverse the logical prefix trie in lexicographic order and yield reconstructed owned prefixes together with references or owned values. Complete iteration is linear in the number of stored entries and trie nodes visited.

Set operations use the same view infrastructure. union, intersection, difference, covering_union, and covering_difference traverse the involved trie views together and yield results in lexicographic order. Covering variants also report longest-prefix matches from the opposite side where appropriate.
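Why ordered traversal makes these operations cheap can be seen on plain sorted lists. The sketch below assumes that the lexicographic order corresponds to sorting canonical `(address, length)` tuples, which places a prefix before everything it contains. It merges two sorted slices in one pass; the crate walks trie views instead, and `union_sorted` and `intersection_sorted` are hypothetical names.

```rust
use std::cmp::Ordering;

/// Union of two sorted, duplicate-free prefix lists, produced in sorted order.
fn union_sorted(a: &[(u32, u8)], b: &[(u32, u8)]) -> Vec<(u32, u8)> {
    let (mut i, mut j) = (0, 0);
    let mut out = Vec::with_capacity(a.len() + b.len());
    while i < a.len() && j < b.len() {
        match a[i].cmp(&b[j]) {
            Ordering::Less => { out.push(a[i]); i += 1; }
            Ordering::Greater => { out.push(b[j]); j += 1; }
            Ordering::Equal => { out.push(a[i]); i += 1; j += 1; }
        }
    }
    // At most one of the two tails is non-empty.
    out.extend_from_slice(&a[i..]);
    out.extend_from_slice(&b[j..]);
    out
}

/// Intersection of two sorted, duplicate-free prefix lists.
fn intersection_sorted(a: &[(u32, u8)], b: &[(u32, u8)]) -> Vec<(u32, u8)> {
    let (mut i, mut j) = (0, 0);
    let mut out = Vec::new();
    while i < a.len() && j < b.len() {
        match a[i].cmp(&b[j]) {
            Ordering::Less => i += 1,
            Ordering::Greater => j += 1,
            Ordering::Equal => { out.push(a[i]); i += 1; j += 1; }
        }
    }
    out
}
```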

§Trie Views

TrieView is a trait for immutable, mutable, and composed cursors into a trie. Concrete leaf views are TrieRef, created from &PrefixMap or &PrefixSet, and TrieRefMut, created from mutable references. Both are obtained through the AsView trait: call map.view() for a full-trie view or map.view_at(&prefix) for a non-empty subtrie.

Views can be rooted at a prefix even when no value is stored exactly at that prefix. If the prefix falls inside an existing multi-bit node, the view masks that node’s value and child bitmaps so that iteration and search stay inside the requested subtrie. Composed views such as trieview::UnionView, trieview::IntersectionView, and trieview::DifferenceView also implement TrieView, so view operations can be chained before iterating.
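The masking step can be illustrated as follows. The sketch assumes a level-by-level numbering of the 31 value slots (slot `(1 << l) - 1 + bits` for a node-relative prefix of `l` bits) and child slots indexed by the five bits below the node; both are assumptions for illustration, and `subtrie_masks` is a hypothetical helper rather than part of the crate.

```rust
/// Bitmasks that select the value slots and child slots of a five-level node
/// lying inside the subtrie rooted at the node-relative prefix (`bits`, `level`).
fn subtrie_masks(bits: u32, level: u32) -> (u32, u32) {
    assert!(level < 5 && bits < (1 << level));
    let mut value_mask = 0u32;
    for l in level..5 {
        // Descendants on level `l` form a contiguous run of 2^(l - level) slots.
        let run = 1u32 << (l - level);
        let first = (1u32 << l) - 1 + (bits << (l - level));
        value_mask |= ((1u32 << run) - 1) << first;
    }
    // Child slots share the same leading `level` bits as the view root.
    let run = 1u32 << (5 - level);
    let ones = if run == 32 { u32::MAX } else { (1u32 << run) - 1 };
    let child_mask = ones << (bits << (5 - level));
    (value_mask, child_mask)
}
```

A view would then AND these masks with the node's stored bitmaps, so that entries outside the requested subtrie appear absent without any copying.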

§Operations on the Tree

Most point operations are bounded by prefix width, not by the number of stored entries. Let w be the number of bits in the prefix representation, and let h = ceil((w + 1) / 5) be the maximum number of multi-bit nodes on a search path. For IPv4, h <= 7; for IPv6, h <= 26. Let n be the number of stored entries, and let v be the number of trie nodes visited by a traversal.
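The bound on h follows from counting levels: prefix lengths 0..=w span w + 1 levels, and each node covers five of them. A small check of the two figures quoted above, with `max_path_nodes` as a hypothetical helper name:

```rust
/// Maximum number of five-level nodes on a search path for `w`-bit prefixes,
/// i.e. ceil((w + 1) / 5) computed in integer arithmetic.
fn max_path_nodes(w: u32) -> u32 {
    (w + 1 + 4) / 5
}
```

For example, a /32 IPv4 prefix passes through six child slots (30 bits) and stores its value in a seventh node at depth 30.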

| Operation | Complexity |
|---|---|
| len, is_empty, mem_size | O(1) |
| get, get_mut, contains_key | O(h) |
| get_lpm, get_spm, cover | O(h) |
| entry, insert | O(h) |
| remove, remove_keep_tree | O(h) |
| children, view_at | O(h) to create, then linear in the subtrie |
| iter, keys, values | O(n + v) for a complete traversal |
| retain, clear | O(n + v) |
| remove_children | O(h + m) where m is the removed subtrie size |
| union, intersection, difference, … | linear in the trie portions visited |
| Operations on an occupied map::Entry | O(1) after the entry lookup |
| Inserting through a vacant map::Entry | O(h) worst case |

There are three removal styles:

  • PrefixMap::remove removes an entry and restores the tree structure as if the value had never been inserted. It may remove now-empty multi-bit nodes and compact their allocator blocks.
  • PrefixMap::remove_children removes all entries contained within the given prefix, including entries stored in the same multi-bit node and in child nodes below it.
  • PrefixMap::remove_keep_tree removes only the value and may leave empty trie nodes in place.
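The difference between the three styles is easiest to see on a plain binary trie with one bit per level. The sketch below is a simplified model, not the crate's multi-bit implementation: `Node` and its methods are hypothetical, paths are given as slices of 0/1 branch indices, and this `remove_children` merely resets the subtrie root without pruning it.

```rust
/// One node of a plain binary trie (one bit per level, illustration only).
#[derive(Default)]
struct Node {
    value: Option<u32>,
    children: [Option<Box<Node>>; 2],
}

impl Node {
    fn is_empty(&self) -> bool {
        self.value.is_none() && self.children.iter().all(Option::is_none)
    }

    /// Number of allocated nodes in this subtrie, including this node.
    fn node_count(&self) -> usize {
        1 + self.children.iter().flatten().map(|c| c.node_count()).sum::<usize>()
    }

    fn insert(&mut self, path: &[usize], value: u32) {
        match path.split_first() {
            None => self.value = Some(value),
            Some((&bit, rest)) => self.children[bit]
                .get_or_insert_with(Box::default)
                .insert(rest, value),
        }
    }

    /// Removes the value and prunes nodes that became empty along the path.
    fn remove(&mut self, path: &[usize]) -> Option<u32> {
        let (&bit, rest) = match path.split_first() {
            Some(split) => split,
            None => return self.value.take(),
        };
        let child = self.children[bit].as_mut()?;
        let old = child.remove(rest);
        if child.is_empty() {
            self.children[bit] = None;
        }
        old
    }

    /// Removes only the value and leaves the node structure untouched.
    fn remove_keep_tree(&mut self, path: &[usize]) -> Option<u32> {
        match path.split_first() {
            None => self.value.take(),
            Some((&bit, rest)) => self.children[bit].as_mut()?.remove_keep_tree(rest),
        }
    }

    /// Drops the value at `path` and everything below it.
    fn remove_children(&mut self, path: &[usize]) {
        match path.split_first() {
            None => *self = Node::default(),
            Some((&bit, rest)) => {
                if let Some(child) = self.children[bit].as_mut() {
                    child.remove_children(rest);
                }
            }
        }
    }
}
```

In this model, keeping the tree avoids deallocation work when the same prefixes are likely to be inserted again, at the cost of leaving empty nodes for traversals to walk over.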

Re-exports§

pub use map::PrefixMap;
pub use set::PrefixSet;
pub use trieview::AsView;
pub use trieview::TrieRef;
pub use trieview::TrieRefMut;
pub use trieview::TrieView;

Modules§

joint
Module that defines the joint version of a prefix map and set, including all helper functions. You can access each individual table of the prefix map, allowing you to perform the usual set operations.
map
This module contains the implementation for the Dense Prefix Map.
set
Prefix set implemented on top of PrefixMap.
trieview
Composable trie-view trait for crate::PrefixMap.

Traits§

Prefix
A fixed-width prefix key.