Struct orx_selfref_col::SelfRefCol
source · pub struct SelfRefCol<'a, V, T, P = SplitVec<Node<'a, V, T>, Recursive>>{ /* private fields */ }
Expand description
SelfRefCol
is a core data structure to conveniently build safe and efficient self referential collections, such as linked lists and trees.
Note that this core structure is capable of representing a wide range of self referential collections, where the variant is conveniently defined by expressive trait type definitions.
The represented collections have the following features:
- Relations are represented by regular
&
references avoiding the need to use smart pointers such asBox
,Rc
,Arc
, etc. - The collection makes sure that these references are set only among elements of the collections. In other words, no external references are allowed, or references of elements of the collection cannot leak out. This constructs the safety guarantees.
- The elements of the collection are internally stored in a
PinnedVec
implementation, which is crucial for the correctness of the references. Furthermore, in this way, elements of the collection are stored close to each other rather than being in arbitrary locations in memory. This provides better cache locality when compared to such collections where elements are stored by arbitrary heap allocations.
The collection is defined by the following generic arguments:
T
: type of the elements stored in the collection.V
: type of theVariant
defining the structure of the collection with the following:V::Storage
: defines how the elements ofT
will be stored:NodeDataLazyClose
: elements are stored asOption<T>
allowing lazy node closure or element removal;NodeDataEagerClose
: elements are stored directly asT
.
V::Prev
: defines how references to previous elements will be stored.NodeRefNone
: there is no previous reference of elements.NodeRefSingle
: there is either one or no previous reference of elements, stored asOption<&Node>
.NodeRefsArray
: there are multiple possible previous references up to a constant numberN
, stored as[Option<&Node>; N]
.NodeRefsVec
: there are multiple possible previous references, stored asVec<&Node>
.
V::Next
: defines how references to next elements will be stored:- Similarly, represented as either one of
NodeRefNone
orNodeRefSingle
orNodeRefsArray
orNodeRefsVec
.
- Similarly, represented as either one of
V::Ends
: defines how references to ends of the collection will be stored:- Similarly, represented as either one of
NodeRefNone
orNodeRefSingle
orNodeRefsArray
orNodeRefsVec
.
- Similarly, represented as either one of
V::MemoryReclaim
: defines how memory of closed nodes will be reclaimed:MemoryReclaimNever
will never claim closed nodes.MemoryReclaimOnThreshold<D>
will claim memory of closed nodes whenever the ratio of closed nodes exceeds one over2^D
.
§Example
Consider the following four structs implementing Variant
to define four different self referential collections.
Note that the definitions are expressive and concise leading to efficient implementations.
use orx_selfref_col::*;
struct SinglyListVariant;
impl<'a, T: 'a> Variant<'a, T> for SinglyListVariant {
type Storage = NodeDataLazyClose<T>; // lazy close
type MemoryReclaim = MemoryReclaimOnThreshold<2>; // closed nodes will be reclaimed when utilization drops below 75%
type Prev = NodeRefNone; // previous nodes are not stored
type Next = NodeRefSingle<'a, Self, T>; // there is only one next node, if any
type Ends = NodeRefSingle<'a, Self, T>; // there is only one end, namely the front of the list
}
struct DoublyListVariant;
impl<'a, T: 'a> Variant<'a, T> for DoublyListVariant {
type Storage = NodeDataLazyClose<T>; // lazy close
type MemoryReclaim = MemoryReclaimOnThreshold<3>; // closed nodes will be reclaimed when utilization drops below 87.5%
type Prev = NodeRefSingle<'a, Self, T>; // there is only one previous node, if any
type Next = NodeRefSingle<'a, Self, T>; // there is only one next node, if any
type Ends = NodeRefsArray<'a, 2, Self, T>; // there are two ends, namely the front and back of the list
}
struct BinaryTreeVariant;
impl<'a, T: 'a> Variant<'a, T> for BinaryTreeVariant {
type Storage = NodeDataLazyClose<T>; // lazy close
type MemoryReclaim = MemoryReclaimOnThreshold<1>; // closed nodes will be reclaimed when utilization drops below 50%
type Prev = NodeRefSingle<'a, Self, T>; // there is only one previous node, namely parent node, if any
type Next = NodeRefsArray<'a, 2, Self, T>; // there are 0, 1 or 2 next or children nodes
type Ends = NodeRefSingle<'a, Self, T>; // there is only one end, namely the root of the tree
}
struct DynamicTreeVariant;
impl<'a, T: 'a> Variant<'a, T> for DynamicTreeVariant {
type Storage = NodeDataLazyClose<T>; // lazy close
type MemoryReclaim = MemoryReclaimNever; // closed nodes will be left as holes
type Prev = NodeRefSingle<'a, Self, T>; // there is only one previous node, namely parent node, if any
type Next = NodeRefsVec<'a, Self, T>; // there might be any number of next nodes, namely children nodes
type Ends = NodeRefSingle<'a, Self, T>; // there is only one end, namely the root of the tree
}
§Crates using SelfRefCol
The following crates use SelfRefCol
to conveniently build the corresponding data structure:
- https://crates.io/crates/orx-linked-list: implements singly and doubly linked lists.
Implementations§
source§impl<'a, V, T, P> SelfRefCol<'a, V, T, P>
impl<'a, V, T, P> SelfRefCol<'a, V, T, P>
sourcepub fn node_utilization(&self) -> f32
pub fn node_utilization(&self) -> f32
Returns the node utilization as a fraction of active nodes to the used nodes:
- 1.0 when there is no closed node;
- 0.0 when all used memory is used by closed nodes.
source§impl<'a, V, T, P> SelfRefCol<'a, V, T, P>
impl<'a, V, T, P> SelfRefCol<'a, V, T, P>
sourcepub fn ends(&self) -> &V::Ends
pub fn ends(&self) -> &V::Ends
Returns a reference to the ends of the self referential collection.
Ends represent special references of the self referential structure.
It can be nothing; i.e., NodeRefNone
; however, they are common in such structures.
For instance,
- ends of a singly linked list is the front of the list which can be represented as a
NodeRefSingle
reference; - ends of a doubly linked list contains two references, front and back of the list which can be represented by a
NodeRefsArray<2, _, _>
; - ends of a tree is the root which can again be represented as a
NodeRefSingle
reference.
Ends of a SelfRefCol
is generic over NodeRefs
trait which can be decided on the structure’s requirement.
sourcepub fn is_empty(&self) -> bool
pub fn is_empty(&self) -> bool
Returns whether or not the self referential collection is empty.
sourcepub fn clear(&mut self)
pub fn clear(&mut self)
Clears the collection: clears all elements and the ends of the collection.
sourcepub fn move_mutate<Move>(
&mut self,
value_to_move: Move,
move_mutate_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: Move)
)
pub fn move_mutate<Move>( &mut self, value_to_move: Move, move_mutate_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: Move) )
One of the five reference mutation methods:
SelfRefCol::move_mutate
SelfRefCol::mutate_take
SelfRefCol::move_mutate_take
SelfRefCol::mutate_filter_collect
This method takes the following arguments:
value_to_move
is, as the name suggests, a value to be moved to the mutation lambda. Most common reason to move this value to the lambda is to add it to the self referential collection.move_mutate_lambda
is the expression defining the mutation.- The lambda takes two parameters.
- The first parameter is the
SelfRefColMut
type which is the key for allSelfRefNode
mutation methods to provide safety guarantees. - The second parameter is the value moved into the lambda, which is exactly the
value_to_move
parameter of this method. - Note that the lambda is of a function pointer type; i.e.,
fn
, rather than a function trait such asFnOnce
. This is intentional and critical in terms of the safety guarantees. Its purpose is to prevent capturing data from the environment, as well as, prevent leaking vector references to outside of the lambda.
This design allows to conveniently mutate the references within the vector without the complexity of lifetimes and borrow checker. Prior references can be broken, references can be rearranged or new references can be built easily and all in one function. This convenience while being safe is achieved by the encapsulation of all mutations within a non-capturing lambda.
§Example
The following code block demonstrates the use of the move_mutate
function to define the push-front method of a singly linked list.
Note that self.col
below is a SelfRefCol
.
The pushed value
is moved to the lambda.
Inside the lambda, this value is pushed to the list, which is stored inside a linked list node.
Links and ends (front of the singly linked list) are updated by using the reference to the newly pushed node.
pub fn push_front(&mut self, value: T) {
self.col
.move_mutate(value, |x, value| match x.ends().front() {
Some(prior_front) => {
let new_front = x.push_get_ref(value);
new_front.set_next(&x, prior_front);
x.set_ends(new_front);
}
None => {
let node = x.push_get_ref(value);
x.set_ends(node);
}
});
}
sourcepub fn mutate_take(
&mut self,
mutate_take_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>) -> Option<T>
) -> Option<T>
pub fn mutate_take( &mut self, mutate_take_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>) -> Option<T> ) -> Option<T>
One of the five reference mutation methods:
SelfRefCol::move_mutate
SelfRefCol::mutate_take
SelfRefCol::move_mutate_take
SelfRefCol::mutate_filter_collect
SelfRefCol::move_append_mutate
This method takes one argument:
mutate_get_lambda
is the expression defining the mutation.- The lambda takes one parameter of
SelfRefColMut
type which is the key for allSelfRefNode
mutation methods to provide safety guarantees. - It returns an optional value of
T
which is the underlying element type of the self referential collection. - Note that the lambda is of a function pointer type; i.e.,
fn
, rather than a function trait such asFnOnce
. This is intentional and critical in terms of the safety guarantees. Its purpose is to prevent capturing data from the environment, as well as, prevent leaking vector references to outside of the lambda.
- The lambda takes one parameter of
This method is the counterpart of the move_mutate
.
Its main purpose is to take an element out of the collection and return it while preserving the internal references through the mutations.
This design allows to conveniently mutate the references within the vector without the complexity of lifetimes and borrow checker. Prior references can be broken, references can be rearranged or new references can be built easily and all in one function. This convenience while being safe is achieved by the encapsulation of all mutations within a non-capturing lambda.
§Example
The following code block demonstrates the use of the mutate_take
function to define the pop-front method of a singly linked list.
Note that self.vec
below is a SelfRefCol
.
Mutations are applied only if the vector is non-empty; i.e., there exists a front.
When this is the case, the ends reference (front of the list) is updated.
Furthermore, the prior-front’s underlying data is taken out and returned from the lambda.
This, in turn, is returned from the pop_front
method, demonstrating safely removing elements from the self referential collection.
pub fn pop_front(&mut self) -> Option<T> {
self.col.mutate_take(|x| {
x.ends().front().map(|prior_front| {
let new_front = *prior_front.next().get();
x.set_ends(new_front);
prior_front.close_node_take_data(&x)
})
})
}
sourcepub fn move_mutate_take<Move>(
&mut self,
value_to_move: Move,
move_mutate_take_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: Move) -> Option<T>
) -> Option<T>
pub fn move_mutate_take<Move>( &mut self, value_to_move: Move, move_mutate_take_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: Move) -> Option<T> ) -> Option<T>
One of the five reference mutation methods:
SelfRefCol::move_mutate
SelfRefCol::mutate_take
SelfRefCol::move_mutate_take
SelfRefCol::mutate_filter_collect
SelfRefCol::move_append_mutate
This method can be considered as the combination of move_mutate
and mutate_take
where we:
- move a value to the lambda, and hence, to the vector,
- apply reference mutations encapsulated inside the lambda,
- take out one element from the vector.
This method is most suitable for swap methods.
§Example
The following code block demonstrates the use of move_mutate_take
function to define the swap-front method of a singly linked list.
- when the list is empty, we simply push the value as the first element and return None;
- otherwise, we take out the value of the current front node, replace it with the given value, and return the taken out value.
pub fn swap_front(&mut self, new_front: T) -> Option<T> {
self.col
.move_mutate_take(new_front, |x, value| match x.ends().front() {
Some(front_node) => Some(front_node.swap_data(&x, value)),
None => {
let node = x.push_get_ref(value);
x.set_ends(node);
None
}
})
}
sourcepub fn mutate_filter_collect<Predicate, Collect>(
&mut self,
predicate: &Predicate,
collect: &mut Collect,
mutate_filter_collect_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: &Predicate, _: &mut Collect)
)
pub fn mutate_filter_collect<Predicate, Collect>( &mut self, predicate: &Predicate, collect: &mut Collect, mutate_filter_collect_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, P>, _: &Predicate, _: &mut Collect) )
One of the five reference mutation methods:
SelfRefCol::move_mutate
SelfRefCol::mutate_take
SelfRefCol::move_mutate_take
SelfRefCol::mutate_filter_collect
SelfRefCol::move_append_mutate
This method takes three arguments:
predicate
is the function to be used to select elements to be kept.collect
is the closure to collect the elements which does not satisfy the predicate and will be removed from this collection.mutate_filter_collect_lambda
is the expression defining the retain together with the mutation.- In addition to
predicate
andcollect
, the lambda takesSelfRefColMut
type which is the key for allSelfRefNode
mutation methods to provide safety guarantees. - Note that the lambda is of a function pointer type; i.e.,
fn
, rather than a function trait such asFnOnce
. This is intentional and critical in terms of the safety guarantees. Its purpose is to prevent capturing data from the environment, as well as, prevent leaking vector references to outside of the lambda.
- In addition to
This method is a generalization of mutate_take
which returns the element removed from the collection.
In this method collect(T)
is called on removed elements.
This method might be doing nothing to drop the removed values, or might be pushing them to a captured collection such as a vector.
Note that the signature of Collect
is FnMut(T)
; this makes sure that the function is called with removed/owned element values making sure that no references can leak out.
This design allows to conveniently mutate the references within the vector without the complexity of lifetimes and borrow checker. Prior references can be broken, references can be rearranged or new references can be built easily and all in one function. This convenience while being safe is achieved by the encapsulation of all mutations within a non-capturing lambda.
§Examples
use orx_selfref_col::*;
#[derive(Debug)]
struct Var;
impl<'a> Variant<'a, String> for Var {
type Storage = NodeDataLazyClose<String>;
type Prev = NodeRefSingle<'a, Self, String>;
type Next = NodeRefsVec<'a, Self, String>;
type Ends = NodeRefsArray<'a, 2, Self, String>;
type MemoryReclaim = MemoryReclaimNever;
}
// build up collection
let mut col = SelfRefCol::<Var, _>::new();
let values = ['a', 'b', 'c', 'd', 'e'];
col.move_mutate(values.map(|x| x.to_string()), |x, vals| {
for value in vals {
let _ = x.push_get_ref(value);
}
});
let taboo_list = ['a', 's', 'd', 'f'];
let taboo_list = taboo_list.map(|x| x.to_string());
let is_allowed = |c: &String| !taboo_list.contains(c);
let mut collected = vec![];
let mut collect = |c| collected.push(c);
col.mutate_filter_collect(&is_allowed, &mut collect, |x, predicate, collect| {
for i in 0..x.len() {
let node = x.get_node(i).expect("is-some");
if let Some(value) = node.data() {
if !predicate(value) {
collect(node.close_node_take_data(&x));
}
}
}
});
assert_eq!(3, col.len());
assert_eq!(&['a'.to_string(), 'd'.to_string()], collected.as_slice());
source§impl<'a, V, T> SelfRefCol<'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>where
V: Variant<'a, T>,
impl<'a, V, T> SelfRefCol<'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>where
V: Variant<'a, T>,
sourcepub fn move_append_mutate<Move>(
&mut self,
other: Self,
value_to_move: Move,
move_append_mutate_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>, _: SelfRefColMut<'_, 'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>, _: Move)
)
pub fn move_append_mutate<Move>( &mut self, other: Self, value_to_move: Move, move_append_mutate_lambda: fn(_: SelfRefColMut<'_, 'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>, _: SelfRefColMut<'_, 'a, V, T, SplitVec<Node<'a, V, T>, Recursive>>, _: Move) )
One of the five reference mutation methods:
SelfRefCol::move_mutate
SelfRefCol::mutate_take
SelfRefCol::move_mutate_take
SelfRefCol::mutate_filter_collect
SelfRefCol::move_append_mutate
This method appends to self referential collections.
Note that this method is available in self referential collections using an underlying pinned vector with orx_split_vec::Recursive
growth.
This allows appending underlying vectors in constant time.
This method takes the following arguments:
other
is the other self referential collection to be appended to this collection.value_to_move
is, as the name suggests, a value to be moved to the mutation lambda.move_append_mutate_lambda
is the expression defining the mutation.- The lambda takes three parameters.
- The first parameter is the
SelfRefColMut
type which is the key for allSelfRefNode
mutation methods to provide safety guarantees. - The second parameter is simply the same; however, of the
other
collection. - The third parameter is the value moved into the lambda, which is exactly the
value_to_move
parameter of this method. - Note that the lambda is of a function pointer type; i.e.,
fn
, rather than a function trait such asFnOnce
. This is intentional and critical in terms of the safety guarantees. Its purpose is to prevent capturing data from the environment, as well as, prevent leaking vector references to outside of the lambda.
This design allows to conveniently mutate the references within the vector without the complexity of lifetimes and borrow checker. Prior references can be broken, references can be rearranged or new references can be built easily and all in one function. This convenience while being safe is achieved by the encapsulation of all mutations within a non-capturing lambda.
§Example
The following code block demonstrates the use of the move_append_mutate
function to define the append-front method of a singly linked list.
The method appends the other
list to the front of the self
list in O(1) time complexity.
Note that appending the underlying storages are handled automatically by SelfRefCol
.
The lambda, taking mutation keys of both collections being merged, is responsible for fixing references.
In this example, one next-relation is established and ends (front and back) of the list are updated.
pub fn append_front(&mut self, other: Self) {
self.col.move_append_mutate(other.col, (), |x, y, _| {
match (x.ends().front(), y.ends().back()) {
(Some(a), Some(b)) => {
b.set_next(&x, a);
x.set_ends([y.ends().front(), x.ends().back()]);
}
(None, Some(_)) => {
x.set_ends([y.ends().front(), y.ends().back()]);
}
_ => {}
};
None
});
}