orx-concurrent-bag
An efficient, convenient and lightweight thread-safe grow-only collection, ideal for collecting results concurrently.
- convenient: the bag can be shared among threads simply as a shared reference, not even requiring
Arc, - efficient: allows copy-free collecting which makes it performant especially when the type to be collected is not very small (please see the benchmarks section for tradeoffs and details),
- lightweight: a minimalistic implementation.
The bag preserves the order of elements with respect to the order the push method is called.
Examples
Safety guarantees to push to the bag with an immutable reference makes it easy to share the bag among threads.
Using std::sync::Arc
Following the common approach of using an Arc, we can share our bag among threads and collect results concurrently.
use *;
use ;
let = ;
let bag = new;
let mut thread_vec: = Vecnew;
for i in 0..num_threads
for handle in thread_vec
let mut vec_from_bag: = unsafe .copied.collect;
vec_from_bag.sort;
let mut expected: = .flat_map.collect;
expected.sort;
assert_eq!;
Using std::thread::scope
An even more convenient approach would be to use thread scopes. This allows to use shared reference of the bag across threads, instead of Arc.
use *;
use thread;
let = ;
let bag = new;
let bag_ref = &bag; // just take a reference
scope;
let mut vec_from_bag: = bag.into_inner.iter.copied.collect;
vec_from_bag.sort;
let mut expected: = .flat_map.collect;
expected.sort;
assert_eq!;
Safety
ConcurrentBag uses a SplitVec as the underlying storage.
SplitVec implements PinnedVec which guarantees that elements which are already pushed to the vector stay pinned to their memory locations.
This feature makes it safe to grow with a shared reference on a single thread, as implemented by ImpVec.
In order to achieve this feature in a concurrent program, ConcurrentBag pairs the SplitVec with an AtomicUsize.
AtomicUsizefixes the target memory location of each element to be pushed at the time thepushmethod is called. Regardless of whether or not writing to memory completes before another element is pushed, every pushed element receives a unique position reserved for it.SplitVecguarantees that already pushed elements are not moved around in memory and new elements are written to the reserved position.
The approach guarantees that
- only one thread can write to the memory location of an element being pushed to the bag,
- at any point in time, only one thread is responsible for the allocation of memory if the bag requires new memory,
- no thread reads any of the written elements (reading happens after converting the bag
into_inner), - hence, there exists no race condition.
This pair allows a lightweight and convenient concurrent bag which is ideal for collecting results concurrently.
Write-Only vs Read-Write
The concurrent bag is write-only & grow-only bag which is convenient and efficient for collecting elements.
See ConcurrentVec for a read-and-write variant which
- guarantees that reading and writing never happen concurrently, and hence,
- allows safe iteration or access to already written elements of the concurrent vector,
- with a minor additional cost of values being wrapped by an
Option.
Benchmarks
You may see the benchmark at benches/grow.rs.
In this benchmark, concurrent results are collected using ConcurrentBag together with scoped threads and Arc. Computation time performance of these two is negligible, hence, only scoped thread implementation is reported. Results are compared by the collect method rayons parallel iterator.
We can see that:
rayonis extremely performant when the data size to be collected is small and there is a huge concurrency load. We can see that it outperformsConcurrentBagwhen the threads do not do any work at all to produce outputs and the output data isi32.- On the other hand, when there exists some work to be done to produce the outputs (workload),
ConcurrentBagstarts to perform significantly faster. - Similarly, when the output data is large (
[i32; 32]in this example), regardless of the additional workload,ConcurrentBagperforms faster.