Expand description
make creating and synchronizing threads ergonomic, therefore fun!
This is the synchronization library as part of the ergo
crates ecosystem. It contains useful
types, traits and functions for spawning threads and synchronizing them. It is named sync
because of std::sync
and because it is not async, which is/will be a separate part of the
ergo ecocystem.
This provides ergonomic access to threading/synchronization primitives and macros. It does not provide an opinion on which threading primitives you use. See the following crates:
rayon
for procesing data structures in parallel. Note that rayon cannot be used for generic iterators (likerecv.iter()
).may
for stackful coroutines, similar to golang’s goroutines.crossbeam_utils
for scoped threads.
However, please note that in most cases using spawn
with channels and num_cpus
is sufficient for performing most tasks. Obviously if you are a server servicing 100+
clients, or doing big data analysis, or have other specific requirements then you want more
specialized concurrency primitives, which the above can provide separately from this crate.
§Thankyou
The crates that are wraped/exported are:
crossbeam_channel
: Multi-producer multi-consumer channels for message passingnum_cpus
: Get the number of CPUs in Rusttaken
: Macros for taking ownership
Consider supporting their development individually and starring them on github.
§How to Use
Use this library with:
#[macro_use] extern crate ergo_sync;
use ergo_sync::*;
§Types Functions and Modules
ch
module: for channel types (also see thech!
andselect_loop!
macros).spawn
: the standadstd::thread::spawn
which spawns a regular OS thread. The advantage of this (over scoped threads) is that it can outlive the current function. The disadvantage is that as far as the compiler knows it always outlives the current function, meaning it must own all of its variables (or they have to be'static
).num_cpus
: for getting the number of cpus when creating your own thread pools.std_prelude
: Various concurrency related types fromstd_prelude
including:Atomic*
,Mutex
,Arc
for concurrency safe typessleep
and (redefined non-deprecated)sleep_ms
.
In addition it provides the following helper macros:
ch!
:Use with channels with ergonomic syntax and panic with helpful error messages when sending/receiving on a channel is invalid.ch!(send <- 42)
for sending a value.let v = ch!(<- recv)
for receiving a value.ch!(! <- recv)
to wait for channels to close.<-?
for async operation support.
ch_try!
: to handle an expression that could beErr
and send it over a channel if it is.select_loop!
: for selecting from multiple channels.take!
: for expressing ownership consisely. You will move or clone variables extremely often in threads, this helps you express that better thanlet value = value
.
§Examples
§Example: Channels
See the docs for the ch
module.
§Example: producer / consumer
The producer/consumer model is this library’s bread and butter. Once you understand channels you should next learn producer/consumer.
In the ergo_sync
model you should:
- Do “CPU work” by spawning up to
num_cpus::get()
threads. - Do “IO work” using between 4 - 16 threads since most storage devices only provide up to that many channels. I personally prefer to use 8.
A typical application might look like this:
+-----------------------+
| Get paths to parse |
| (typically one thread |
| using walkdir which |
| is rediculously fast) |
| Send them via channel |
+-----------------------+
___|___
/ | \
v v v
+------------------------+
| 4-16 threads receiving |
| paths via channels and |
| reading raw strings. |
| |
| These are sent to next |
| stage via channels |
+------------------------+
___|___
/ | \
v v v
+------------------------+
| num_cpu threads |
| reading the string |
| iterators and |
| processing them. |
| |
| This is pure cpu work. |
+------------------------+
|
|
v
+------------------------+
| Collect results in the |
| current thread to |
| prepare for next step |
+------------------------+
This example basically implements the above example using the source code of this crate as the example. The below code searches through the crate source looking for every use of the word “example”.
Note: it is recommended to use
ergo_fs
to do filesystem operations, as all errors will have the context (path and action) of what caused the error and you will have access to best in class filesystem operations like walking the directory structure and expressing the types you expect. We do not use it here so we can focus onergo_sync
’s API.
#[macro_use] extern crate ergo_sync;
use std::fs;
use std::io;
use std::io::prelude::*;
use std::path::{Path, PathBuf};
use ergo_sync::*;
/// List the dir and return any paths found
fn read_paths<P: AsRef<Path>>(
dir: P, send_paths: &Sender<PathBuf>,
errs: &Sender<io::Error>,
) {
for entry in ch_try!(errs, fs::read_dir(dir), return) {
let entry = ch_try!(errs, entry, continue);
let meta = ch_try!(errs, entry.metadata(), continue);
if meta.is_file() {
ch!(send_paths <- entry.path());
} else if meta.is_dir() {
// recurse into the path
read_paths(entry.path(), send_paths, errs);
} else {
// ignore symlinks for this example
}
}
}
/// Send one line at a time from the file
fn read_lines(path: PathBuf, send_lines: &Sender<String>, errs: &Sender<io::Error>) {
let file = ch_try!(errs, fs::File::open(path), return);
let buf = io::BufReader::new(file);
for line in buf.lines() {
// send the line but return immediately if any `io::Error` is hit
ch!(send_lines <- ch_try!(errs, line, return));
}
}
/// Parse each line for "example", counting the number of times it appears.
fn count_examples(line: &str) -> u64 {
// Probably use the `regex` crate in a real life example.
line.match_indices("example").count() as u64
}
fn main() {
let (recv_count, handle_errs) = {
// This scope will drop channels that we are not returning.
// This prevents deadlock, as recv channels will not stop
// blocking until all their send counterparts are dropped.
let (send_errs, recv_errs) = ch::bounded(128);
let (send_paths, recv_paths) = ch::bounded(128);
// First we spawn a single thread to handle errors.
// In this case we will just count and log them.
let handle_errs = spawn(|| {
take!(recv_errs);
let mut count = 0_u64;
for err in recv_errs.iter() {
eprintln!("ERROR: {}", err);
count += 1;
}
count
});
// We spawn a single thread to "walk" the directory for paths.
let errs = send_errs.clone();
spawn(|| {
take!(send_paths, errs);
read_paths("src", &send_paths, &errs);
});
// We read the lines using 8 threads (since this is IO bound)
let (send_lines, recv_lines) = ch::bounded(128);
for _ in 0..8 {
take!(=recv_paths, =send_lines, =send_errs);
spawn(|| {
take!(recv_paths, send_lines, send_errs);
for path in recv_paths {
read_lines(path, &send_lines, &send_errs);
}
});
}
// Now we do actual "CPU work" using the rayon thread pool
let (send_count, recv_count) = ch::bounded(128);
// Create a pool of threads for actually doing the "work"
for _ in 0..num_cpus::get() {
take!(=recv_lines, =send_count);
spawn(move || {
for line in recv_lines.iter() {
let count = count_examples(&line);
if count != 0 {
ch!(send_count <- count);
}
}
});
}
(recv_count, handle_errs)
};
// Finally we can get our count.
let count: u64 = recv_count.iter().sum();
// And assert we had no errors
assert_eq!(0, handle_errs.finish());
}
Re-exports§
pub extern crate crossbeam_channel;
pub extern crate num_cpus;
pub extern crate std_prelude;
Modules§
- ch
- Module for working with channels. Rexport of
crossbeam_channel
Macros§
- ch
- Use with channels with ergonomic syntax and panic with helpful error messages when sending/receiving on a channel is invalid.
- ch_try
- Handle an expression that could be
Err
and send it over a channel if it is.
Structs§
- Arc
- A thread-safe reference-counting pointer. ‘Arc’ stands for ‘Atomically Reference Counted’.
- Atomic
Bool - A boolean type which can be safely shared between threads.
- Atomic
Isize - An integer type which can be safely shared between threads.
- Atomic
Usize - An integer type which can be safely shared between threads.
- Duration
- A
Duration
type to represent a span of time, typically used for system timeouts. - Mutex
- A mutual exclusion primitive useful for protecting shared data
Enums§
- Atomic
Ordering - Atomic memory orderings
Constants§
- ATOMIC_
USIZE_ INIT Deprecated - An
AtomicUsize
initialized to0
.
Traits§
- Finish
Handle - Convinience trait mimicking
std::thread::JoinHandle
with better ergonomics.
Functions§
- sleep
- Puts the current thread to sleep for at least the specified amount of time.
- sleep_
ms - Just sleep for a certain number of milliseconds.
- spawn
- Spawns a new thread, returning a
JoinHandle
for it.