Struct moka::future::Cache

source · [−]

pub struct Cache<K, V, S = RandomState> { /* private fields */ }

Available on crate feature future only.

Expand description

A thread-safe, futures-aware concurrent in-memory cache.

Cache supports full concurrency of retrievals and a high expected concurrency for updates. It can be accessed inside and outside of asynchronous contexts.

Cache utilizes a lock-free concurrent hash table as the central key-value storage. Cache performs a best-effort bounding of the map using an entry replacement algorithm to determine which entries to evict when the capacity is exceeded.

To use this cache, enable a crate feature called “future”.

Examples

Cache entries are manually added using an insert method, and are stored in the cache until either evicted or manually invalidated:

Inside an async context (async fn or async block), use insert, get_with or invalidate methods for updating the cache and await them.
Outside any async context, use blocking_insert or blocking_invalidate methods. They will block for a short time under heavy updates.

Here’s an example of reading and updating a cache by using multiple asynchronous tasks with Tokio runtime:

 // Cargo.toml
 //
 // [dependencies]
 // moka = { version = "0.8", features = ["future"] }
 // tokio = { version = "1", features = ["rt-multi-thread", "macros" ] }
 // futures-util = "0.3"

 use moka::future::Cache;

 #[tokio::main]
 async fn main() {
     const NUM_TASKS: usize = 16;
     const NUM_KEYS_PER_TASK: usize = 64;

     fn value(n: usize) -> String {
         format!("value {}", n)
     }

     // Create a cache that can store up to 10,000 entries.
     let cache = Cache::new(10_000);

     // Spawn async tasks and write to and read from the cache.
     let tasks: Vec<_> = (0..NUM_TASKS)
         .map(|i| {
             // To share the same cache across the async tasks, clone it.
             // This is a cheap operation.
             let my_cache = cache.clone();
             let start = i * NUM_KEYS_PER_TASK;
             let end = (i + 1) * NUM_KEYS_PER_TASK;

             tokio::spawn(async move {
                 // Insert 64 entries. (NUM_KEYS_PER_TASK = 64)
                 for key in start..end {
                     // insert() is an async method, so await it.
                     my_cache.insert(key, value(key)).await;
                     // get() returns Option<String>, a clone of the stored value.
                     assert_eq!(my_cache.get(&key), Some(value(key)));
                 }

                 // Invalidate every 4 element of the inserted entries.
                 for key in (start..end).step_by(4) {
                     // invalidate() is an async method, so await it.
                     my_cache.invalidate(&key).await;
                 }
             })
         })
         .collect();

     // Wait for all tasks to complete.
     futures_util::future::join_all(tasks).await;

     // Verify the result.
     for key in 0..(NUM_TASKS * NUM_KEYS_PER_TASK) {
         if key % 4 == 0 {
             assert_eq!(cache.get(&key), None);
         } else {
             assert_eq!(cache.get(&key), Some(value(key)));
         }
     }
 }

If you want to atomically initialize and insert a value when the key is not present, you might want to check other insertion methods get_with and try_get_with.

Avoiding to clone the value at `get`

The return type of get method is Option<V> instead of Option<&V>. Every time get is called for an existing key, it creates a clone of the stored value V and returns it. This is because the Cache allows concurrent updates from threads so a value stored in the cache can be dropped or replaced at any time by any other thread. get cannot return a reference &V as it is impossible to guarantee the value outlives the reference.

If you want to store values that will be expensive to clone, wrap them by std::sync::Arc before storing in a cache. Arc is a thread-safe reference-counted pointer and its clone() method is cheap.

Size-based Eviction

// Cargo.toml
//
// [dependencies]
// moka = { version = "0.8", features = ["future"] }
// tokio = { version = "1", features = ["rt-multi-thread", "macros" ] }
// futures-util = "0.3"

use std::convert::TryInto;
use moka::future::Cache;

#[tokio::main]
async fn main() {
    // Evict based on the number of entries in the cache.
    let cache = Cache::builder()
        // Up to 10,000 entries.
        .max_capacity(10_000)
        // Create the cache.
        .build();
    cache.insert(1, "one".to_string()).await;

    // Evict based on the byte length of strings in the cache.
    let cache = Cache::builder()
        // A weigher closure takes &K and &V and returns a u32
        // representing the relative size of the entry.
        .weigher(|_key, value: &String| -> u32 {
            value.len().try_into().unwrap_or(u32::MAX)
        })
        // This cache will hold up to 32MiB of values.
        .max_capacity(32 * 1024 * 1024)
        .build();
    cache.insert(2, "two".to_string()).await;
}

If your cache should not grow beyond a certain size, use the max_capacity method of the CacheBuilder to set the upper bound. The cache will try to evict entries that have not been used recently or very often.

At the cache creation time, a weigher closure can be set by the weigher method of the CacheBuilder. A weigher closure takes &K and &V as the arguments and returns a u32 representing the relative size of the entry:

If the weigher is not set, the cache will treat each entry has the same size of 1. This means the cache will be bounded by the number of entries.
If the weigher is set, the cache will call the weigher to calculate the weighted size (relative size) on an entry. This means the cache will be bounded by the total weighted size of entries.

Note that weighted sizes are not used when making eviction selections.

Time-based Expirations

Cache supports the following expiration policies:

Time to live: A cached entry will be expired after the specified duration past from insert.
Time to idle: A cached entry will be expired after the specified duration past from get or insert.

// Cargo.toml
//
// [dependencies]
// moka = { version = "0.8", features = ["future"] }
// tokio = { version = "1", features = ["rt-multi-thread", "macros" ] }
// futures-util = "0.3"

use moka::future::Cache;
use std::time::Duration;

#[tokio::main]
async fn main() {
    let cache = Cache::builder()
        // Time to live (TTL): 30 minutes
        .time_to_live(Duration::from_secs(30 * 60))
        // Time to idle (TTI):  5 minutes
        .time_to_idle(Duration::from_secs( 5 * 60))
        // Create the cache.
        .build();

    // This entry will expire after 5 minutes (TTI) if there is no get().
    cache.insert(0, "zero").await;

    // This get() will extend the entry life for another 5 minutes.
    cache.get(&0);

    // Even though we keep calling get(), the entry will expire
    // after 30 minutes (TTL) from the insert().
}

Thread Safety

All methods provided by the Cache are considered thread-safe, and can be safely accessed by multiple concurrent threads.

Cache<K, V, S> requires trait bounds Send, Sync and 'static for K (key), V (value) and S (hasher state).
Cache<K, V, S> will implement Send and Sync.

To share a cache across async tasks (or OS threads), do one of the followings:

Create a clone of the cache by calling its clone method and pass it to other task.
Wrap the cache by a sync::OnceCell or sync::Lazy from once_cell create, and set it to a static variable.

Cloning is a cheap operation for Cache as it only creates thread-safe reference-counted pointers to the internal data structures.

Hashing Algorithm

By default, Cache uses a hashing algorithm selected to provide resistance against HashDoS attacks. It will be the same one used by std::collections::HashMap, which is currently SipHash 1-3.

While SipHash’s performance is very competitive for medium sized keys, other hashing algorithms will outperform it for small keys such as integers as well as large keys such as long strings. However those algorithms will typically not protect against attacks such as HashDoS.

The hashing algorithm can be replaced on a per-Cache basis using the build_with_hasher method of the CacheBuilder. Many alternative algorithms are available on crates.io, such as the aHash crate.

Struct moka::future::Cache

Implementations

impl<K, V> Cache<K, V, RandomState> where K: Hash + Eq + Send + Sync + 'static, V: Clone + Send + Sync + 'static,

pub fn new(max_capacity: u64) -> Self

pub fn builder() -> CacheBuilder<K, V, Cache<K, V, RandomState>>

impl<K, V, S> Cache<K, V, S> where K: Hash + Eq + Send + Sync + 'static, V: Clone + Send + Sync + 'static, S: BuildHasher + Clone + Send + Sync + 'static,

pub fn contains_key<Q>(&self, key: &Q) -> bool where Arc<K>: Borrow<Q>, Q: Hash + Eq + ?Sized,

pub fn get<Q>(&self, key: &Q) -> Option<V> where Arc<K>: Borrow<Q>, Q: Hash + Eq + ?Sized,

pub async fn get_or_insert_with( &self, key: K, init: impl Future<Output = V>) -> V

pub async fn get_or_try_insert_with<F, E>( &self, key: K, init: F) -> Result<V, Arc<E>> where F: Future<Output = Result<V, E>>, E: Send + Sync + 'static,

pub async fn get_with(&self, key: K, init: impl Future<Output = V>) -> V

pub async fn get_with_if( &self, key: K, init: impl Future<Output = V>, replace_if: impl FnMut(&V) -> bool) -> V

pub async fn try_get_with<F, E>(&self, key: K, init: F) -> Result<V, Arc<E>> where F: Future<Output = Result<V, E>>, E: Send + Sync + 'static,

pub async fn insert(&self, key: K, value: V)

pub async fn invalidate<Q>(&self, key: &Q) where Arc<K>: Borrow<Q>, Q: Hash + Eq + ?Sized,

pub fn invalidate_all(&self)

pub fn invalidate_entries_if<F>( &self, predicate: F) -> Result<PredicateId, PredicateError> where F: Fn(&K, &V) -> bool + Send + Sync + 'static,

pub fn iter(&self) -> Iter<'_, K, V>ⓘNotable traits for Iter<'i, K, V>impl<'i, K, V> Iterator for Iter<'i, K, V> where K: Eq + Hash + Send + Sync + 'static, V: Clone + Send + Sync + 'static, type Item = (Arc<K>, V);

pub fn blocking(&self) -> BlockingOp<'_, K, V, S>

pub fn policy(&self) -> Policy

Trait Implementations

impl<K: Clone, V: Clone, S: Clone> Clone for Cache<K, V, S>

fn clone(&self) -> Cache<K, V, S>

fn clone_from(&mut self, source: &Self)

impl<K, V, S> ConcurrentCacheExt<K, V> for Cache<K, V, S> where K: Hash + Eq + Send + Sync + 'static, V: Send + Sync + 'static, S: BuildHasher + Clone + Send + Sync + 'static,

fn sync(&self)

impl<'a, K, V, S> IntoIterator for &'a Cache<K, V, S> where K: Hash + Eq + Send + Sync + 'static, V: Clone + Send + Sync + 'static, S: BuildHasher + Clone + Send + Sync + 'static,

type Item = (Arc<K>, V)

type IntoIter = Iter<'a, K, V>

fn into_iter(self) -> Self::IntoIter

impl<K, V, S> Send for Cache<K, V, S> where K: Send + Sync, V: Send + Sync, S: Send,

impl<K, V, S> Sync for Cache<K, V, S> where K: Send + Sync, V: Send + Sync, S: Sync,

Auto Trait Implementations

impl<K, V, S = RandomState> !RefUnwindSafe for Cache<K, V, S>

impl<K, V, S> Unpin for Cache<K, V, S>

impl<K, V, S = RandomState> !UnwindSafe for Cache<K, V, S>

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<K, V> Cache<K, V, RandomState> where
K: Hash + Eq + Send + Sync + 'static,
V: Clone + Send + Sync + 'static,

impl<K, V, S> Cache<K, V, S> where
K: Hash + Eq + Send + Sync + 'static,
V: Clone + Send + Sync + 'static,
S: BuildHasher + Clone + Send + Sync + 'static,

pub fn contains_key<Q>(&self, key: &Q) -> bool where
Arc<K>: Borrow<Q>,
Q: Hash + Eq + ?Sized,

pub fn get<Q>(&self, key: &Q) -> Option<V> where
Arc<K>: Borrow<Q>,
Q: Hash + Eq + ?Sized,

pub async fn get_or_insert_with(
&self,
key: K,
init: impl Future<Output = V>
) -> V

pub async fn get_or_try_insert_with<F, E>(
&self,
key: K,
init: F
) -> Result<V, Arc<E>> where
F: Future<Output = Result<V, E>>,
E: Send + Sync + 'static,

pub async fn get_with_if(
&self,
key: K,
init: impl Future<Output = V>,
replace_if: impl FnMut(&V) -> bool
) -> V

pub async fn try_get_with<F, E>(&self, key: K, init: F) -> Result<V, Arc<E>> where
F: Future<Output = Result<V, E>>,
E: Send + Sync + 'static,

pub async fn invalidate<Q>(&self, key: &Q) where
Arc<K>: Borrow<Q>,
Q: Hash + Eq + ?Sized,

pub fn invalidate_entries_if<F>(
&self,
predicate: F
) -> Result<PredicateId, PredicateError> where
F: Fn(&K, &V) -> bool + Send + Sync + 'static,

pub fn iter(&self) -> Iter<'_, K, V>ⓘNotable traits for Iter<'i, K, V>`impl<'i, K, V> Iterator for Iter<'i, K, V> where K: Eq + Hash + Send + Sync + 'static, V: Clone + Send + Sync + 'static, type Item = (Arc<K>, V);`

impl<K, V, S> ConcurrentCacheExt<K, V> for Cache<K, V, S> where
K: Hash + Eq + Send + Sync + 'static,
V: Send + Sync + 'static,
S: BuildHasher + Clone + Send + Sync + 'static,

impl<'a, K, V, S> IntoIterator for &'a Cache<K, V, S> where
K: Hash + Eq + Send + Sync + 'static,
V: Clone + Send + Sync + 'static,
S: BuildHasher + Clone + Send + Sync + 'static,

impl<K, V, S> Send for Cache<K, V, S> where
K: Send + Sync,
V: Send + Sync,
S: Send,

impl<K, V, S> Sync for Cache<K, V, S> where
K: Send + Sync,
V: Send + Sync,
S: Sync,

impl<T> Any for T where
T: 'static + ?Sized,

impl<T> Borrow<T> for T where
T: ?Sized,

impl<T> BorrowMut<T> for T where
T: ?Sized,

impl<T, U> Into<U> for T where
U: From<T>,

impl<T> ToOwned for T where
T: Clone,

impl<T, U> TryFrom<U> for T where
U: Into<T>,

impl<T, U> TryInto<U> for T where
U: TryFrom<T>,