Trait ReplayBuffer

Source

pub trait ReplayBuffer<O, A>where
    O: Clone + Send + Sync,
    A: Clone + Send + Sync,{
    // Required methods
    fn push(&mut self, experience: Experience<O, A>);
    fn sample(
        &self,
        batch_size: usize,
        rng: &mut impl Rng,
    ) -> Vec<Experience<O, A>>;
    fn len(&self) -> usize;
    fn capacity(&self) -> Option<usize>;

    // Provided methods
    fn is_empty(&self) -> bool { ... }
    fn is_full(&self) -> bool { ... }
    fn ready_for(&self, batch_size: usize) -> bool { ... }
}

Expand description

A buffer that stores past experience for agent training.

Used primarily by off-policy algorithms (DQN, SAC, TD3) to break temporal correlations between training samples. On-policy algorithms (PPO, A2C) typically collect fixed-length trajectories instead and don’t need this trait — they can use a plain Vec<Experience<O, A>>.

§Implementing this trait

The most common implementation is a circular buffer with a fixed capacity that overwrites the oldest experience when full. Concrete implementations live in ember-rl, not here.

§Bounds

O: Clone + Send + Sync and A: Clone + Send + Sync are required because sampling returns owned Experience values (not references), and buffers may be accessed across threads during async training.