pub trait Pomdp: Environment<Feedback = Reward> { }
Expand description

A Partially Observable Markov Decision Process

An Environment with Reward feedback.

Implementors