pub struct Sarsa {
pub q_table: Array2<f64>,
pub alpha: f64,
pub epsilon: f64,
pub gamma: f64,
}Expand description
Tabular SARSA agent (on-policy TD learning).
Fields§
§q_table: Array2<f64>Q-value table (n_states × n_actions).
alpha: f64Learning rate.
epsilon: f64ε-greedy exploration rate.
gamma: f64Discount factor.
Implementations§
Source§impl Sarsa
impl Sarsa
Sourcepub fn new(
n_states: usize,
n_actions: usize,
alpha: f64,
epsilon: f64,
gamma: f64,
) -> Self
pub fn new( n_states: usize, n_actions: usize, alpha: f64, epsilon: f64, gamma: f64, ) -> Self
Create a new SARSA agent.
Sourcepub fn update(
&mut self,
s: usize,
a: usize,
r: f64,
s_next: usize,
a_next: usize,
)
pub fn update( &mut self, s: usize, a: usize, r: f64, s_next: usize, a_next: usize, )
Apply one SARSA TD update.
Q(s,a) ← Q(s,a) + α [ r + γ Q(s',a') − Q(s,a) ]
Trait Implementations§
Auto Trait Implementations§
impl Freeze for Sarsa
impl RefUnwindSafe for Sarsa
impl Send for Sarsa
impl Sync for Sarsa
impl Unpin for Sarsa
impl UnsafeUnpin for Sarsa
impl UnwindSafe for Sarsa
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§impl<T> Pointable for T
impl<T> Pointable for T
Source§impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
Source§fn to_subset(&self) -> Option<SS>
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct
self from the equivalent element of its
superset. Read moreSource§fn is_in_subset(&self) -> bool
fn is_in_subset(&self) -> bool
Checks if
self is actually part of its subset T (and can be converted to it).Source§fn to_subset_unchecked(&self) -> SS
fn to_subset_unchecked(&self) -> SS
Use with care! Same as
self.to_subset but without any property checks. Always succeeds.Source§fn from_subset(element: &SS) -> SP
fn from_subset(element: &SS) -> SP
The inclusion map: converts
self to the equivalent element of its superset.