Chain Environment
Consists of n states in a line with 2 actions.
- Action 0 moves back to the start for 2 reward.
- Action 1 moves forward for 0 reward in all states but the last. In the last state, taking action 1 is a self-transition with 10 reward.
- Every action has a 0.2 chance of “slipping” and taking the opposite action.
Described in “Bayesian Q-learning” by Dearden, Friedman and Russell (1998).
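The dynamics above can be sketched as a small transition function. This is a hypothetical standalone sketch, not the crate's actual implementation; the `slipped` flag stands in for the 0.2-probability slip that the real environment draws from its RNG:

```rust
/// Hypothetical sketch of the Chain dynamics described above.
/// States are 0..size-1; `forward` is action 1, `!forward` is action 0.
/// `slipped` models the 0.2-chance slip: when true, the opposite
/// action is taken instead of the chosen one.
fn chain_step(size: usize, state: usize, forward: bool, slipped: bool) -> (usize, f64) {
    let forward = forward != slipped; // a slip flips the effective action
    if forward {
        if state + 1 < size {
            (state + 1, 0.0) // move forward for 0 reward
        } else {
            (state, 10.0) // last state: self-transition with 10 reward
        }
    } else {
        (0, 2.0) // move back to the start for 2 reward
    }
}

fn main() {
    assert_eq!(chain_step(5, 2, true, false), (3, 0.0));  // forward
    assert_eq!(chain_step(5, 4, true, false), (4, 10.0)); // last state
    assert_eq!(chain_step(5, 3, false, false), (0, 2.0)); // back to start
    assert_eq!(chain_step(5, 3, true, true), (0, 2.0));   // slip flips forward to back
}
```

The slip probability is what makes the exploration problem interesting: a greedy agent that has only seen the early states prefers the immediate reward of action 0 and never discovers the 10-reward state at the end.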
Fields
size: usize
discount_factor: f64
Implementations
Trait Implementations
impl<'de> Deserialize<'de> for Chain

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.
impl EnvStructure for Chain

type ObservationSpace = IndexSpace
type ActionSpace = IndexedTypeSpace<Move>
type FeedbackSpace = IntervalSpace<Reward>

fn observation_space(&self) -> Self::ObservationSpace
Space containing all possible observations.

fn action_space(&self) -> Self::ActionSpace
The space of all possible actions.

fn feedback_space(&self) -> Self::FeedbackSpace
The space of all possible feedback.

fn discount_factor(&self) -> f64
A discount factor applied to future feedback.
impl Environment for Chain

type Observation = usize
Observation of the state provided to the agent.

type Action = Move
Action selected by the agent.

fn initial_state(&self, _: &mut Prng) -> Self::State
Sample a state for the start of a new episode.

fn observe(&self, state: &Self::State, _: &mut Prng) -> Self::Observation
Generate an observation for a given state.
fn step(
    &self,
    state: Self::State,
    action: &Self::Action,
    rng: &mut Prng,
    _: &mut dyn StatsLogger
) -> (Successor<Self::State>, Self::Feedback)
Perform a state transition in response to an action.
fn run<T, L>(self, actor: T, seed: SimSeed, logger: L) -> Steps<Self, T, Prng, L>
where
    T: Actor<Self::Observation, Self::Action>,
    L: StatsLogger,
    Self: Sized,
Run this environment with the given actor.

Returns a `Steps` iterator over `PartialStep<Self::Observation, Self::Action, Self::Feedback>` items.
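As a rough illustration of the step-iterator pattern that `run` abstracts, here is a self-contained sketch of driving the chain with an always-forward actor and accumulating feedback. It is plain Rust using none of the crate's `Steps`/`Actor` types, and slipping is omitted for determinism:

```rust
// Hypothetical sketch of the loop that `run` abstracts: repeatedly
// query an actor for an action, step the environment, and accumulate
// feedback. The "actor" here always chooses the forward action.
fn run_forward(size: usize, steps: usize) -> f64 {
    let mut state = 0usize;
    let mut total_reward = 0.0;
    for _ in 0..steps {
        let (next, reward) = if state + 1 < size {
            (state + 1, 0.0) // forward move, no reward
        } else {
            (state, 10.0) // self-transition in the last state
        };
        total_reward += reward;
        state = next;
    }
    total_reward
}

fn main() {
    // With size 5: four forward moves (0 reward each), then six
    // self-transitions in the last state at 10 reward each.
    assert_eq!(run_forward(5, 10), 60.0);
}
```

The real `run` additionally threads the `Prng` (seeded via `SimSeed`) and a `StatsLogger` through each step, and yields the steps lazily as an iterator rather than summing them.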
impl CloneBuild for Chain
impl Copy for Chain
impl StructuralPartialEq for Chain
Auto Trait Implementations
impl RefUnwindSafe for Chain
impl Send for Chain
impl Sync for Chain
impl Unpin for Chain
impl UnwindSafe for Chain
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,

fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.