Struct relearn::envs::MemoryGame

source · [−]

pub struct MemoryGame {
    pub num_actions: usize,
    pub history_len: usize,
}

Expand description

Memory Game Environment

The agent must remember the inital state and choose the corresponding action as the final action in an episode.

The environment consists of (NUM_ACTIONS + HISTORY_LEN) states.
An episode starts in a state [0, NUM_ACTIONS) uniformly at random.
Step i in [0, HISTORY_LEN) transitions to state NUM_ACTIONS + i with 0 reward regardless of the action.
On step HISTORY_LEN, the agent chooses one of NUM_ACTIONS actions and if the action index matches the index of the inital state then the agent earns +1 reward, otherwise it earns -1 reward. This step is terminal.
Every episode has length HISTORY_LEN + 1.

Fields

num_actions: usize

The number of actions.

history_len: usize

Length of remembered history required to solve the environment.

Implementations

impl MemoryGame

pub const fn new(num_actions: usize, history_len: usize) -> Self

Create a new MemoryGame instance

Args

num_actions - Number of possible actions.
history_len - Length of remembered history required to solve the environment.

Trait Implementations

impl Clone for MemoryGame

fn clone(&self) -> MemoryGame

Returns a copy of the value. Read more

1.0.0 · source

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for MemoryGame

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl Default for MemoryGame

fn default() -> Self

Returns the “default value” for a type. Read more

impl<'de> Deserialize<'de> for MemoryGame

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error> where
__D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl EnvStructure for MemoryGame

type ObservationSpace = IndexSpace

type ActionSpace = IndexSpace

type FeedbackSpace = IntervalSpace<Reward>

fn observation_space(&self) -> Self::ObservationSpace

Space containing all possible observations. Read more

fn action_space(&self) -> Self::ActionSpace

The space of all possible actions. Read more

fn feedback_space(&self) -> Self::FeedbackSpace

The space of all possible feedback. Read more

fn discount_factor(&self) -> f64

A discount factor applied to future feedback. Read more

impl Environment for MemoryGame

type State = (usize, usize)

(current_state, initial_state)

type Observation = usize

Observation of the state provided to the agent.

type Action = usize

Action selected by the agent.

type Feedback = Reward

Feedback provided to a learning agent as the result of each step. Reward, for example. Read more

fn initial_state(&self, rng: &mut Prng) -> Self::State

Sample a state for the start of a new episode. Read more

fn observe(&self, state: &Self::State, _rng: &mut Prng) -> Self::Observation

Generate an observation for a given state.

fn step(
    &self,
    state: Self::State,
    action: &Self::Action,
    _: &mut Prng,
    _: &mut dyn StatsLogger
) -> (Successor<Self::State>, Self::Feedback)

Perform a state transition in reponse to an action. Read more

fn run<T, L>(self, actor: T, seed: SimSeed, logger: L) -> Steps<Self, T, Prng, L>ⓘNotable traits for Steps<E, T, R, L>`impl<E, T, R, L> Iterator for Steps<E, T, R, L> where     E: Environment,     T: Actor<E::Observation, E::Action>,     R: BorrowMut<Prng>,     L: StatsLogger, type Item = PartialStep<E::Observation, E::Action, E::Feedback>;` where
    T: Actor<Self::Observation, Self::Action>,
    L: StatsLogger,
    Self: Sized,

Run this environment with the given actor.

impl Hash for MemoryGame

fn hash<H: Hasher>(&self, state: &mut H)

Feeds this value into the given Hasher. Read more

1.3.0 · source

fn hash_slice<H>(data: &[Self], state: &mut H) where
H: Hasher,

Feeds a slice of this type into the given Hasher. Read more

impl PartialEq<MemoryGame> for MemoryGame

fn eq(&self, other: &MemoryGame) -> bool

This method tests for self and other values to be equal, and is used by ==. Read more

fn ne(&self, other: &MemoryGame) -> bool

This method tests for !=.

impl Serialize for MemoryGame

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error> where
S: Serializer,

Serialize this value into the given Serde serializer. Read more

impl CloneBuild for MemoryGame

impl Copy for MemoryGame

impl Eq for MemoryGame

impl StructuralEq for MemoryGame

impl StructuralPartialEq for MemoryGame

Auto Trait Implementations

impl RefUnwindSafe for MemoryGame

impl Send for MemoryGame

impl Sync for MemoryGame

impl Unpin for MemoryGame

impl UnwindSafe for MemoryGame

Blanket Implementations

impl<T> Any for T where
T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> AsAny for T where
T: Any,

fn as_any(&self) -> &(dyn Any + 'static)

Convert into an Any trait reference.

impl<T> Borrow<T> for T where
T: ?Sized,

const: unstable · source

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T where
T: ?Sized,

const: unstable · source

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<Q, K> Equivalent<K> for Q where
Q: Eq + ?Sized,
K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

impl<T> From<T> for T

const: unstable · source

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T where
U: From<T>,

const: unstable · source

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> Pointable for T

const ALIGN: usize = mem::align_of::<T>()

The alignment of pointer.

type Init = T

The type for initializers.

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more

impl<T> Same<T> for T

type Output = T

Should always be Self

impl<T> ToOwned for T where
T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T where
U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

const: unstable · source

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T where
U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

const: unstable · source

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T where
V: MultiLane<T>,

fn vzip(self) -> V

impl<T> DeserializeOwned for T where
T: for<'de> Deserialize<'de>,

impl<T> Pomdp for T where
T: Environment<Feedback = Reward>,

impl<T> PomdpStructure for T where
T: EnvStructure<FeedbackSpace = IntervalSpace<Reward>>,