Struct SnapshotBase

pub struct SnapshotBase<const R: usize, ObservationType: Observation<R>, RewardType: Reward> {
    pub observation: ObservationType,
    pub reward: RewardType,
    pub status: EpisodeStatus,
}

Default snapshot implementation for standard reinforcement learning observations.

SnapshotBase stores an observation, reward, and EpisodeStatus. Construct via the named constructors running, terminated, or truncated.

§Type Parameters

R - The observation tensor rank
ObservationType - The type of observation (must implement Observation<R>)
RewardType - The type of reward (must implement Reward)
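The relationship between the three parameters can be sketched as follows. This is a minimal, self-contained illustration rather than the crate's own code: the `Observation` and `Reward` traits are reduced to empty stand-ins (the real traits may require methods not shown in this section), the `EpisodeStatus` variant names are assumed from the constructor names, and `FeatureVector`, `Grid`, and `rank` are hypothetical.

```rust
// Stand-in definitions so the sketch compiles on its own. The real
// `Observation` and `Reward` traits may require methods not shown here.
pub trait Observation<const R: usize> {}
pub trait Reward {}
impl Reward for f32 {}

// Variant names are an assumption based on the constructor names.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum EpisodeStatus {
    Running,
    Terminated,
    Truncated,
}

pub struct SnapshotBase<const R: usize, ObservationType: Observation<R>, RewardType: Reward> {
    pub observation: ObservationType,
    pub reward: RewardType,
    pub status: EpisodeStatus,
}

/// Hypothetical rank-1 observation: a flat vector of four features.
pub struct FeatureVector(pub [f32; 4]);
impl Observation<1> for FeatureVector {}

/// Hypothetical rank-2 observation: a 3x3 grid.
pub struct Grid(pub [[f32; 3]; 3]);
impl Observation<2> for Grid {}

/// Hypothetical helper that reads the rank back out of the snapshot type.
pub fn rank<const R: usize, O: Observation<R>, W: Reward>(_: &SnapshotBase<R, O, W>) -> usize {
    R
}

fn main() {
    // `R` must match the rank that the observation type implements.
    let flat: SnapshotBase<1, FeatureVector, f32> = SnapshotBase {
        observation: FeatureVector([0.0; 4]),
        reward: 0.0,
        status: EpisodeStatus::Running,
    };
    let grid: SnapshotBase<2, Grid, f32> = SnapshotBase {
        observation: Grid([[0.0; 3]; 3]),
        reward: 1.0,
        status: EpisodeStatus::Running,
    };
    // `SnapshotBase<2, FeatureVector, f32>` is rejected at compile time,
    // because `FeatureVector` implements `Observation<1>`, not `Observation<2>`.
    println!("flat rank = {}, grid rank = {}", rank(&flat), rank(&grid));
}
```

The fields are public, so a struct literal works in this sketch; the documented route is the named constructors.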

Fields§

§observation: ObservationType

The observation derived from the state.

§reward: RewardType

The reward received from the last action.

§status: EpisodeStatus

Episode lifecycle status.

Implementations§

impl<const R: usize, ObservationType: Observation<R>, RewardType: Reward> SnapshotBase<R, ObservationType, RewardType>

pub fn running(observation: ObservationType, reward: RewardType) -> Self

Snapshot for a step where the episode is still running.

pub fn terminated(observation: ObservationType, reward: RewardType) -> Self

Snapshot for the step on which the MDP reached a terminal state.

pub fn truncated(observation: ObservationType, reward: RewardType) -> Self

Snapshot for the step on which an external step limit was reached.
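A sketch of how an environment might choose among the three constructors. Everything here is a stand-in written for illustration: the traits are empty placeholders, the `EpisodeStatus` variant names are assumed, the constructor bodies are a plausible guess at what the real ones do, and `step` with its goal threshold and step budget is hypothetical.

```rust
// Stand-in traits and types; the real definitions may differ.
pub trait Observation<const R: usize> {}
pub trait Reward {}
impl Observation<1> for Vec<f64> {}
impl Reward for f64 {}

// Variant names are an assumption based on the constructor names.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum EpisodeStatus {
    Running,
    Terminated,
    Truncated,
}

pub struct SnapshotBase<const R: usize, O: Observation<R>, W: Reward> {
    pub observation: O,
    pub reward: W,
    pub status: EpisodeStatus,
}

// Assumed constructor bodies: each one only fixes the status field.
impl<const R: usize, O: Observation<R>, W: Reward> SnapshotBase<R, O, W> {
    pub fn running(observation: O, reward: W) -> Self {
        Self { observation, reward, status: EpisodeStatus::Running }
    }
    pub fn terminated(observation: O, reward: W) -> Self {
        Self { observation, reward, status: EpisodeStatus::Terminated }
    }
    pub fn truncated(observation: O, reward: W) -> Self {
        Self { observation, reward, status: EpisodeStatus::Truncated }
    }
}

/// Hypothetical 1-D task: the goal is reached once `position >= 1.0`,
/// and the episode is cut off after `max_steps` steps.
pub fn step(position: f64, t: usize, max_steps: usize) -> SnapshotBase<1, Vec<f64>, f64> {
    let obs = vec![position];
    if position >= 1.0 {
        // The MDP itself reached a terminal state.
        SnapshotBase::terminated(obs, 1.0)
    } else if t + 1 >= max_steps {
        // An external step limit ended the episode.
        SnapshotBase::truncated(obs, 0.0)
    } else {
        SnapshotBase::running(obs, 0.0)
    }
}

fn main() {
    for (position, t) in [(0.3, 2), (0.3, 9), (1.2, 4)] {
        let snap = step(position, t, 10);
        println!("pos={} t={} -> {:?} (reward {})", position, t, snap.status, snap.reward);
    }
}
```

Checking termination before truncation is a design choice in this sketch: when both conditions hold on the same step, the snapshot reports the terminal state rather than the step limit.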

Trait Implementations§

impl<const R: usize, ObservationType: Clone + Observation<R>, RewardType: Clone + Reward> Clone for SnapshotBase<R, ObservationType, RewardType>

fn clone(&self) -> SnapshotBase<R, ObservationType, RewardType>

Returns a duplicate of the value.

1.0.0 (const: unstable)

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl<const R: usize, ObservationType: Debug + Observation<R>, RewardType: Debug + Reward> Debug for SnapshotBase<R, ObservationType, RewardType>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
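The extra `Clone` and `Debug` bounds in the two impl headers above are what `#[derive(Clone, Debug)]` produces on a generic struct. Whether the crate uses the derive is an assumption; the sketch below uses stand-in traits and a hypothetical `Opaque` type to show the practical effect: the snapshot is cloneable and printable only when its observation and reward types are.

```rust
// Stand-in traits; the real ones may carry required methods.
pub trait Observation<const R: usize> {}
pub trait Reward {}
impl Observation<1> for [f32; 2] {}
impl Reward for f32 {}

// Variant names are an assumption based on the constructor names.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum EpisodeStatus {
    Running,
    Terminated,
    Truncated,
}

// The derive adds `Clone` / `Debug` bounds on each type parameter,
// matching the bounds shown in the impl headers.
#[derive(Clone, Debug)]
pub struct SnapshotBase<const R: usize, O: Observation<R>, W: Reward> {
    pub observation: O,
    pub reward: W,
    pub status: EpisodeStatus,
}

/// Hypothetical observation type that is neither `Clone` nor `Debug`.
pub struct Opaque;
impl Observation<1> for Opaque {}

fn main() {
    let snap: SnapshotBase<1, [f32; 2], f32> = SnapshotBase {
        observation: [1.0, 2.0],
        reward: 0.5,
        status: EpisodeStatus::Running,
    };
    let copy = snap.clone();
    println!("{:?}", copy);

    // Still a valid snapshot, but `opaque.clone()` and `{:?}` formatting
    // are rejected at compile time because `Opaque` lacks both traits.
    let opaque: SnapshotBase<1, Opaque, f32> = SnapshotBase {
        observation: Opaque,
        reward: 0.0,
        status: EpisodeStatus::Terminated,
    };
    let _ = opaque.status;
}
```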

impl<const R: usize, ObservationType: Observation<R>, RewardType: Reward> Snapshot<R> for SnapshotBase<R, ObservationType, RewardType>

type ObservationType = ObservationType

The observation type exposed to the agent at each step.

type RewardType = RewardType

The type of reward contained in this snapshot.

fn observation(&self) -> &Self::ObservationType

Access the observed state.

fn reward(&self) -> &Self::RewardType

Access the reward received.

fn status(&self) -> EpisodeStatus

Episode lifecycle status for this step.

fn is_done(&self) -> bool

true when the episode loop should stop.

fn is_terminated(&self) -> bool

true only for intrinsic MDP termination.

fn is_truncated(&self) -> bool

true only for extrinsic step-limit truncation.

fn metadata(&self) -> Option<&SnapshotMetadata>

Optional named reward components and position data.
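The split between `is_terminated` and `is_truncated` matters mostly to learning code. A common convention in reinforcement learning is to keep bootstrapping from the next state's value when an episode was merely cut off, and to stop bootstrapping only at a true terminal state. The sketch below illustrates that convention under stated assumptions: the trait is a reduced stand-in that drops the rank parameter `R` and the observation accessor for brevity, the default method bodies are inferred from the one-line descriptions above, and `Step` and `td_target` are hypothetical.

```rust
// Variant names are an assumption based on the constructor names.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum EpisodeStatus {
    Running,
    Terminated,
    Truncated,
}

/// Reduced stand-in for `Snapshot<R>`: the rank parameter and the
/// observation accessor are omitted to keep the sketch short.
pub trait Snapshot {
    type RewardType;
    fn reward(&self) -> &Self::RewardType;
    fn status(&self) -> EpisodeStatus;

    // Assumed semantics, inferred from the method descriptions.
    fn is_terminated(&self) -> bool {
        self.status() == EpisodeStatus::Terminated
    }
    fn is_truncated(&self) -> bool {
        self.status() == EpisodeStatus::Truncated
    }
    fn is_done(&self) -> bool {
        self.is_terminated() || self.is_truncated()
    }
}

/// Hypothetical snapshot type carrying only a reward and a status.
pub struct Step {
    pub reward: f64,
    pub status: EpisodeStatus,
}

impl Snapshot for Step {
    type RewardType = f64;
    fn reward(&self) -> &f64 {
        &self.reward
    }
    fn status(&self) -> EpisodeStatus {
        self.status
    }
}

/// One-step TD target: bootstrap from `next_value` unless the MDP terminated.
pub fn td_target<S: Snapshot<RewardType = f64>>(snap: &S, next_value: f64, gamma: f64) -> f64 {
    if snap.is_terminated() {
        *snap.reward()
    } else {
        *snap.reward() + gamma * next_value
    }
}

fn main() {
    let steps = [
        Step { reward: 1.0, status: EpisodeStatus::Running },
        Step { reward: 1.0, status: EpisodeStatus::Truncated },
    ];
    for s in &steps {
        println!("{:?}: target = {}", s.status(), td_target(s, 4.0, 0.5));
        // Both termination and truncation end the episode loop.
        if s.is_done() {
            break;
        }
    }
}
```

Here `is_done` controls the loop, while `is_terminated` alone decides whether the value estimate of the next state still contributes to the target.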

Auto Trait Implementations§

impl<const R: usize, ObservationType, RewardType> Freeze for SnapshotBase<R, ObservationType, RewardType>
where ObservationType: Freeze, RewardType: Freeze,

impl<const R: usize, ObservationType, RewardType> RefUnwindSafe for SnapshotBase<R, ObservationType, RewardType>
where ObservationType: RefUnwindSafe, RewardType: RefUnwindSafe,

impl<const R: usize, ObservationType, RewardType> Send for SnapshotBase<R, ObservationType, RewardType>
where RewardType: Send,

impl<const R: usize, ObservationType, RewardType> Sync for SnapshotBase<R, ObservationType, RewardType>
where RewardType: Sync,

impl<const R: usize, ObservationType, RewardType> Unpin for SnapshotBase<R, ObservationType, RewardType>
where ObservationType: Unpin, RewardType: Unpin,

impl<const R: usize, ObservationType, RewardType> UnsafeUnpin for SnapshotBase<R, ObservationType, RewardType>
where ObservationType: UnsafeUnpin, RewardType: UnsafeUnpin,

impl<const R: usize, ObservationType, RewardType> UnwindSafe for SnapshotBase<R, ObservationType, RewardType>
where ObservationType: UnwindSafe, RewardType: UnwindSafe,

Blanket Implementations§

impl<T> Adaptor<()> for T

fn adapt(&self)

Adapt the type to be passed to a metric.

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

impl<T> Downcast<T> for T

fn downcast(&self) -> &T

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoComptime for T

fn comptime(self) -> Self

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
