Struct relearn::envs::UniformBernoulliBandits[][src]

pub struct UniformBernoulliBandits {
    pub num_arms: usize,
}
Expand description

A distribution over Beroulli bandit environments with uniformly sampled means.

The mean of each arm is sampled uniformly from [0, 1].

Reference

This environment distribution is used in the paper “RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning” by Duan et al.

Fields

num_arms: usize

Number of bandit arms.

Implementations

Trait Implementations

Returns a copy of the value. Read more

Performs copy-assignment from source. Read more

Formats the value using the given formatter. Read more

Returns the “default value” for a type. Read more

Space containing all possible observations. Read more

The space of all possible actions. Read more

A lower and upper bound on possible reward values. Read more

A discount factor applied to future rewards. Read more

Performs the conversion.

Feeds this value into the given Hasher. Read more

Feeds a slice of this type into the given Hasher. Read more

This method tests for self and other values to be equal, and is used by ==. Read more

This method tests for !=.

Sample a POMDP from the distribution. Read more

Update in-place from the given source value.

Auto Trait Implementations

Blanket Implementations

Gets the TypeId of self. Read more

Immutably borrows from an owned value. Read more

Mutably borrows from an owned value. Read more

Sample an environment from the distribution. Read more

Compare self to key and return true if they are equal.

Performs the conversion.

Performs the conversion.

The resulting type after obtaining ownership.

Creates owned data from borrowed data, usually by cloning. Read more

🔬 This is a nightly-only experimental API. (toowned_clone_into)

recently added

Uses borrowed data to replace owned data, usually by cloning. Read more

The type returned in the event of a conversion error.

Performs the conversion.

The type returned in the event of a conversion error.

Performs the conversion.

Apply an update from the given source value.