Expand description
§Ratel
A library for running multi-armed bandit simulations.
The simulation has two components: a Bandit with some number of arms which each has a payout according to some distribution, and an Agent that uses some strategy to learn the optimal bandit arm. A Game then consists of an Agent learning about a Bandit by pulling arms according to its preferred strategy.
Structs§
- Binomial
Bandit - A bandit whose arms distribute rewards according to the binomial distributions.
- Epsilon
Greedy Agent - Agent that follows the Epsilon-Greedy Algorithm.
- Exponential
Bandit - A bandit whose arms distribute rewards according to the exponential distributions.
- Game
- Structure to make the Agent interact with the Bandit.
- Gamma
Bandit - A bandit whose arms distribute rewards according to the gamma distributions.
- Gaussian
Bandit - A bandit whose arms distribute rewards according to the Gaussian distributions.
- Greedy
Agent - Agent that follows the Greedy Algorithm.
- Harmonic
Stepper - Stepper with a harmonically decreasing step size.
- LogNormal
Bandit - A bandit whose arms distribute rewards according to the Log Normal distributions.
- Optimistic
Agent - Agent that follows the Optimistic Algorithm.
- Record
Counter - Counter for cases where individual records must be maintained.
Traits§
- Agent
- A trait for common members of the Agents.
- Bandit
- A trait for common members of the Bandits
- Counter
- A trait for common methods required by all counters.
- Stepper
- Trait containing all methods required by all steppers.
Functions§
- print_
hms - Prints the amount of time that has elapsed since a timer was started.