oxicuda-rl 0.1.1

GPU-accelerated reinforcement learning primitives for OxiCUDA
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
//! Policy distributions for discrete and continuous action spaces.
//!
//! * [`crate::policy::CategoricalPolicy`] — discrete actions via categorical / softmax distribution
//! * [`crate::policy::GaussianPolicy`] — continuous actions via diagonal Gaussian (reparameterised)
//! * [`crate::policy::DeterministicPolicy`] — deterministic policy for DDPG/TD3

pub mod categorical;
pub mod deterministic;
pub mod gaussian;

pub use categorical::CategoricalPolicy;
pub use deterministic::DeterministicPolicy;
pub use gaussian::GaussianPolicy;