Modules

Critics for an actor-critic agent.

Utilities for calculating step history features.

Policies for an actor-critic agent.

Structs

Actor-critic agent. Consists of a Policy and a Critic.

Deep Q-learning agent.

Configuration for DqnAgent.

Wraps a module so that it has a lazily initialized CPU copy, created only if the module is not already in CPU memory.
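
The lazily initialized CPU copy described above could work roughly as in the sketch below. This is an illustration under stated assumptions, not this crate's implementation: `Device`, `Module`, `LazyCpuCopy`, and `ToyModule` are hypothetical names made up for the example, and only the standard library's `OnceCell` is used. The copy is built on the first request, and only when the wrapped module is not already on the CPU.

```rust
use std::cell::OnceCell;

/// Hypothetical device tag; a real tensor library would supply its own type.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Device {
    Cpu,
    Gpu(usize),
}

/// Hypothetical minimal module interface (an assumption for this sketch).
pub trait Module: Sized {
    /// Device on which the module's parameters currently live.
    fn device(&self) -> Device;
    /// Returns a copy of the module whose parameters live on the CPU.
    fn to_cpu(&self) -> Self;
}

/// Wraps a module and builds a CPU copy only when one is first requested.
pub struct LazyCpuCopy<M> {
    inner: M,
    cpu_copy: OnceCell<M>,
}

impl<M: Module> LazyCpuCopy<M> {
    pub fn new(inner: M) -> Self {
        Self { inner, cpu_copy: OnceCell::new() }
    }

    /// Shared access to the wrapped module on its original device.
    pub fn inner(&self) -> &M {
        &self.inner
    }

    /// Returns a CPU-resident module: the wrapped module itself if it is
    /// already on the CPU, otherwise a cached copy created on first use.
    pub fn cpu(&self) -> &M {
        if self.inner.device() == Device::Cpu {
            &self.inner
        } else {
            self.cpu_copy.get_or_init(|| self.inner.to_cpu())
        }
    }

    /// Mutable access discards the cached copy so it cannot go stale.
    pub fn inner_mut(&mut self) -> &mut M {
        self.cpu_copy = OnceCell::new();
        &mut self.inner
    }
}

/// Toy stand-in for a neural-network module, used only for demonstration.
#[derive(Debug, PartialEq)]
pub struct ToyModule {
    pub device: Device,
    pub weights: Vec<f32>,
}

impl Module for ToyModule {
    fn device(&self) -> Device {
        self.device
    }

    fn to_cpu(&self) -> Self {
        ToyModule { device: Device::Cpu, weights: self.weights.clone() }
    }
}
```

`OnceCell` is not `Sync`, so this sketch suits single-threaded use. If the wrapper had to be shared across threads, `std::sync::OnceLock` offers the same `get_or_init` pattern.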