Modules
Critics for an actor-critic agent.
Utilities for calculating step history features.
Policies for an actor-critic agent.
Structs
Configuration for ActorCriticAgent.
Deep Q-Learning Agent
Wraps a module so that a CPU copy is lazily initialized when the module is not already in CPU memory.
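
The lazy CPU copy described above could look roughly like the following. This is a minimal sketch using only the standard library, not the crate's actual implementation: `Device`, `Module`, `LazyCpu`, `to_cpu`, and `has_cpu_copy` are hypothetical names introduced for illustration, and a plain `Vec<f32>` stands in for real tensor storage.

```rust
use std::cell::OnceCell;

/// Hypothetical device tag; a real library would use its own device type.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Device {
    Cpu,
    Gpu,
}

/// Hypothetical stand-in for a neural-network module.
#[derive(Clone, Debug, PartialEq)]
struct Module {
    device: Device,
    weights: Vec<f32>,
}

impl Module {
    /// Assumed conversion: copies the parameters into CPU memory.
    fn to_cpu(&self) -> Module {
        Module {
            device: Device::Cpu,
            weights: self.weights.clone(),
        }
    }
}

/// Wrapper that creates a CPU copy only on first request.
struct LazyCpu {
    module: Module,
    cpu_copy: OnceCell<Module>,
}

impl LazyCpu {
    fn new(module: Module) -> Self {
        Self {
            module,
            cpu_copy: OnceCell::new(),
        }
    }

    /// Returns a CPU-resident view of the module.
    /// If the module already lives on the CPU, no copy is ever made.
    fn cpu(&self) -> &Module {
        if self.module.device == Device::Cpu {
            return &self.module;
        }
        // The closure runs at most once; later calls reuse the cached copy.
        self.cpu_copy.get_or_init(|| self.module.to_cpu())
    }

    /// Reports whether the cached copy has been created yet.
    fn has_cpu_copy(&self) -> bool {
        self.cpu_copy.get().is_some()
    }
}
```

In this sketch, calling `cpu()` on a wrapper built around a `Device::Gpu` module creates the copy on the first call and reuses it afterwards, while a wrapper around a `Device::Cpu` module returns the original and never allocates a copy. `OnceCell` is single-threaded; a wrapper shared across threads would presumably need `std::sync::OnceLock` instead.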