Module policy_gradient


Advanced Policy Gradient Optimization with Meta-Gradient Learning

Policy gradient optimization methods extended with meta-learning capabilities:

  • Meta-gradient learning for automatic learning rate adaptation
  • Higher-order optimization dynamics
  • Meta-policy networks for learning optimization strategies
  • Adaptive curriculum learning across problem classes
  • Hierarchical optimization policies
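
To make the first item concrete, the following is a minimal sketch of one simple form of learning rate adaptation, hypergradient descent, applied to a toy quadratic objective. It is an illustration only and is not taken from this module: the names `hypergradient_descent`, `grad`, and `meta_lr` are hypothetical, and the module's actual meta-gradient mechanism may differ.

```rust
/// Gradient of the toy objective f(x) = 0.5 * sum(x_i^2), which is x itself.
/// (Hypothetical helper, used only for this illustration.)
fn grad(x: &[f64]) -> Vec<f64> {
    x.to_vec()
}

/// Hypothetical sketch: gradient descent whose learning rate `lr` is itself
/// adapted by a gradient step with step size `meta_lr`.
/// Returns the final point and the adapted learning rate.
fn hypergradient_descent(
    mut x: Vec<f64>,
    mut lr: f64,
    meta_lr: f64,
    steps: usize,
) -> (Vec<f64>, f64) {
    let mut prev_grad = vec![0.0; x.len()];
    for _ in 0..steps {
        let g = grad(&x);
        // The derivative of the current loss with respect to the previous
        // learning rate is -(g . prev_grad), so descending on it raises lr
        // while successive gradients agree and lowers it when they oppose.
        let dot: f64 = g.iter().zip(&prev_grad).map(|(a, b)| a * b).sum();
        lr = (lr + meta_lr * dot).max(1e-8); // keep lr strictly positive
        // Ordinary gradient step with the adapted learning rate.
        for (xi, gi) in x.iter_mut().zip(&g) {
            *xi -= lr * gi;
        }
        prev_grad = g;
    }
    (x, lr)
}

fn main() {
    let (x, lr) = hypergradient_descent(vec![1.0, -2.0], 0.01, 0.001, 200);
    let loss: f64 = 0.5 * x.iter().map(|v| v * v).sum::<f64>();
    println!("final loss = {loss:.3e}, adapted lr = {lr:.4}");
}
```

Setting `meta_lr` to `0.0` recovers plain gradient descent with a fixed learning rate, which gives a convenient baseline for comparison.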

Structs§

AdvancedAdvancedPolicyGradientOptimizer
Advanced Policy Gradient Optimizer with Meta-Learning
CurriculumController
Curriculum learning controller
LearningMetrics
Learning metrics for meta-learning
MetaExperienceBuffer
Meta-experience buffer for higher-order learning
MetaGradients
Meta-gradients for higher-order optimization
MetaLearningStats
Meta-learning statistics
MetaPolicyNetwork
Advanced Neural Network with Meta-Learning Capabilities
MetaTrajectory
Enhanced trajectory with meta-learning information

Functions§

advanced_advanced_policy_gradient_optimize
Convenience function for advanced meta-learning policy gradient optimization
placeholder
policy_gradient_optimize
Legacy convenience function for backward compatibility