Reinforcement Learning Optimization Module
This module implements optimization algorithms based on reinforcement learning principles: optimization strategies are learned by interacting with the objective function, which is treated as the environment.
§Key Features
- Policy Gradient Optimization: Learn optimization policies using policy gradients
- Q-Learning for Optimization: Value-based approach to optimization strategy learning
- Actor-Critic Methods: Combined policy and value learning for optimization
- Bandit-based Optimization: Multi-armed bandit approaches for hyperparameter tuning
- Evolutionary Strategies: Population-based RL optimization
- Meta-Learning: Learning to optimize across different problem classes
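To make the bandit-based approach concrete, here is a minimal, self-contained sketch of epsilon-greedy selection among candidate step sizes. It does not use this module's API: `Lcg`, `EpsilonGreedy`, and `tune_step_size` are hypothetical names introduced for illustration, and the objective f(x) = x² with an improvement-based reward is an assumption chosen only to keep the example small.

```rust
/// Tiny deterministic pseudo-random generator (linear congruential),
/// used only so this sketch needs no external crates.
struct Lcg(u64);

impl Lcg {
    /// Returns a pseudo-random value in [0, 1).
    fn next_f64(&mut self) -> f64 {
        self.0 = self
            .0
            .wrapping_mul(6364136223846793005)
            .wrapping_add(1442695040888963407);
        (self.0 >> 11) as f64 / (1u64 << 53) as f64
    }
}

/// Hypothetical epsilon-greedy bandit over a fixed set of arms.
struct EpsilonGreedy {
    epsilon: f64,
    counts: Vec<u32>,
    values: Vec<f64>, // running mean reward per arm
}

impl EpsilonGreedy {
    fn new(n_arms: usize, epsilon: f64) -> Self {
        Self { epsilon, counts: vec![0; n_arms], values: vec![0.0; n_arms] }
    }

    /// Index of the arm with the highest estimated reward (first on ties).
    fn best_arm(&self) -> usize {
        let mut best = 0;
        for i in 1..self.values.len() {
            if self.values[i] > self.values[best] {
                best = i;
            }
        }
        best
    }

    /// Explores a uniformly random arm with probability `epsilon`,
    /// otherwise exploits the current best estimate.
    fn select(&self, rng: &mut Lcg) -> usize {
        if rng.next_f64() < self.epsilon {
            let n = self.values.len();
            ((rng.next_f64() * n as f64) as usize).min(n - 1)
        } else {
            self.best_arm()
        }
    }

    /// Incremental update of the running mean reward for `arm`.
    fn update(&mut self, arm: usize, reward: f64) {
        self.counts[arm] += 1;
        let n = self.counts[arm] as f64;
        self.values[arm] += (reward - self.values[arm]) / n;
    }
}

/// Chooses a gradient-descent step size for f(x) = x^2.
/// The reward is the decrease in f after one step from a random start.
fn tune_step_size(candidates: &[f64], rounds: usize, seed: u64) -> usize {
    let mut rng = Lcg(seed);
    let mut bandit = EpsilonGreedy::new(candidates.len(), 0.1);
    for _ in 0..rounds {
        let arm = bandit.select(&mut rng);
        let x = 1.0 + rng.next_f64(); // start point in [1, 2)
        let x_new = x - candidates[arm] * 2.0 * x; // one gradient step
        bandit.update(arm, x * x - x_new * x_new);
    }
    bandit.best_arm()
}
```

Calling `tune_step_size(&[0.01, 0.5, 1.2], 2000, 42)` returns the index of the candidate the bandit currently rates highest. For this quadratic, a step size of 0.5 reaches the minimum in a single step, so it earns the largest reward once it has been explored.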
§Applications
- Automatic hyperparameter tuning
- Adaptive optimization algorithms
- Black-box optimization
- Neural architecture search
- AutoML optimization pipelines
Re-exports§
pub use actor_critic::*;
pub use bandit_optimization::*;
pub use evolutionary_strategies::*;
pub use meta_learning::*;
pub use policy_gradient::*;
pub use q_learning_optimization::*;
Modules§
- actor_critic: Actor-Critic Methods for Optimization
- bandit_optimization: Multi-Armed Bandit Optimization
- evolutionary_strategies: Evolutionary Strategies for RL Optimization
- meta_learning: Meta-Learning for Optimization
- policy_gradient: Advanced Policy Gradient Optimization with Meta-Gradient Learning
- q_learning_optimization: Q-Learning for Optimization
- utils: Utility functions for RL optimization
Structs§
- ConvergenceMetrics: Convergence metrics for RL state
- Experience: Experience tuple for RL
- ExperienceBuffer: Experience replay buffer
- ImprovementReward: Simple improvement-based reward function
- OptimizationState: State representation for optimization RL
- RLOptimizationConfig: Configuration for reinforcement learning optimization
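For readers unfamiliar with experience replay, the following sketch shows the general idea behind an experience tuple and a bounded replay buffer. The fields and methods of the real `Experience` and `ExperienceBuffer` types are not shown on this page, so everything here (`SketchExperience`, `SketchReplayBuffer`, their fields, and the sampling scheme) is a hypothetical stand-in rather than this crate's implementation.

```rust
use std::collections::VecDeque;

/// Hypothetical experience tuple: (state, action, reward, next state, done).
#[derive(Debug, Clone, PartialEq)]
struct SketchExperience {
    state: Vec<f64>,
    action: usize,
    reward: f64,
    next_state: Vec<f64>,
    done: bool,
}

/// Hypothetical fixed-capacity replay buffer that evicts the oldest entry.
struct SketchReplayBuffer {
    capacity: usize,
    items: VecDeque<SketchExperience>,
}

impl SketchReplayBuffer {
    fn new(capacity: usize) -> Self {
        assert!(capacity > 0, "capacity must be positive");
        Self { capacity, items: VecDeque::with_capacity(capacity) }
    }

    /// Adds an experience, dropping the oldest one when the buffer is full.
    fn push(&mut self, exp: SketchExperience) {
        if self.items.len() == self.capacity {
            self.items.pop_front();
        }
        self.items.push_back(exp);
    }

    fn len(&self) -> usize {
        self.items.len()
    }

    /// Samples `batch_size` entries with replacement, using a small
    /// linear congruential generator seeded by the caller.
    fn sample(&self, batch_size: usize, seed: u64) -> Vec<&SketchExperience> {
        let mut out = Vec::with_capacity(batch_size);
        if self.items.is_empty() {
            return out;
        }
        let mut state = seed;
        for _ in 0..batch_size {
            state = state
                .wrapping_mul(6364136223846793005)
                .wrapping_add(1442695040888963407);
            let idx = ((state >> 33) as usize) % self.items.len();
            out.push(&self.items[idx]);
        }
        out
    }
}
```

Sampling uniformly from stored transitions, rather than learning only from the most recent one, is the usual motivation for a replay buffer: it reduces correlation between consecutive updates.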
Enums§
- OptimizationAction: Action space for optimization RL
Traits§
- RLOptimizer: Trait for RL-based optimizers
- RewardFunction: Reward function for optimization RL
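As an illustration of what an improvement-based reward can look like, the sketch below defines a small reward trait and one implementation. The method signatures of the real `RewardFunction` trait and `ImprovementReward` struct are not documented on this page, so `SketchRewardFunction`, `SketchImprovementReward`, the `scale` field, and `episode_return` are assumptions made for the example, with minimization as the assumed goal.

```rust
/// Hypothetical reward interface: maps a change in objective value to a reward.
trait SketchRewardFunction {
    fn reward(&self, prev_value: f64, new_value: f64) -> f64;
}

/// Hypothetical improvement-based reward for minimization:
/// positive when the objective decreases, negative when it increases.
struct SketchImprovementReward {
    scale: f64,
}

impl SketchRewardFunction for SketchImprovementReward {
    fn reward(&self, prev_value: f64, new_value: f64) -> f64 {
        self.scale * (prev_value - new_value)
    }
}

/// Sums the rewards over a trajectory of objective values.
fn episode_return(reward_fn: &dyn SketchRewardFunction, values: &[f64]) -> f64 {
    values
        .windows(2)
        .map(|w| reward_fn.reward(w[0], w[1]))
        .sum()
}
```

With `scale = 1.0`, the trajectory `[10.0, 4.0, 5.0, 1.0]` yields rewards 6, -1, and 4, so the undiscounted return equals the total decrease in the objective, 9.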