Skip to main content

Module optim

Module optim 

Source
Expand description

Optimizers for training neural networks

Re-exports§

pub use dp::add_gaussian_noise;
pub use dp::clip_gradient;
pub use dp::estimate_noise_multiplier;
pub use dp::grad_norm;
pub use dp::privacy_cost_per_step;
pub use dp::DpError;
pub use dp::DpSgd;
pub use dp::DpSgdConfig;
pub use dp::PrivacyBudget;
pub use dp::RdpAccountant;
pub use hpo::AcquisitionFunction;
pub use hpo::GridSearch;
pub use hpo::HPOError;
pub use hpo::HyperbandScheduler;
pub use hpo::HyperparameterSpace;
pub use hpo::ParameterDomain;
pub use hpo::ParameterValue;
pub use hpo::SearchStrategy;
pub use hpo::SurrogateModel;
pub use hpo::TPEOptimizer;
pub use hpo::Trial;
pub use hpo::TrialStatus;

Modules§

dp
Differential Privacy Module (MLOPS-015)
hpo
Hyperparameter Optimization Module (MLOPS-011)

Structs§

Adam
Adam optimizer (Adaptive Moment Estimation)
AdamW
AdamW optimizer
CosineAnnealingLR
Cosine Annealing Learning Rate Scheduler
LinearWarmupLR
Linear Warmup Learning Rate Scheduler
SGD
SGD optimizer with optional momentum
StepDecayLR
Step Decay Learning Rate Scheduler
WarmupCosineDecayLR
Warmup + Cosine Decay Learning Rate Scheduler

Traits§

LRScheduler
Learning rate scheduler trait
Optimizer
Trait for optimization algorithms

Functions§

clip_grad_norm
Clip gradients by global norm
clip_grad_norm_refs
Clip gradients by global norm on borrowed parameter references.
simd_adam_update
Fused Adam parameter update.
simd_adamw_update
Fused AdamW parameter update with decoupled weight decay.
simd_axpy
AXPY operation: y = a*x + y