pub struct ScheduleFreeAdamW { /* private fields */ }Expand description
Schedule-free AdamW optimizer.
Maintains both training sequence (x) and evaluation sequence (y). During training, gradients are computed w.r.t. x_t. During evaluation, use y_t for better generalization.
§Algorithm
# Training step:
m_t = β₁ * m_{t-1} + (1 - β₁) * g_t
v_t = β₂ * v_{t-1} + (1 - β₂) * g_t²
x_t = x_{t-1} - lr * m_t / (√v_t + ε) - lr * λ * x_{t-1} # weight decay
# Evaluation sequence (exponential moving average):
y_t = (1 - γ) * x_t + γ * y_{t-1}
# At test time, use y_t instead of x_t§Example
use tensorlogic_train::{ScheduleFreeAdamW, ScheduleFreeConfig};
use scirs2_core::ndarray::Array2;
use std::collections::HashMap;
let config = ScheduleFreeConfig::default()
.with_lr(0.001)
.with_warmup_steps(1000);
let mut optimizer = ScheduleFreeAdamW::new(config);
// Training mode - use training parameters
optimizer.set_training_mode(true);
// ... compute gradients ...
// Evaluation mode - switch to averaged parameters
optimizer.set_training_mode(false);
let eval_params = optimizer.get_eval_parameters();Implementations§
Source§impl ScheduleFreeAdamW
impl ScheduleFreeAdamW
Sourcepub fn new(config: ScheduleFreeConfig) -> Self
pub fn new(config: ScheduleFreeConfig) -> Self
Create a new Schedule-Free AdamW optimizer.
Sourcepub fn set_training_mode(&mut self, training: bool)
pub fn set_training_mode(&mut self, training: bool)
Set training mode.
When training_mode = true, use train_params (x_t) for gradients. When training_mode = false, use eval_params (y_t) for evaluation.
Sourcepub fn is_training(&self) -> bool
pub fn is_training(&self) -> bool
Get current training mode.
Trait Implementations§
Source§impl Clone for ScheduleFreeAdamW
impl Clone for ScheduleFreeAdamW
Source§fn clone(&self) -> ScheduleFreeAdamW
fn clone(&self) -> ScheduleFreeAdamW
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for ScheduleFreeAdamW
impl Debug for ScheduleFreeAdamW
Auto Trait Implementations§
impl Freeze for ScheduleFreeAdamW
impl RefUnwindSafe for ScheduleFreeAdamW
impl Send for ScheduleFreeAdamW
impl Sync for ScheduleFreeAdamW
impl Unpin for ScheduleFreeAdamW
impl UnwindSafe for ScheduleFreeAdamW
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more