Skip to main content

SamOptimizer

Struct SamOptimizer 

Source
pub struct SamOptimizer<O: Optimizer> { /* private fields */ }
Expand description

SAM optimizer (Sharpness Aware Minimization).

SAM seeks parameters that lie in neighborhoods having uniformly low loss, improving model generalization. It requires two forward-backward passes per step: one to compute the adversarial perturbation, and one to compute the actual gradient.

Reference: Foret et al. “Sharpness-Aware Minimization for Efficiently Improving Generalization” (ICLR 2021)

Note: This is a wrapper optimizer. SAM requires special handling in the training loop to perform two gradient computations per step. The typical usage is:

  1. Compute gradients at current parameters
  2. Compute adversarial perturbation
  3. Compute gradients at perturbed parameters
  4. Update with the perturbed gradients

Implementations§

Source§

impl<O: Optimizer> SamOptimizer<O>

Source

pub fn new(base_optimizer: O, rho: f64) -> TrainResult<Self>

Create a new SAM optimizer.

§Arguments
  • base_optimizer - The base optimizer to use (SGD, Adam, etc.)
  • rho - Perturbation radius (typically 0.05)
Source

pub fn first_step( &mut self, parameters: &mut HashMap<String, Array<f64, Ix2>>, gradients: &HashMap<String, Array<f64, Ix2>>, ) -> TrainResult<()>

Compute adversarial perturbations.

This should be called with the first set of gradients to compute the perturbation direction.

Source

pub fn second_step( &mut self, parameters: &mut HashMap<String, Array<f64, Ix2>>, gradients: &HashMap<String, Array<f64, Ix2>>, ) -> TrainResult<()>

Perform the actual optimization step.

This should be called with the second set of gradients (computed at the perturbed parameters). It will remove the perturbations and update the parameters using the base optimizer.

Trait Implementations§

Source§

impl<O: Debug + Optimizer> Debug for SamOptimizer<O>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<O: Optimizer> Optimizer for SamOptimizer<O>

Source§

fn step( &mut self, parameters: &mut HashMap<String, Array<f64, Ix2>>, gradients: &HashMap<String, Array<f64, Ix2>>, ) -> TrainResult<()>

Update parameters with computed gradients.
Source§

fn zero_grad(&mut self)

Zero all gradients.
Source§

fn get_lr(&self) -> f64

Get current learning rate.
Source§

fn set_lr(&mut self, lr: f64)

Set learning rate.
Source§

fn state_dict(&self) -> HashMap<String, Vec<f64>>

Get optimizer state for checkpointing.
Source§

fn load_state_dict(&mut self, state: HashMap<String, Vec<f64>>)

Load optimizer state from checkpoint.

Auto Trait Implementations§

§

impl<O> Freeze for SamOptimizer<O>
where O: Freeze,

§

impl<O> RefUnwindSafe for SamOptimizer<O>
where O: RefUnwindSafe,

§

impl<O> Send for SamOptimizer<O>
where O: Send,

§

impl<O> Sync for SamOptimizer<O>
where O: Sync,

§

impl<O> Unpin for SamOptimizer<O>
where O: Unpin,

§

impl<O> UnwindSafe for SamOptimizer<O>
where O: UnwindSafe,

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V