pub struct GradScaler { /* private fields */ }
GradScaler for mixed precision training.
Scales loss before backward to prevent gradient underflow in float16, then unscales gradients before optimizer step. Dynamically adjusts scale factor based on whether inf/nan gradients are detected.
let mut scaler = GradScaler::new();
let scaled_loss = scaler.scale(&loss)?;
scaled_loss.backward()?;
let stepped = scaler.step(&params, &mut || optim.step())?;
scaler.update();
Implementations§
impl GradScaler
pub fn new() -> Self
Create a new GradScaler with default settings.
Initial scale: 2^16 = 65536, growth: 2.0, backoff: 0.5, interval: 2000.
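The dynamic adjustment described above can be sketched as follows, using the documented defaults (initial scale 2^16, growth 2.0, backoff 0.5, interval 2000). The struct and field names here are illustrative, not the crate's private internals:

```rust
// Hypothetical sketch of the scale-update rule: halve on inf/nan,
// double after `growth_interval` consecutive clean steps.
struct ScaleState {
    scale: f64,
    growth_factor: f64,
    backoff_factor: f64,
    growth_interval: u32,
    good_steps: u32, // consecutive steps without inf/nan gradients
}

impl ScaleState {
    fn new() -> Self {
        ScaleState {
            scale: 65536.0, // 2^16, the documented default
            growth_factor: 2.0,
            backoff_factor: 0.5,
            growth_interval: 2000,
            good_steps: 0,
        }
    }

    // Called once per iteration after checking gradients for inf/nan.
    fn update(&mut self, found_inf: bool) {
        if found_inf {
            // Overflow detected: back off and restart the growth counter.
            self.scale *= self.backoff_factor;
            self.good_steps = 0;
        } else {
            self.good_steps += 1;
            if self.good_steps >= self.growth_interval {
                // A full interval of clean steps: grow the scale.
                self.scale *= self.growth_factor;
                self.good_steps = 0;
            }
        }
    }
}

fn main() {
    let mut s = ScaleState::new();
    s.update(true); // inf detected: 65536 * 0.5
    assert_eq!(s.scale, 32768.0);
    for _ in 0..2000 {
        s.update(false); // 2000 clean steps: scale doubles back
    }
    assert_eq!(s.scale, 65536.0);
}
```

Backoff reacts immediately to overflow, while growth is deliberately slow, so the scale settles just under the largest value the float16 gradients can tolerate.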
pub fn scale(&self, loss: &Variable) -> Result<Variable>
Scale the loss before backward. Returns loss * scale.
pub fn scale_factor(&self) -> f64
Current scale factor.
Trait Implementations§
impl Default for GradScaler
impl Stateful for GradScaler
fn save_state<W: Write>(&self, w: &mut W) -> Result<()>
Serialize the implementor's state (lr, momentum buffers, etc. for optimizers) to a writer.
fn load_state<R: Read>(&mut self, r: &mut R) -> Result<()>
Restore optimizer state from a reader.
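A minimal round-trip sketch of this save/load pattern. The actual wire format of GradScaler's state is not documented here, so the byte layout below (little-endian f64 scale plus a u32 step counter) is an assumption for illustration only:

```rust
use std::io::{Read, Result, Write};

// Hypothetical stand-in for the state a GradScaler would persist.
struct ScalerState {
    scale: f64,
    good_steps: u32,
}

impl ScalerState {
    // Write state as fixed-width little-endian fields (assumed layout).
    fn save_state<W: Write>(&self, w: &mut W) -> Result<()> {
        w.write_all(&self.scale.to_le_bytes())?;
        w.write_all(&self.good_steps.to_le_bytes())
    }

    // Read the fields back in the same order they were written.
    fn load_state<R: Read>(&mut self, r: &mut R) -> Result<()> {
        let mut f = [0u8; 8];
        r.read_exact(&mut f)?;
        self.scale = f64::from_le_bytes(f);
        let mut u = [0u8; 4];
        r.read_exact(&mut u)?;
        self.good_steps = u32::from_le_bytes(u);
        Ok(())
    }
}

fn main() -> Result<()> {
    // Save to an in-memory buffer, then restore into a fresh instance,
    // as one would when checkpointing mid-training.
    let saved = ScalerState { scale: 32768.0, good_steps: 1500 };
    let mut buf = Vec::new();
    saved.save_state(&mut buf)?;

    let mut restored = ScalerState { scale: 65536.0, good_steps: 0 };
    restored.load_state(&mut buf.as_slice())?;
    assert_eq!(restored.scale, 32768.0);
    assert_eq!(restored.good_steps, 1500);
    Ok(())
}
```

Persisting the scaler alongside the optimizer matters: resuming from a checkpoint with a freshly-initialized scale can trigger a burst of overflowed steps until the scale backs off again.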
Auto Trait Implementations§
impl Freeze for GradScaler
impl RefUnwindSafe for GradScaler
impl Send for GradScaler
impl Sync for GradScaler
impl Unpin for GradScaler
impl UnsafeUnpin for GradScaler
impl UnwindSafe for GradScaler
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.