Enum ApplyPolicy

Source

pub enum ApplyPolicy {
    Sync,
    Cadence,
    Async,
}

Expand description

Controls WHEN parameter averaging occurs (the interval K).

All three modes run the same architecture; only the averaging trigger differs. The interval K determines how many batches each GPU processes with its own local optimizer before parameters are synchronized across replicas.

Sync: K=1 (every batch). Equivalent to standard DDP. Best convergence guarantees, but fast GPUs idle waiting for slow ones.
Cadence: K=N (ElChe anchor count). The slow GPU anchors the cadence, fast GPUs fill the wall time with extra batches. Recommended for heterogeneous hardware (e.g. mixing GPU generations).
Async: same proportional scheduling as Cadence (ElChe batch counts), but with divergence correction: if replicas drift apart, the anchor is nudged down (tighter sync). Differs from Cadence only in epoch dispatch (per-rank vs broadcast) in non-progressive mode.

Variants§

§

Sync

Average after every batch (K=1). Equivalent to standard synchronous DDP. Lowest risk of model divergence. Fast GPUs wait at the collective barrier.

§

Cadence

Average every N batches where N is determined by ElChe’s cadence strategy. The slow device sets the pace; fast devices process proportionally more batches per averaging window. Good default for mixed GPU setups.

§

Async

Same proportional scheduling as Cadence, plus divergence correction: if parameter norms drift apart, ElChe’s anchor is nudged down (tighter sync). Differs from Cadence only in epoch dispatch (per-rank in non-progressive, identical in progressive mode).

ApplyPolicy

Enum ApplyPolicy Copy item path

Variants§

Sync

Cadence

Async

Trait Implementations§

impl Clone for ApplyPolicy

fn clone(&self) -> ApplyPolicy

fn clone_from(&mut self, source: &Self)

impl Debug for ApplyPolicy

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl PartialEq for ApplyPolicy

fn eq(&self, other: &ApplyPolicy) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Copy for ApplyPolicy

impl StructuralPartialEq for ApplyPolicy

Auto Trait Implementations§

impl Freeze for ApplyPolicy

impl RefUnwindSafe for ApplyPolicy

impl Send for ApplyPolicy

impl Sync for ApplyPolicy

impl Unpin for ApplyPolicy

impl UnsafeUnpin for ApplyPolicy

impl UnwindSafe for ApplyPolicy

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Enum ApplyPolicy

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,