Gradient Checkpointing
Implements gradient checkpointing to reduce memory usage during training:
- Selective activation storage
- Recomputation during backward pass
- Memory-efficient training for large models
- Configurable checkpoint intervals
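The idea behind the list above can be sketched on a toy chain of scalar layers. This is an illustration only and does not use this module's API: `ToyLayer`, `grad_full`, and `grad_checkpointed` are hypothetical names, and the "memory" figure is simply a count of stored activations. The forward pass keeps one input per segment of `interval` layers, and the backward pass recomputes the inputs inside each segment before applying the chain rule.

```rust
/// Toy scalar layer computing y = tanh(w * x). Hypothetical, not a type from this module.
struct ToyLayer {
    w: f64,
}

impl ToyLayer {
    fn forward(&self, x: f64) -> f64 {
        (self.w * x).tanh()
    }

    /// Derivative of the output with respect to the input, evaluated at `x`.
    fn grad_input(&self, x: f64) -> f64 {
        let y = self.forward(x);
        self.w * (1.0 - y * y)
    }
}

/// Baseline: store every layer input during the forward pass.
/// Returns (d output / d x0, number of stored activations).
fn grad_full(layers: &[ToyLayer], x0: f64) -> (f64, usize) {
    let mut inputs = Vec::with_capacity(layers.len());
    let mut x = x0;
    for layer in layers {
        inputs.push(x);
        x = layer.forward(x);
    }
    let mut grad = 1.0;
    for (layer, &xi) in layers.iter().zip(inputs.iter()).rev() {
        grad *= layer.grad_input(xi);
    }
    (grad, inputs.len())
}

/// Checkpointed: store only the input of every `interval`-th layer, then
/// recompute the inputs inside each segment during the backward pass.
/// Returns (d output / d x0, peak number of activations held at once).
fn grad_checkpointed(layers: &[ToyLayer], x0: f64, interval: usize) -> (f64, usize) {
    assert!(interval > 0, "interval must be positive");

    // Forward pass: selective activation storage.
    let mut checkpoints = Vec::new();
    let mut x = x0;
    for (i, layer) in layers.iter().enumerate() {
        if i % interval == 0 {
            checkpoints.push(x);
        }
        x = layer.forward(x);
    }

    // Backward pass: visit segments last to first, recomputing as needed.
    let mut grad = 1.0;
    let mut peak = checkpoints.len();
    for (seg, chunk) in layers.chunks(interval).enumerate().rev() {
        let mut inputs = Vec::with_capacity(chunk.len());
        let mut x = checkpoints[seg];
        for layer in chunk {
            inputs.push(x);
            x = layer.forward(x);
        }
        peak = peak.max(checkpoints.len() + inputs.len());
        for (layer, &xi) in chunk.iter().zip(inputs.iter()).rev() {
            grad *= layer.grad_input(xi);
        }
    }
    (grad, peak)
}
```

Both functions return the same gradient; the checkpointed version trades one extra forward pass per segment for a smaller peak activation count. In this simplified model the peak is roughly `n / interval + interval` for `n` layers, which is why the interval is worth making configurable.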
Structs§
- CheckpointConfig - Gradient checkpointing configuration
- CheckpointManager - Checkpoint manager
- CheckpointStats - Checkpoint statistics
- CheckpointedLayer - Checkpointed layer wrapper
- CheckpointedSequential - Sequential model with gradient checkpointing
Enums§
- CheckpointStrategy - Checkpoint strategy
Functions§
- estimate_memory_savings - Utility function to estimate memory savings