Expand description
Re-export aprender’s pruning primitives
Structs§
- Magnitude
Importance - Magnitude-based importance estimator.
- Magnitude
Pruner - Simple magnitude-based pruner.
- Pruning
Result - Result of a pruning operation with diagnostics.
- SparseGPT
Importance SparseGPTimportance estimator using Hessian-based saliency.- Sparsity
Mask - Sparsity mask with validation.
- Wanda
Importance - Wanda (Weights and Activations) importance estimator.
- Wanda
Pruner - Wanda-based pruner.
Enums§
- Pruning
Error - Pruning operation errors with detailed context.
- Sparse
Tensor - Unified sparse tensor type.
- Sparsity
Pattern - Sparsity pattern constraints.
Traits§
- Importance
- Core trait for importance estimation algorithms.
- Pruner
- High-level pruning interface.
Functions§
- generate_
block_ mask - Generate a block sparsity mask.
- generate_
column_ mask - Generate a column sparsity mask.
- generate_
nm_ mask - Generate an N:M structured sparsity mask.
- generate_
row_ mask - Generate a row sparsity mask.
- generate_
unstructured_ mask - Generate an unstructured sparsity mask based on importance scores.
- sparsify
- Apply a sparsity mask to a tensor and return sparse representation.