Struct DecoderIncoherencePenalty

Source

pub struct DecoderIncoherencePenalty {
    pub target: PsiSlice,
    pub block_sizes: Vec<usize>,
    pub p_out: usize,
    pub k_atoms: usize,
    pub pairs: Vec<(usize, usize, f64)>,
    pub weight: f64,
    pub learnable_weight: bool,
    pub rho_index: usize,
    pub weight_schedule: Option<ScalarWeightSchedule>,
}

Expand description

Cross-atom decoder column-space incoherence, restricted to co-activating atom pairs (issue #671).

Lives on the β tier and targets the flat SAE decoder coefficient block. The β layout concatenates the per-atom decoder blocks in atom order: atom k owns M_k · p_out coefficients, stored as β[off_k + a·p_out + o] for basis row a and output feature o. The stored block is B_k ∈ ℝ^{M_k × p_out} with rows B_k[a, :] representing decoder directions in output space.

The penalty is the co-activation-masked cross-column-space overlap

  P = ½ · w · Σ_{j<k} W[j,k] · ‖B_j B_k^T‖²_F,
  W[j,k] = ½ · (coactivation[j,k] + coactivation[k,j]).

coactivation[j,k] is the mean over observations of gate[n,j] · gate[n,k]; pairs that never co-fire (W[j,k] = 0) contribute nothing. In the SAE objective this is the separability lever: atoms that are active on the same examples are discouraged from spanning the same decoder output directions, while unrelated atoms are not pushed apart just because they both exist in the dictionary.

The Hessian used here is the Gauss-Newton (positive-semidefinite) curvature of the Frobenius objective in C, dropping the indefinite second-order term in C. This keeps the β-tier Newton / PIRLS curvature block PSD, matching the other quadratic-on-Gram penalties.

Gotchas:

block_sizes are decoder basis-row counts M_k, not output widths; every atom shares the same p_out. Stored decoder blocks are (M_k, p_out), so B_j B_k^T is the cross-Gram of decoder directions in output space and remains well-defined for heterogeneous M_k.
The descriptor path builds a placeholder penalty; live SAE wiring replaces the co-activation matrix with the current mean gate products.
Offsets are interpreted against the vector passed to this penalty. In the SAE decoder-incoherence path the registered target slice is zero-based; callers using an already sliced target view must keep that convention.

Fields§

§target: PsiSlice§block_sizes: Vec<usize>

Per-atom decoder basis-function counts M_k. The atom blocks are laid out contiguously in β order; Σ_k M_k·p_out == target.len().

§p_out: usize

Output / feature dimension p_out (decoder column count, shared by all atoms).

§k_atoms: usize

Atom count K. The operator only stores the SPARSE list of penalized atom pairs (pairs), not the dense K×K co-activation matrix — at K = 32768 that dense matrix is 8 GiB. Every consumer of this operator already skipped pairs whose symmetrized weight is 0, so storing only the nonzero pairs is exactly equivalent to the dense matrix while being linear in the number of co-active / near-collinear pairs (#1026).

§pairs: Vec<(usize, usize, f64)>

Sparse penalized atom pairs (j, k, w) with j < k and the symmetrized weight w = ½·(W[j,k] + W[k,j]) > 0 (this is exactly the value the old pair_weight(j, k) returned). Pairs with w == 0 are omitted; the dense operator skipped them, so results are byte-identical.

§weight: f64

Base strength. If learnable_weight is true the resolved strength is weight·exp(rho[rho_index]); otherwise it is fixed at weight.

§learnable_weight: bool§rho_index: usize§weight_schedule: Option<ScalarWeightSchedule>

Struct DecoderIncoherencePenalty Copy item path

Fields§

Implementations§

impl DecoderIncoherencePenalty

pub fn new( target: PsiSlice, block_sizes: Vec<usize>, p_out: usize, coactivation: Array2<f64>, weight: f64, learnable_weight: bool, ) -> Result<Self, String>

pub fn new_sparse( target: PsiSlice, block_sizes: Vec<usize>, p_out: usize, pairs: Vec<(usize, usize, f64)>, weight: f64, learnable_weight: bool, ) -> Result<Self, String>

pub fn with_weight_schedule(self, schedule: ScalarWeightSchedule) -> Self

pub fn accumulate_psd_majorizer_dense( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, scale: f64, hbb: &mut Array2<f64>, )

Trait Implementations§

impl AnalyticPenalty for DecoderIncoherencePenalty

fn hvp( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, v: ArrayView1<'_, f64>, ) -> Array1<f64>

fn psd_majorizer_hvp( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, v: ArrayView1<'_, f64>, ) -> Array1<f64>

fn tier(&self) -> PenaltyTier

fn value(&self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>) -> f64

fn grad_target( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, ) -> Array1<f64>

fn grad_rho( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, ) -> Array1<f64>

fn rho_count(&self) -> usize

fn name(&self) -> &str

fn apply_schedule(&mut self, iter: usize)

fn hessian_diag( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, ) -> Option<Array1<f64>>

fn psd_majorizer_diag( &self, target: ArrayView1<'_, f64>, rho: ArrayView1<'_, f64>, ) -> Option<Array1<f64>>

impl Clone for DecoderIncoherencePenalty

fn clone(&self) -> DecoderIncoherencePenalty

fn clone_from(&mut self, source: &Self)

impl Debug for DecoderIncoherencePenalty

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl PenaltyManifest for DecoderIncoherencePenalty

const KIND_TAG: &'static str = "decoder_incoherence"

const PYTHON_WRAPPER: &'static str = "DecoderIncoherencePenalty"

const ROW_BLOCK_DIAGONAL: bool = false

fn dispatch_tier(&self) -> PenaltyTier

Auto Trait Implementations§

impl Freeze for DecoderIncoherencePenalty

impl RefUnwindSafe for DecoderIncoherencePenalty

impl Send for DecoderIncoherencePenalty

impl Sync for DecoderIncoherencePenalty

impl Unpin for DecoderIncoherencePenalty

impl UnsafeUnpin for DecoderIncoherencePenalty

impl UnwindSafe for DecoderIncoherencePenalty

Blanket Implementations§

impl<T> Allocation for Twhere T: RefUnwindSafe + Send + Sync,

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> ByRef<T> for T

fn by_ref(&self) -> &T

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DTwhere ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DTwhere ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> DistributionExt for Twhere T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> Twhere Self: Distribution<T>,

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Imply<T> for Uwhere T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<T> Read<Exclusive, BecauseExclusive> for Twhere T: ?Sized,

impl<T> Same for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SPwhere SS: SubsetOf<SP>,

fn to_subset(&self) -> Option<SS>

fn is_in_subset(&self) -> bool

fn to_subset_unchecked(&self) -> SS

fn from_subset(element: &SS) -> SP

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

Struct DecoderIncoherencePenalty

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T> DistributionExt for T
where T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T
where Self: Distribution<T>,

impl<T, U> Imply<T> for U
where T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,