Struct SaeRowLayout


pub struct SaeRowLayout {
    pub active_atoms: Vec<Vec<usize>>,
    pub coord_starts: Vec<Vec<usize>>,
    pub coord_offsets_full: Vec<usize>,
    pub coord_dims: Vec<usize>,
}


Per-row active-set layout for sparse SAE assignment (any mode).

When the assignment is sparse, either structurally (JumpReLU gate) or effectively (softmax / IBP-MAP at large K, where the assignment mass concentrates on a small support), only a subset of the K atoms is active per observation. The Arrow-Schur row block for observation i then has dimension q_active_i = |active_atoms_i| + Σ_{k ∈ active_atoms_i} d_k rather than q = assignment_dim + Σ_k d_k. This struct records which atoms are active in each row and maps compressed block positions back to full-q positions, so that apply_newton_step can unpack the compact delta_t returned by the solve.
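The two dimension formulas above can be checked on a toy case. The sketch below is illustrative only: the helper names full_row_dim and active_row_dim are hypothetical and not part of this crate, and the example takes assignment_dim equal to the number of atoms K, which is an assumption.

```rust
/// Hypothetical helper: full row-block dimension,
/// q = assignment_dim + sum of d_k over all atoms.
fn full_row_dim(assignment_dim: usize, coord_dims: &[usize]) -> usize {
    assignment_dim + coord_dims.iter().sum::<usize>()
}

/// Hypothetical helper: compact row-block dimension,
/// q_active = |active| + sum of d_k over the active atoms only.
fn active_row_dim(active: &[usize], coord_dims: &[usize]) -> usize {
    active.len() + active.iter().map(|&k| coord_dims[k]).sum::<usize>()
}
```

With K = 5 atoms of coordinate dimensions [2, 3, 1, 2, 2] and active atoms {1, 3}, the full block has q = 5 + 10 = 15 entries while the compact block has q_active = 2 + (3 + 2) = 7.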

For JumpReLU the active set is exactly the gated support (a_{n,k} ≠ 0), so the compact solve is identical to the dense solve. For IBP-MAP the active set is the union of a top-k_active_cap truncation and a magnitude cutoff on a_{n,k}. This path is enabled only when K is large enough that the dense (m_total · p)² data Gram would not fit the host / device working-set budget; the dropped atoms carry O(a_{n,k}²) curvature, which is negligible by construction of the cutoff.
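One way to picture the IBP-MAP selection rule is the sketch below. It reads "union" literally: an atom is kept if it is among the top cap atoms by magnitude or if its magnitude exceeds the cutoff. The function name select_active_atoms, its signature, the strict inequality, and the tie-breaking are all assumptions made for illustration; the crate's own selection routine may differ.

```rust
/// Hypothetical helper: atoms kept by "top-`cap` by |a|" OR "|a| > cutoff",
/// returned in ascending atom order.
fn select_active_atoms(weights: &[f64], cap: usize, cutoff: f64) -> Vec<usize> {
    // Rank atom indices by descending |a_{n,k}| (stable sort: ties keep index order).
    let mut order: Vec<usize> = (0..weights.len()).collect();
    order.sort_by(|&i, &j| weights[j].abs().total_cmp(&weights[i].abs()));

    // Start from the top-`cap` truncation.
    let mut keep = vec![false; weights.len()];
    for &k in order.iter().take(cap) {
        keep[k] = true;
    }
    // Union with every atom whose magnitude exceeds the cutoff.
    for (k, w) in weights.iter().enumerate() {
        if w.abs() > cutoff {
            keep[k] = true;
        }
    }
    // Emit sorted indices, matching the sorted rows of `active_atoms`.
    (0..weights.len()).filter(|&k| keep[k]).collect()
}
```

Under this reading every dropped atom has |a_{n,k}| at or below the cutoff, which is consistent with the statement that the dropped curvature is small by construction.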

#1408: Softmax engages this compact layout when an explicit top_k (softmax_active_cap) and/or the in-core memory budget bounds the active set. The AssignmentMode::Softmax arm of assemble_arrow_schur consults crate::manifold::SaeManifoldTerm::softmax_active_plan and, on Some((cap, cutoff)), builds the active set via Self::from_dense_weights. The full-K dense softmax layout is retained only when neither lever engages (no top_k, in-budget K). Folding softmax top_k into the compact solve required two coordinated changes: writing the active×active Gershgorin Loewner majorizer sub-block (#1419; the softmax entropy curvature is indefinite, so its raw diagonal cannot be used), and contracting that same majorizer over the compact logit slots in the logdet ρ-trace (assignment_log_strength_hessian_trace) and the θ-adjoint, so that value, log|H|, and Γ all differentiate one operator on the compact support. That change has landed and is FD-certified; the FFI's after-the-fit top-k projection is then a no-op at the optimum.

Fields§

§active_atoms: Vec<Vec<usize>>

active_atoms[row] — sorted indices of active atoms for that row.

§coord_starts: Vec<Vec<usize>>

For row i, active atom active_atoms[i][j] has its coord block starting at compressed position coord_starts[i][j].

§coord_offsets_full: Vec<usize>

Full-q coordinate offset for atom k (length k_atoms).

§coord_dims: Vec<usize>

Per-atom coordinate dimensions, indexed by atom index.
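To make the relationship between the four fields concrete, the sketch below fills a stand-in struct from per-row active sets. It is not the crate's constructor. RowLayoutSketch and build_layout are hypothetical names, and two layout conventions are assumed rather than documented: the full row is ordered as K assignment slots followed by each atom's coordinate block, and the compact row is ordered as the active assignment slots followed by the active coordinate blocks.

```rust
/// Local stand-in mirroring the four documented fields (not the crate's type).
#[allow(dead_code)]
#[derive(Debug, Clone)]
struct RowLayoutSketch {
    active_atoms: Vec<Vec<usize>>,
    coord_starts: Vec<Vec<usize>>,
    coord_offsets_full: Vec<usize>,
    coord_dims: Vec<usize>,
}

/// Hypothetical constructor from sorted per-row active sets.
fn build_layout(active_atoms: Vec<Vec<usize>>, coord_dims: Vec<usize>) -> RowLayoutSketch {
    let k_atoms = coord_dims.len();

    // Assumed full layout: [K assignment slots | atom 0 coords | atom 1 coords | ...].
    let mut coord_offsets_full = Vec::with_capacity(k_atoms);
    let mut next = k_atoms;
    for &d in &coord_dims {
        coord_offsets_full.push(next);
        next += d;
    }

    // Assumed compact layout: [active assignment slots | active coord blocks in order].
    let coord_starts: Vec<Vec<usize>> = active_atoms
        .iter()
        .map(|row| {
            let mut pos = row.len(); // coords begin after the active assignment slots
            row.iter()
                .map(|&k| {
                    let start = pos;
                    pos += coord_dims[k];
                    start
                })
                .collect::<Vec<usize>>()
        })
        .collect();

    RowLayoutSketch { active_atoms, coord_starts, coord_offsets_full, coord_dims }
}
```

For coordinate dimensions [2, 3, 1, 2, 2] and a row with active atoms [1, 3], this yields coord_starts of [2, 5] for that row and coord_offsets_full of [5, 7, 10, 11, 13].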


Implementations§

impl SaeRowLayout

pub fn row_q_active(&self, row: usize) -> usize

pub fn expand_row(&self, row: usize, delta_t_row: &[f64], out: &mut [f64])
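Neither method carries a description here, so the following is only a plausible reading of what they do, based on the struct-level text: row_q_active returns the compact dimension of a row, and expand_row scatters a compact delta_t row into a full-q buffer. The sketch uses a local stand-in type, and it assumes that atom k's assignment slot sits at full index k, that compact assignment slots precede the coordinate blocks, and that inactive entries are set to zero. None of these conventions is confirmed by the source.

```rust
/// Local stand-in mirroring the four documented fields (not the crate's type).
struct RowLayoutSketch {
    active_atoms: Vec<Vec<usize>>,
    coord_starts: Vec<Vec<usize>>,
    coord_offsets_full: Vec<usize>,
    coord_dims: Vec<usize>,
}

impl RowLayoutSketch {
    /// Compact dimension of one row: |active| + sum of d_k over active atoms.
    fn row_q_active(&self, row: usize) -> usize {
        let atoms = &self.active_atoms[row];
        atoms.len() + atoms.iter().map(|&k| self.coord_dims[k]).sum::<usize>()
    }

    /// Scatter a compact row into a full-q buffer (assumed conventions, see lead-in).
    fn expand_row(&self, row: usize, delta_t_row: &[f64], out: &mut [f64]) {
        assert_eq!(delta_t_row.len(), self.row_q_active(row));
        // Assumption: inactive atoms receive a zero step.
        out.fill(0.0);
        for (j, &k) in self.active_atoms[row].iter().enumerate() {
            // Assumption: compact slot j is atom k's assignment entry, at full index k.
            out[k] = delta_t_row[j];
            // Copy atom k's coordinate block from its compact start to its full offset.
            let d = self.coord_dims[k];
            let src = self.coord_starts[row][j];
            let dst = self.coord_offsets_full[k];
            out[dst..dst + d].copy_from_slice(&delta_t_row[src..src + d]);
        }
    }
}
```

With active atoms [1, 3] and coordinate dimensions [2, 3, 1, 2, 2], a 7-entry compact row expands into a 15-entry full row in which only indices 1, 3, 7..10, and 11..13 are nonzero.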

Trait Implementations§

impl Clone for SaeRowLayout

fn clone(&self) -> SaeRowLayout

fn clone_from(&mut self, source: &Self)

impl Debug for SaeRowLayout

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl Freeze for SaeRowLayout

impl RefUnwindSafe for SaeRowLayout

impl Send for SaeRowLayout

impl Sync for SaeRowLayout

impl Unpin for SaeRowLayout

impl UnsafeUnpin for SaeRowLayout

impl UnwindSafe for SaeRowLayout

Blanket Implementations§

impl<T> Allocation for T where T: RefUnwindSafe + Send + Sync,

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> ByRef<T> for T

fn by_ref(&self) -> &T

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT where ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT where ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> DistributionExt for T where T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T where Self: Distribution<T>,

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Imply<T> for U where T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self> where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<T> Read<Exclusive, BecauseExclusive> for T where T: ?Sized,

impl<T> Same for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SP where SS: SubsetOf<SP>,

fn to_subset(&self) -> Option<SS>

fn is_in_subset(&self) -> bool

fn to_subset_unchecked(&self) -> SS

fn from_subset(element: &SS) -> SP

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for T where V: MultiLane<T>,

fn vzip(self) -> V
