Struct ArrowFactorCache

Source

pub struct ArrowFactorCache {Show 19 fields
    pub htt_factors: ArrowFactorSlab,
    pub htt_factors_undamped: ArrowUndampedFactors,
    pub schur_factor: Option<Array2<f64>>,
    pub joint_hessian_log_det: Option<f64>,
    pub solver_mode: ArrowSolverMode,
    pub ridge_t: f64,
    pub ridge_beta: f64,
    pub htbeta: ArrowHtbetaCache,
    pub d: usize,
    pub row_dims: Arc<[usize]>,
    pub row_offsets: Arc<[usize]>,
    pub k: usize,
    pub manifold_mode_fingerprint: u64,
    pub row_hessian_fingerprint: u64,
    pub pcg_diagnostics: PcgDiagnostics,
    pub gauge_deflated_directions: usize,
    pub deflated_row_directions: Arc<[Vec<Array1<f64>>]>,
    pub deflation_row_spectra: Arc<[Option<RowDeflationSpectrum>]>,
    pub cross_row_woodbury: Option<CrossRowWoodbury>,
}

Fields§

§htt_factors: ArrowFactorSlab

Per-row lower-triangular Cholesky factors of H_tt^(i) + ridge_t·I.

These are the damped factors used inside the Newton solve. The IFT predictor must NOT use them — see Self::htt_factors_undamped.

§htt_factors_undamped: ArrowUndampedFactors

Per-row lower-triangular Cholesky factors of the UNDAMPED H_tt^(i) (no ridge_t added).

The IFT predictor formula Δt_i = -(H_tt^(i))⁻¹ · (H_tβ^(i) Δβ + δg_t^(i)) is derived from ∂g_t/∂t = H_tt at the stationary point, with no LM damping term. Reusing the damped factors would bias the predicted shift toward zero in proportion to ridge_t. We pay one extra O(N d³) Cholesky per Newton solve — the same complexity class as the Newton solve itself — to make the IFT exact.

§schur_factor: Option<Array2<f64>>

Lower-triangular Cholesky factor of the Schur complement when the selected BA mode formed/factored dense RCS. None for ArrowSolverMode::InexactPCG, where Agarwal-style inexact LM avoids the dense K × K factor.

§joint_hessian_log_det: Option<f64>

Exact undamped joint-Hessian log-determinant produced by the dense factorization path. REML evidence consumes this directly so the Laplace normalizer cannot miss the log-det even when later cache consumers only need solves/traces.

§solver_mode: ArrowSolverMode

BA mode used to create this cache.

§ridge_t: f64

Ridge values used to build the cached factors (recorded so the warm-start predictor knows whether the cache is still valid for a requested ridge level).

§ridge_beta: f64§htbeta: ArrowHtbetaCache

Per-row cross-block access for H_tβ^(i) x.

Large caches retain a row matvec callback or disable β-coupled IFT prediction instead of cloning every dense d × K slab.

§d: usize

Maximum per-row latent dim (upper bound; matches sys.d at creation).

§row_dims: Arc<[usize]>

Per-row latent dims: row_dims[i] is the active dim for row i.

§row_offsets: Arc<[usize]>

Flat-buffer row offsets for delta_t / IFT output vectors. row_offsets[i] is the start of row i; row_offsets[n] is the total length.

§k: usize

β dimensionality K.

§manifold_mode_fingerprint: u64

Geometry tag for the row-local factors and cross-blocks.

§row_hessian_fingerprint: u64

Row-system tag for the cached per-row factors, cross-blocks, and shared-block diagonal used to build the Schur factor.

§pcg_diagnostics: PcgDiagnostics

PCG instrumentation from the solve that produced this cache.

Zero-valued (default) when the selected mode did not use PCG (i.e. Direct or SqrtBA).

§gauge_deflated_directions: usize

Number of row-local gauge directions stiffened in an undamped evidence factorization.

Each direction is stiffened at UNIT stiffness kappa = 1.0, so it contributes log(1) = 0 to the row-block logdet through the returned Cholesky factor: the gauge orbit is a criterion null direction and adds nothing to the Laplace normalizer (the quotient pseudo-determinant convention, cf. PenaltyPseudologdet). Zero theta/rho dependence.

§deflated_row_directions: Arc<[Vec<Array1<f64>>]>

Per-row unit-norm directions vᵢ (in each row’s d-dim latent block coordinates) that an undamped evidence factorization stiffened to UNIT stiffness λ̃ = 1 (gauge or spectral deflation). Indexed by row; empty for every PD row factored without deflation, and empty overall on the non-deflating solver paths (streaming / cross-row-penalty CG / device).

A deflated direction contributes log(1) = 0 to the row-block log-det and is ρ/θ-INDEPENDENT, so its true contribution to ∂log|H|/∂ρ is 0. The analytic outer-gradient traces (assignment_log_strength_hessian_trace, learnable_ibp_data_logdet_alpha_trace, logdet_theta_adjoint) contract ∂H_raw/∂ρ (the RAW, pre-deflation block derivative) against the DEFLATED inverse, which assigns 1/λ̃ = 1 to each vᵢ and therefore spuriously adds ½ vᵢᵀ (∂H_raw/∂ρ) vᵢ. Those traces subtract this per-row term (kept-subspace restriction) using these directions; without them the REML outer ρ-gradient is biased by +Σ_deflated ½ vᵢᵀ ∂H_raw/∂ρ vᵢ.

§deflation_row_spectra: Arc<[Option<RowDeflationSpectrum>]>

Per-row RAW spectral decomposition of an undamped evidence H_tt block that underwent SPECTRAL deflation, surfaced so the outer ρ/θ-gradient traces can apply the EXACT deflation-map (Daleckii–Krein) derivative correction, not just the within-row kept-subspace term.

The criterion VALUE re-deflates H_tt at every ρ, so its gradient is tr(H_deflated⁻¹ DΦ[∂H_raw/∂ρ]), where Φ is the spectral pin-to-unit map. By Daleckii–Krein DΦ[Ȧ] = U (F ∘ UᵀȦU) Uᵀ with the divided- difference matrix F_{ml} = (λ̃ₘ − λ̃ₗ)/(λₘ − λₗ) (raw λ in the denominator, conditioned λ̃ in the numerator). The kept×kept block of F is 1 (the kept subspace contracts the raw derivative unchanged), the deflated×deflated block is 0, and the kept(m)×deflated(i) block is (λₘ − 1)/(λₘ − λᵢ) — this last, ROTATION, term is what the per-row kept-subspace correction alone misses; it couples to the β-block through the Schur back-substitution carried in (H⁻¹)_tt.

Some(spectrum) only for spectrally-deflated rows; None for PD rows, gauge-only deflation (ρ-independent structural null — within-row term suffices), and every non-SAE-evidence solver path (streaming / device / cross-row CG). Empty overall when no row deflated spectrally.

§cross_row_woodbury: Option<CrossRowWoodbury>

Exact cross-row IBP rank-R Woodbury correction (#1038), present iff the source system carried an IbpCrossRowSource. When set, the per-row factors above are of the NO-SELF base H₀' (self term d_k·z'_ik² downdated from each logit diagonal), and this carrier supplies the exact rank-R correction so the value/curvature solve (Self::full_inverse_apply), the evidence log-determinant (Self::arrow_log_det), and the θ/ρ-adjoint all describe the same H_full = H₀' + U D Uᵀ.

Struct ArrowFactorCache Copy item path

Fields§

Implementations§

impl ArrowFactorCache

pub fn n_rows(&self) -> usize

pub fn htbeta_available(&self) -> bool

pub fn used_device(&self) -> bool

pub fn undamped_factor(&self, row: usize) -> ArrayView2<'_, f64>

pub fn undamped_factor_count(&self) -> usize

pub fn undamped_factors_iter( &self, ) -> impl Iterator<Item = ArrayView2<'_, f64>> + '_

pub fn compute_undamped_arrow_log_det(&self) -> Option<f64>

pub fn delta_t_len(&self) -> usize

pub fn apply_htbeta_row( &self, row: usize, delta_beta: ArrayView1<'_, f64>, out: &mut Array1<f64>, ) -> bool

pub fn apply_htbeta_row_transpose( &self, row: usize, v: ArrayView1<'_, f64>, out: &mut Array1<f64>, fallback_op: Option<&RowHtbetaMatvec>, ) -> bool

pub fn arrow_log_det(&self) -> (f64, Option<f64>)

pub fn cross_row_woodbury_log_det(&self) -> f64

pub fn latent_block_inverse_diagonal( &self, ) -> Result<Array1<f64>, ArrowSchurError>

§Consuming the diagonal as a per-(atom, axis) trace

§Errors

pub fn full_inverse_apply( &self, w_t: ArrayView1<'_, f64>, w_beta: ArrayView1<'_, f64>, ) -> Result<(Array1<f64>, Array1<f64>), ArrowSchurError>

pub fn schur_inverse_apply( &self, rhs: ArrayView1<'_, f64>, ) -> Result<Array1<f64>, ArrowSchurError>

§Errors

pub fn schur_inverse_block( &self, block: Range<usize>, ) -> Result<Array2<f64>, ArrowSchurError>

Trait Implementations§

impl Clone for ArrowFactorCache

fn clone(&self) -> ArrowFactorCache

fn clone_from(&mut self, source: &Self)

impl Debug for ArrowFactorCache

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl !RefUnwindSafe for ArrowFactorCache

impl !UnwindSafe for ArrowFactorCache

impl Freeze for ArrowFactorCache

impl Send for ArrowFactorCache

impl Sync for ArrowFactorCache

impl Unpin for ArrowFactorCache

impl UnsafeUnpin for ArrowFactorCache

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> ByRef<T> for T

fn by_ref(&self) -> &T

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DTwhere ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DTwhere ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> DistributionExt for Twhere T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> Twhere Self: Distribution<T>,

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Imply<T> for Uwhere T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<T> Read<Exclusive, BecauseExclusive> for Twhere T: ?Sized,

impl<T> Same for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SPwhere SS: SubsetOf<SP>,

fn to_subset(&self) -> Option<SS>

fn is_in_subset(&self) -> bool

fn to_subset_unchecked(&self) -> SS

fn from_subset(element: &SS) -> SP

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

Struct ArrowFactorCache

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T> DistributionExt for T
where T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T
where Self: Distribution<T>,

impl<T, U> Imply<T> for U
where T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,