Struct MultinomialLogitLikelihood

pub struct MultinomialLogitLikelihood {
    pub active_classes: usize,
    pub row_weights: Option<Array1<f64>>,
}

Multinomial-logit (softmax) likelihood with explicit reference class.

Conventions:

K is the total number of classes; the linear predictor has M = K - 1 columns corresponding to the active classes. Class K - 1 is the reference class with η_{K-1} ≡ 0 (so the gauge is fixed by construction and no additional sum-to-zero projection is required at the η level).
y is the categorical response with shape (N, K). Each row must be a point on the probability simplex (y_c ≥ 0, Σ_c y_c = 1): either a one-hot indicator for hard-label classification or a label-smoothed probability vector. The row weight w_n scales the whole row’s likelihood contribution and is independent of the row mass; it is not the row sum. Callers enforce the simplex precondition via [validate_multinomial_simplex] at every construction boundary. Under that precondition, the residual gradient y_a − p_a below is the exact first derivative of the log-likelihood Σ_c y_c log p_c, and the Fisher block p_a δ_ab − p_a p_b is its exact negated second derivative.
eta is the active linear predictor with shape (N, M = K - 1).
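
The simplex precondition on each row of y can be illustrated with a small check. This is only a sketch: the function name, the tolerance parameter, and the boolean return are assumptions made for illustration, and the signature of validate_multinomial_simplex is not shown on this page.

```rust
/// Hypothetical helper (not the crate's `validate_multinomial_simplex`):
/// returns true when `row` is a point on the probability simplex,
/// i.e. every entry is finite and non-negative and the entries sum to 1
/// within the absolute tolerance `tol`.
fn is_on_simplex_sketch(row: &[f64], tol: f64) -> bool {
    let nonneg = row.iter().all(|&v| v.is_finite() && v >= 0.0);
    let sum: f64 = row.iter().sum();
    nonneg && (sum - 1.0).abs() <= tol
}
```

Both a one-hot row such as [0, 1, 0] and a label-smoothed row such as [0.9, 0.05, 0.05] pass this check; a row whose mass differs from 1 does not, regardless of its row weight.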

Softmax with baseline:

    p_a     = exp(η_a) / (1 + Σ_b exp(η_b))         for a ∈ [0, K-1)
    p_{K-1} = 1 / (1 + Σ_b exp(η_b))
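
A minimal sketch of the baseline softmax above, assuming a max-shift for numerical stability. The function name and the Vec return type are illustrative assumptions; this is not the crate's softmax_with_baseline, whose output layout is not documented here.

```rust
/// Hypothetical helper: returns all K probabilities for one row,
/// the M = K - 1 active classes first and the reference class last.
fn softmax_with_baseline_sketch(eta_active: &[f64]) -> Vec<f64> {
    // The reference class has η = 0, so 0 takes part in the max shift.
    let max = eta_active.iter().copied().fold(0.0_f64, f64::max);
    // Shifted numerators exp(η_a - max) for the active classes.
    let mut out: Vec<f64> = eta_active.iter().map(|&e| (e - max).exp()).collect();
    // Reference class numerator: exp(0 - max).
    out.push((-max).exp());
    // The shift cancels in the ratio, so this equals the formulas above.
    let denom: f64 = out.iter().sum();
    for p in out.iter_mut() {
        *p /= denom;
    }
    out
}
```

Shifting by max(0, max_a η_a) keeps every exponent at or below zero, so large positive predictors do not overflow.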

Log-likelihood (rows with weight w_n, default 1.0):

    log L = Σ_n w_n · ( Σ_{a < K-1} y_{n,a} · η_{n,a} − log(1 + Σ_b exp(η_{n,b})) )
          = Σ_n w_n · Σ_{c ∈ [0, K)} y_{n,c} · log p_{n,c}
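
The first form of the log-likelihood can be evaluated without forming the probabilities. The following sketch assumes row-major Vec storage rather than the ndarray views used by the trait, and the function name is hypothetical.

```rust
/// Hypothetical helper: weighted log-likelihood for `eta` (N rows of M)
/// and `y` (N rows of K = M + 1). `weights` of None means uniform 1.0.
fn log_lik_sketch(eta: &[Vec<f64>], y: &[Vec<f64>], weights: Option<&[f64]>) -> f64 {
    let mut total = 0.0;
    for (n, (eta_n, y_n)) in eta.iter().zip(y.iter()).enumerate() {
        let w = weights.map_or(1.0, |ws| ws[n]);
        // log(1 + Σ_b exp(η_b)) via log-sum-exp, with the reference η = 0 included.
        let max = eta_n.iter().copied().fold(0.0_f64, f64::max);
        let sum: f64 = eta_n.iter().map(|&e| (e - max).exp()).sum::<f64>() + (-max).exp();
        let log_norm = max + sum.ln();
        // Σ_{a < K-1} y_a η_a: zip stops after the M active columns of y,
        // which matches the reference column multiplying η = 0.
        let linear: f64 = eta_n.iter().zip(y_n.iter()).map(|(&e, &yc)| yc * e).sum();
        total += w * (linear - log_norm);
    }
    total
}
```

The two forms of log L agree only when each row of y sums to 1, which is why the simplex precondition matters here.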

Per-row gradient w.r.t. the active η is the canonical Bernoulli/softmax residual:

    ∂ log L / ∂η_{n,a} = w_n · (y_{n,a} − p_{n,a})       for a ∈ [0, K-1)
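
As a sketch of the residual form, the per-row gradient can be computed from the active columns of y and the active probabilities. The helper name and the slice-based signature are assumptions for illustration, not the trait's grad_eta.

```rust
/// Hypothetical helper: gradient of one row's weighted log-likelihood
/// with respect to the M active predictors, given the first M columns
/// of y and the M active probabilities.
fn grad_eta_row_sketch(y_active: &[f64], p_active: &[f64], w: f64) -> Vec<f64> {
    assert_eq!(y_active.len(), p_active.len());
    y_active
        .iter()
        .zip(p_active)
        .map(|(&y, &p)| w * (y - p))
        .collect()
}
```

The reference column of y never appears explicitly; its effect enters through the probabilities.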

Per-row Fisher information block, shape (M, M). Because the logit link is canonical for the multinomial, the Fisher (expected) information coincides with the observed information:

    H_{n,a,b} = w_n · ( p_{n,a} · δ_{ab} − p_{n,a} · p_{n,b} )
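
A minimal sketch of one such block, assuming nested Vec storage instead of the Array3 returned by hess_block. The function name is hypothetical.

```rust
/// Hypothetical helper: dense (M, M) information block for one row,
/// H[a][b] = w * (p_a * δ_ab - p_a * p_b), from the active probabilities.
fn fisher_block_row_sketch(p_active: &[f64], w: f64) -> Vec<Vec<f64>> {
    let m = p_active.len();
    let mut h = vec![vec![0.0; m]; m];
    for a in 0..m {
        for b in 0..m {
            let delta = if a == b { 1.0 } else { 0.0 };
            h[a][b] = w * (p_active[a] * delta - p_active[a] * p_active[b]);
        }
    }
    h
}
```

Each row of the block sums to w · p_a · p_{K-1}, so the reference probability is what keeps the reduced block from being singular.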

This is the standard reference-coded multinomial-logit GLM. The dense per-row block flows through VectorLikelihood::hess_block into gam_solve::pirls::dense_block_xtwx, which builds the stacked XᵀWX in output-major coefficient ordering β = [β_0; β_1; …; β_{K-2}] with each per-class block of size (P, P).
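
The output-major stacking can be sketched as follows. This assumes a single design matrix X of shape (N, P) shared by all active classes and nested Vec storage; it is not the signature or implementation of gam_solve::pirls::dense_block_xtwx, and the function name is hypothetical.

```rust
/// Hypothetical helper: accumulates the (M*P, M*P) stacked XᵀWX from
/// design rows `x` (N rows of P) and per-row blocks `h` (N blocks of M x M).
/// Class a owns rows a*P .. (a+1)*P, matching β = [β_0; β_1; …].
fn stacked_xtwx_sketch(x: &[Vec<f64>], h: &[Vec<Vec<f64>>]) -> Vec<Vec<f64>> {
    let p = x.first().map_or(0, |r| r.len());
    let m = h.first().map_or(0, |b| b.len());
    let mut out = vec![vec![0.0; m * p]; m * p];
    for (x_n, h_n) in x.iter().zip(h) {
        for a in 0..m {
            for b in 0..m {
                for i in 0..p {
                    for j in 0..p {
                        // Block (a, b) receives H_{n,a,b} · x_n x_nᵀ.
                        out[a * p + i][b * p + j] += h_n[a][b] * x_n[i] * x_n[j];
                    }
                }
            }
        }
    }
    out
}
```

The quintuple loop is written for clarity; a production routine would more likely form each (P, P) block with a matrix product.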

Fields§

§active_classes: usize

Number of active classes M = K − 1. Cached for shape checks.

§row_weights: Option<Array1<f64>>

Optional row weights (length N), or None for uniform 1.0.

Implementations§

impl MultinomialLogitLikelihood

pub fn with_classes(total_classes: usize) -> Result<Self, EstimationError>

pub fn with_row_weights(self, w: Array1<f64>) -> Result<Self, EstimationError>

pub fn total_classes(&self) -> usize

pub fn softmax_with_baseline(eta_active: &[f64], out: &mut [f64])

pub fn probabilities(&self, eta: ArrayView2<'_, f64>) -> Array2<f64>

Trait Implementations§

impl Clone for MultinomialLogitLikelihood

fn clone(&self) -> MultinomialLogitLikelihood

fn clone_from(&mut self, source: &Self)

impl Debug for MultinomialLogitLikelihood

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl VectorLikelihood for MultinomialLogitLikelihood

fn log_lik(&self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>) -> f64

fn grad_eta(&self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>) -> Array2<f64>

fn hess_diag(&self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>) -> Array2<f64>

fn hess_block(&self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>) -> Array3<f64>

Auto Trait Implementations§

impl Freeze for MultinomialLogitLikelihood

impl RefUnwindSafe for MultinomialLogitLikelihood

impl Send for MultinomialLogitLikelihood

impl Sync for MultinomialLogitLikelihood

impl Unpin for MultinomialLogitLikelihood

impl UnsafeUnpin for MultinomialLogitLikelihood

impl UnwindSafe for MultinomialLogitLikelihood

Blanket Implementations§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> ByRef<T> for T

fn by_ref(&self) -> &T

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> DistributionExt for T
where T: ?Sized,

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T
where Self: Distribution<T>,

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Imply<T> for U
where T: ?Sized, U: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> Pointable for T

const ALIGN: usize

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,

impl<T> Same for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

fn to_subset(&self) -> Option<SS>

fn is_in_subset(&self) -> bool

fn to_subset_unchecked(&self) -> SS

fn from_subset(element: &SS) -> SP

impl<T> ToOwned for T
where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V
