Skip to main content

MultinomialLogitLikelihood

Struct MultinomialLogitLikelihood 

Source
pub struct MultinomialLogitLikelihood {
    pub active_classes: usize,
    pub row_weights: Option<Array1<f64>>,
}
Expand description

Multinomial-logit (softmax) likelihood with explicit reference class.

Conventions:

  • K is the total number of classes; the linear predictor has M = K - 1 columns corresponding to the active classes. Class K - 1 is the reference class with η_{K-1} ≡ 0 (so the gauge is fixed by construction and no additional sum-to-zero projection is required at the η level).
  • y is the categorical response with shape (N, K). Each row must be a point on the probability simplex (y_c ≥ 0, Σ_c y_c = 1): a one-hot indicator for hard-label classification, or a label-smoothed probability vector. The row weight w_n scales the whole row’s likelihood contribution and is independent of the row mass — it is not the row sum. Callers enforce the simplex precondition via [validate_multinomial_simplex] at every construction boundary; under it the residual gradient y_a − p_a and Fisher block p_a δ_ab − p_a p_b below are the exact derivatives of the log-likelihood Σ_c y_c log p_c.
  • eta is the active linear predictor with shape (N, M = K - 1).

Softmax with baseline:

    p_a   = exp(η_a) / (1 + Σ_b exp(η_b))           for a ∈ [0, K-1)
    p_{K-1} = 1 / (1 + Σ_b exp(η_b))

Log-likelihood (rows with weight w_n, default 1.0):

    log L = Σ_n w_n · ( Σ_{a < K-1} y_{n,a} · η_{n,a} − log(1 + Σ_b exp(η_{n,b})) )
          = Σ_n w_n · Σ_{c ∈ [0, K)} y_{n,c} · log p_{n,c}

Per-row gradient w.r.t. the active η is the canonical Bernoulli/softmax residual:

    ∂ log L / ∂η_{n,a} = w_n · (y_{n,a} − p_{n,a})       for a ∈ [0, K-1)

Per-row Fisher (= observed, since logit is canonical for the multinomial) information block, shape (M, M):

    H_{n,a,b} = w_n · ( p_{n,a} · δ_{ab} − p_{n,a} · p_{n,b} )

This is the standard reference-coded multinomial-logit GLM. The dense per-row block flows through VectorLikelihood::hess_block into gam_solve::pirls::dense_block_xtwx, which builds the stacked XᵀWX in output-major coefficient ordering β = [β_0; β_1; …; β_{K-2}] with each per-class block of size (P, P).

Fields§

§active_classes: usize

Number of active classes M = K − 1. Cached for shape checks.

§row_weights: Option<Array1<f64>>

Optional row weights (length N), or None for uniform 1.0.

Implementations§

Source§

impl MultinomialLogitLikelihood

Source

pub fn with_classes(total_classes: usize) -> Result<Self, EstimationError>

Construct from the total number of classes K ≥ 2.

Source

pub fn with_row_weights(self, w: Array1<f64>) -> Result<Self, EstimationError>

Attach per-row weights (length N, finite and non-negative).

Source

pub fn total_classes(&self) -> usize

Total class count K = M + 1.

Source

pub fn softmax_with_baseline(eta_active: &[f64], out: &mut [f64])

Numerically-stable softmax with implicit reference column (η_{K-1} = 0).

Writes K probabilities into out (length M + 1). The shift uses max(0, max(eta_active)) so the reference class is included in the max and the denominator stays bounded. This is the canonical reference implementation; the FFI surface and any direct matrix-free callers route through this method rather than carrying their own softmax.

Source

pub fn probabilities(&self, eta: ArrayView2<'_, f64>) -> Array2<f64>

Convenience: compute the full (N, K) probability matrix from (N, K-1) active linear predictor. This is the multinomial inverse link used by prediction.

Trait Implementations§

Source§

impl Clone for MultinomialLogitLikelihood

Source§

fn clone(&self) -> MultinomialLogitLikelihood

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for MultinomialLogitLikelihood

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl VectorLikelihood for MultinomialLogitLikelihood

Source§

fn log_lik(&self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>) -> f64

log p(Y | η).
Source§

fn grad_eta( &self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>, ) -> Array2<f64>

∂ log p(Y | η) / ∂ η, shape (N, M).
Source§

fn hess_diag( &self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>, ) -> Array2<f64>

Diagonal of the per-row Hessian −∂² log p / ∂ η ∂ η, shape (N, M). This is the per-row block consumed by solver/arrow_schur.rs.
Source§

fn hess_block( &self, eta: ArrayView2<'_, f64>, y: ArrayView2<'_, f64>, ) -> Array3<f64>

Per-row dense Hessian block −∂² log p / ∂η_a ∂η_b, shape (N, M, M). Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> ByRef<T> for T

Source§

fn by_ref(&self) -> &T

Source§

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> DistributionExt for T
where T: ?Sized,

Source§

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T
where Self: Distribution<T>,

Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Imply<T> for U
where T: ?Sized, U: ?Sized,

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,

Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

Source§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
Source§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
Source§

fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
Source§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V