Struct LogWeight

Source

pub struct LogWeight(/* private fields */);

Expand description

Log weight for numerically stable probability computation

The log semiring addresses numerical stability issues inherent in probability computation by working in the logarithmic domain. This enables handling of very small probabilities (e.g., 10^-100) that would underflow in linear probability space, while maintaining mathematically equivalent probabilistic semantics.

§Mathematical Semantics

Value Range: Real numbers ℝ plus positive infinity (+∞)
Addition (⊕): -log(e^(-a) + e^(-b)) (log-sum-exp) - combines probabilities
Multiplication (⊗): a + b (addition in log space) - multiplies probabilities
Zero (0̄): +∞ - represents impossible event (probability 0)
One (1̄): 0.0 - represents certain event (probability 1)

§Relationship to Probability Semiring

The log semiring is related to the probability semiring through logarithmic transformation:

If p, q are probabilities, then -log p ⊕ -log q = -log(p + q)
If p, q are probabilities, then -log p ⊗ -log q = -log(p × q)

This preserves probabilistic semantics while providing numerical stability.

§Use Cases

§Large-Vocabulary Speech Recognition

use arcweight::prelude::*;

// Acoustic model probabilities (very small values)
let acoustic_prob = LogWeight::from_probability(1e-50);  // Converts to log space
let language_prob = LogWeight::from_probability(0.001);

// Combine probabilities safely
let combined = acoustic_prob.times(&language_prob);  // Multiplication in log space

// Convert back to probability if needed
let final_prob = combined.to_probability();
println!("Final probability: {:.2e}", final_prob);  // ~1e-53

§Machine Translation with Large Models

use arcweight::prelude::*;

// Translation model scores (often very small probabilities)
let phrase_prob = LogWeight::from_probability(1e-20);
let alignment_prob = LogWeight::from_probability(1e-15);
let reordering_prob = LogWeight::from_probability(0.1);

// Combine all model scores
let translation_score = phrase_prob
    .times(&alignment_prob)
    .times(&reordering_prob);

// Alternative translations (add probabilities)
let alternative1 = LogWeight::from_probability(1e-35);
let alternative2 = LogWeight::from_probability(2e-35);
let combined_alternatives = alternative1.plus(&alternative2);  // LogSumExp

§Neural Language Model Integration

use arcweight::prelude::*;

// Softmax probabilities from neural networks
let word_probs = vec![
    LogWeight::from_probability(0.4),   // Most likely word
    LogWeight::from_probability(0.3),   // Second choice
    LogWeight::from_probability(0.2),   // Third choice
    LogWeight::from_probability(0.1),   // Least likely
];

// Compute probability of any of these words
let any_word_prob = word_probs.into_iter()
    .fold(LogWeight::zero(), |acc, prob| acc.plus(&prob));

// Should be close to 1.0 (sum of probabilities)
assert!((any_word_prob.to_probability() - 1.0).abs() < 1e-10);

§Sequence Analysis in Bioinformatics

use arcweight::prelude::*;

// DNA sequence alignment with very long sequences
let base_prob = LogWeight::from_probability(0.25);  // Each base equally likely
let sequence_length = 1000;

// Probability of specific sequence (would underflow in linear space)
let mut sequence_prob = LogWeight::one();
for _ in 0..sequence_length {
    sequence_prob = sequence_prob.times(&base_prob);
}

// Convert to scientific notation for display
println!("Sequence probability: {:.2e}", sequence_prob.to_probability());

§Working with FSTs

use arcweight::prelude::*;

let log_p1 = LogWeight::from_probability(0.5);   // Convert from probability
let log_p2 = LogWeight::from_probability(0.25);  // Convert from probability

// Addition performs log-sum-exp (combines probabilities)
let sum = log_p1 + log_p2;  // -log(0.5 + 0.25) = -log(0.75)
assert!((sum.to_probability() - 0.75).abs() < 1e-6);

// Multiplication is addition in log space (multiplies probabilities)
let product = log_p1 * log_p2;  // -log(0.5 × 0.25) = -log(0.125)
assert!((product.to_probability() - 0.125).abs() < 1e-10);

// Identity elements
assert_eq!(LogWeight::zero(), LogWeight::INFINITY);  // Impossible event
assert_eq!(LogWeight::one(), LogWeight::new(0.0));   // Certain event

§Numerical Stability Implementation

The log semiring implements numerically stable log-sum-exp operation:

-log(e^(-a) + e^(-b)) = -max(a,b) - log(1 + e^(-|a-b|))

This formulation prevents overflow/underflow by:

Working with the larger magnitude value first
Computing the difference in a stable manner
Using the identity: log(1 + x) ≈ x for small x

§Performance Characteristics

Arithmetic: Addition is expensive (log-sum-exp), multiplication is O(1)
Memory: 8 bytes per weight (single f64)
Precision: Double precision for high-accuracy probability computation
Conversion: Efficient probability ↔ log conversions available
Range: Handles probabilities from ~10^-308 to 1.0

§Conversion Utilities

use arcweight::prelude::*;

// Convert from probability to log weight
let prob = 0.001;
let log_weight = LogWeight::from_probability(prob);
assert_eq!(log_weight.value(), &(-prob.ln()));

// Convert back to probability
let recovered_prob = log_weight.to_probability();
assert!((recovered_prob - prob).abs() < 1e-15);

// Handle edge cases
let zero_prob = LogWeight::from_probability(0.0);
assert!(<LogWeight as num_traits::Zero>::is_zero(&zero_prob));
assert_eq!(zero_prob.to_probability(), 0.0);

§Advanced Usage

§Normalization in Log Space

use arcweight::prelude::*;

// Normalize a probability distribution in log space
let log_probs = vec![
    LogWeight::new(1.0),  // Unnormalized log probabilities
    LogWeight::new(2.0),
    LogWeight::new(0.5),
];

// Compute log partition function (log of sum of probabilities)
let log_z = log_probs.iter()
    .fold(LogWeight::zero(), |acc, &p| acc.plus(&p));

// Normalize each probability
let normalized: Vec<_> = log_probs.iter()
    .map(|&p| p.divide(&log_z).unwrap())
    .collect();

// Verify normalization (sum should be 1.0)
let sum = normalized.iter()
    .fold(LogWeight::zero(), |acc, &p| acc.plus(&p));
assert!((sum.to_probability() - 1.0).abs() < 1e-10);

§Integration with FST Algorithms

Log weights work with all FST algorithms while providing numerical stability:

Shortest Path: Finds maximum probability paths
Forward-Backward: Stable computation of path probabilities
Composition: Combines probabilistic models
Determinization: Maintains probability distributions

§See Also

Core Concepts - Log Semiring for mathematical background
ProbabilityWeight for simple probability computation
TropicalWeight for optimization problems

Struct LogWeight Copy item path

§Mathematical Semantics

§Relationship to Probability Semiring

§Use Cases

§Large-Vocabulary Speech Recognition

§Machine Translation with Large Models

§Neural Language Model Integration

§Sequence Analysis in Bioinformatics

§Working with FSTs

§Numerical Stability Implementation

§Performance Characteristics

§Conversion Utilities

§Advanced Usage

§Normalization in Log Space

§Integration with FST Algorithms

§See Also

Implementations§

impl LogWeight

pub const INFINITY: Self

pub fn new(value: f64) -> Self

pub fn from_probability(p: f64) -> Self

pub fn to_probability(&self) -> f64

Trait Implementations§

impl Add for LogWeight

type Output = LogWeight

fn add(self, rhs: Self) -> Self::Output

impl Clone for LogWeight

fn clone(&self) -> LogWeight

fn clone_from(&mut self, source: &Self)

impl Debug for LogWeight

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl<'de> Deserialize<'de> for LogWeight

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl Display for LogWeight

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl DivisibleSemiring for LogWeight

fn divide(&self, other: &Self) -> Option<Self>

impl Hash for LogWeight

fn hash<__H: Hasher>(&self, state: &mut __H)

fn hash_slice<H>(data: &[Self], state: &mut H)where H: Hasher, Self: Sized,

impl Mul for LogWeight

type Output = LogWeight

fn mul(self, rhs: Self) -> Self::Output

impl One for LogWeight

fn one() -> Self

fn set_one(&mut self)

fn is_one(&self) -> boolwhere Self: PartialEq,

impl Ord for LogWeight

fn cmp(&self, other: &LogWeight) -> Ordering

fn max(self, other: Self) -> Selfwhere Self: Sized,

fn min(self, other: Self) -> Selfwhere Self: Sized,

fn clamp(self, min: Self, max: Self) -> Selfwhere Self: Sized,

impl PartialEq for LogWeight

fn eq(&self, other: &LogWeight) -> bool

fn ne(&self, other: &Rhs) -> bool

impl PartialOrd for LogWeight

fn partial_cmp(&self, other: &LogWeight) -> Option<Ordering>

fn lt(&self, other: &Rhs) -> bool

fn le(&self, other: &Rhs) -> bool

fn gt(&self, other: &Rhs) -> bool

fn ge(&self, other: &Rhs) -> bool

impl Semiring for LogWeight

type Value = f64

fn new(value: Self::Value) -> Self

fn value(&self) -> &Self::Value

fn properties() -> SemiringProperties

fn approx_eq(&self, other: &Self, epsilon: f64) -> bool

fn plus(&self, other: &Self) -> Self

fn times(&self, other: &Self) -> Self

fn plus_assign(&mut self, other: &Self)

fn times_assign(&mut self, other: &Self)

fn is_zero(&self) -> bool

fn is_one(&self) -> bool

impl Serialize for LogWeight

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

impl Zero for LogWeight

fn zero() -> Self

fn is_zero(&self) -> bool

fn set_zero(&mut self)

impl Copy for LogWeight

Struct LogWeight

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn hash<H: Hasher>(&self, state: &mut H)

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

fn is_one(&self) -> bool
where Self: PartialEq,

fn max(self, other: Self) -> Self
where Self: Sized,

fn min(self, other: Self) -> Self
where Self: Sized,

fn clamp(self, min: Self, max: Self) -> Self
where Self: Sized,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> ToOwned for T
where T: Clone,

impl<T> ToString for T
where T: Display + ?Sized,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,