pub struct LBFGS<L, P, G, F> { /* private fields */ }
Limited-memory BFGS (L-BFGS) method
L-BFGS is an approximation to BFGS which requires only a limited amount of memory. Instead of storing the dense inverse Hessian approximation, only a few vectors which implicitly represent it are stored.
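The "few vectors" are the m most recent pairs of parameter and gradient differences, which are combined via the two-loop recursion (Nocedal & Wright, Algorithm 7.4) to apply the implicit inverse Hessian to the gradient. The following is a minimal, self-contained sketch of that recursion, not this crate's internal implementation; the function names are illustrative:

```rust
fn dot(a: &[f64], b: &[f64]) -> f64 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Computes the search direction -H_k * grad from the stored pairs
/// s_i (parameter differences) and y_i (gradient differences),
/// ordered oldest first. This replaces the dense inverse Hessian
/// that full BFGS would store.
fn two_loop_direction(grad: &[f64], s: &[Vec<f64>], y: &[Vec<f64>]) -> Vec<f64> {
    let m = s.len();
    let mut q: Vec<f64> = grad.to_vec();
    let mut alpha = vec![0.0; m];
    // First loop: newest pair to oldest.
    for i in (0..m).rev() {
        let rho = 1.0 / dot(&y[i], &s[i]);
        alpha[i] = rho * dot(&s[i], &q);
        for (qj, yj) in q.iter_mut().zip(&y[i]) {
            *qj -= alpha[i] * yj;
        }
    }
    // Initial Hessian approximation H_0 = gamma * I, with gamma taken
    // from the most recent pair (a common scaling choice).
    let gamma = if m > 0 {
        dot(&s[m - 1], &y[m - 1]) / dot(&y[m - 1], &y[m - 1])
    } else {
        1.0
    };
    let mut r: Vec<f64> = q.iter().map(|qi| gamma * qi).collect();
    // Second loop: oldest pair to newest.
    for i in 0..m {
        let rho = 1.0 / dot(&y[i], &s[i]);
        let beta = rho * dot(&y[i], &r);
        for (rj, sj) in r.iter_mut().zip(&s[i]) {
            *rj += (alpha[i] - beta) * sj;
        }
    }
    // The descent direction is the negative of H_k * grad.
    r.iter().map(|ri| -ri).collect()
}

fn main() {
    // With an empty history the recursion reduces to steepest descent.
    let d = two_loop_direction(&[2.0, -4.0], &[], &[]);
    println!("{:?}", d); // [-2.0, 4.0]
}
```

Note that only 2m vectors (plus a handful of scalars) are needed, versus the n-by-n matrix of full BFGS.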
It requires a line search, and the number of vector pairs to be stored (the history size m) must be set. Additionally, an initial guess for the parameter vector is required, which is to be provided via the configure method of the Executor (see IterState, in particular IterState::param). The initial gradient and cost function value corresponding to the initial parameter vector can be provided in the same way. If these are not provided, they will be computed during initialization of the algorithm.
Two tolerances can be configured, both of which are used in the stopping criteria. One is a tolerance on the gradient (set with with_tolerance_grad): if the norm of the gradient is below this tolerance, the algorithm stops. It defaults to sqrt(EPSILON). The other is a tolerance on the change of the cost function from one iteration to the next: if the change is below this tolerance (default: EPSILON), the algorithm stops. This parameter can be set via with_tolerance_cost.
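Taken together, the two criteria can be sketched as follows. This is a simplified illustration, not this crate's actual termination code; the function and parameter names are hypothetical, and the exact form of the cost-change check may differ:

```rust
/// Simplified stopping check mirroring the two tolerances described
/// above: stop when the gradient norm falls below tol_grad, or when
/// the change in cost between iterations falls below tol_cost.
fn should_stop(grad_norm: f64, prev_cost: f64, cost: f64,
               tol_grad: f64, tol_cost: f64) -> bool {
    grad_norm < tol_grad || (prev_cost - cost).abs() < tol_cost
}

fn main() {
    let eps = f64::EPSILON;
    // Gradient norm already tiny: stop.
    println!("{}", should_stop(1e-12, 1.0, 0.9, eps.sqrt(), eps)); // true
    // Gradient and cost both still changing noticeably: keep going.
    println!("{}", should_stop(0.5, 1.0, 0.9, eps.sqrt(), eps)); // false
}
```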
Orthant-Wise Limited-memory Quasi-Newton (OWL-QN) method
OWL-QN is a method that adapts L-BFGS to L1-regularization. The original L-BFGS requires the loss function to be differentiable and does not support L1-regularization; therefore, this library switches to OWL-QN when L1-regularization is specified. L1-regularization can be enabled via with_l1_regularization.
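The key modification in OWL-QN is replacing the gradient of the non-differentiable L1 term with a pseudo-gradient (Andrew & Gao, 2007): at a zero coordinate, it picks whichever one-sided derivative permits descent, or zero if neither does. A minimal sketch of that computation, illustrative rather than this crate's internals:

```rust
/// Pseudo-gradient of f(x) + c * ||x||_1. At x_i == 0 the L1 term is
/// not differentiable; the pseudo-gradient uses the one-sided
/// derivative that allows a decrease, or 0 if zero is already optimal
/// for that coordinate.
fn pseudo_gradient(x: &[f64], grad: &[f64], c: f64) -> Vec<f64> {
    x.iter()
        .zip(grad)
        .map(|(&xi, &gi)| {
            if xi > 0.0 {
                gi + c
            } else if xi < 0.0 {
                gi - c
            } else if gi + c < 0.0 {
                gi + c // right derivative is negative: moving right descends
            } else if gi - c > 0.0 {
                gi - c // left derivative is positive: moving left descends
            } else {
                0.0 // zero lies in the subdifferential: stay at 0
            }
        })
        .collect()
}

fn main() {
    // With c = 1: at x_i = 0, small gradients (|g| <= c) yield 0,
    // so L1-regularization keeps those coordinates exactly sparse.
    let pg = pseudo_gradient(&[2.0, 0.0, 0.0], &[1.0, 0.5, -3.0], 1.0);
    println!("{:?}", pg); // [2.0, 0.0, -2.0]
}
```

This is why OWL-QN can produce exactly sparse solutions: coordinates whose plain gradient is smaller in magnitude than the L1 coefficient stay pinned at zero.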
TODO: Implement compact representation of BFGS updating (Nocedal/Wright p.230)
Requirements on the optimization problem
The optimization problem is required to implement CostFunction and Gradient.
References
Jorge Nocedal and Stephen J. Wright (2006). Numerical Optimization. Springer. ISBN 0-387-30303-0.
Galen Andrew and Jianfeng Gao (2007). Scalable Training of L1-Regularized Log-Linear Models, International Conference on Machine Learning.
Implementations
impl<L, P, G, F> LBFGS<L, P, G, F> where
F: ArgminFloat,
pub fn with_tolerance_grad(self, tol_grad: F) -> Result<Self, Error>
The algorithm stops if the norm of the gradient is below tol_grad. The provided value must be non-negative. Defaults to sqrt(EPSILON).
Example
let lbfgs: LBFGS<_, Vec<f64>, Vec<f64>, f64> = LBFGS::new(linesearch, 3).with_tolerance_grad(1e-6)?;
pub fn with_tolerance_cost(self, tol_cost: F) -> Result<Self, Error>
Sets the tolerance for the stopping criterion based on the change of the cost function. The provided value must be non-negative. Defaults to EPSILON.
Example
let lbfgs: LBFGS<_, Vec<f64>, Vec<f64>, f64> = LBFGS::new(linesearch, 3).with_tolerance_cost(1e-6)?;
pub fn with_l1_regularization(self, l1_coeff: F) -> Result<Self, Error>
Activates L1-regularization with coefficient l1_coeff. Parameter l1_coeff must be > 0.0.
Example
let lbfgs: LBFGS<_, Vec<f64>, Vec<f64>, f64> = LBFGS::new(linesearch, 3).with_l1_regularization(1.0)?;
Trait Implementations
impl<'de, L, P, G, F> Deserialize<'de> for LBFGS<L, P, G, F> where
L: Deserialize<'de>,
P: Deserialize<'de>,
G: Deserialize<'de>,
F: Deserialize<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
impl<L, P, G, F> Serialize for LBFGS<L, P, G, F> where
L: Serialize,
P: Serialize,
G: Serialize,
F: Serialize,
impl<O, L, P, G, F> Solver<O, IterState<P, G, (), (), F>> for LBFGS<L, P, G, F> where
O: CostFunction<Param = P, Output = F> + Gradient<Param = P, Gradient = G>,
P: Clone + Debug + SerializeAlias + DeserializeOwnedAlias + ArgminSub<P, P> + ArgminSub<F, P> + ArgminAdd<P, P> + ArgminAdd<F, P> + ArgminDot<G, F> + ArgminMul<F, P> + ArgminMul<P, P> + ArgminMul<G, P> + ArgminL1Norm<F> + ArgminSignum + ArgminZeroLike + ArgminMinMax,
G: Clone + Debug + SerializeAlias + DeserializeOwnedAlias + ArgminL2Norm<F> + ArgminSub<G, G> + ArgminAdd<G, G> + ArgminAdd<P, G> + ArgminDot<G, F> + ArgminDot<P, F> + ArgminMul<F, G> + ArgminMul<F, P> + ArgminZeroLike + ArgminMinMax,
L: Clone + LineSearch<P, F> + Solver<LineSearchProblem<O, P, G, F>, IterState<P, G, (), (), F>>,
F: ArgminFloat,
fn init(
&mut self,
problem: &mut Problem<O>,
state: IterState<P, G, (), (), F>
) -> Result<(IterState<P, G, (), (), F>, Option<KV>), Error>
Initializes the algorithm. Read more
fn next_iter(
&mut self,
problem: &mut Problem<O>,
state: IterState<P, G, (), (), F>
) -> Result<(IterState<P, G, (), (), F>, Option<KV>), Error>
fn terminate(&mut self, state: &IterState<P, G, (), (), F>) -> TerminationReason
Used to implement stopping criteria, in particular criteria which are not covered by terminate_internal. Read more
fn terminate_internal(&mut self, state: &I) -> TerminationReason
Checks whether basic termination reasons apply. Read more
Auto Trait Implementations
impl<L, P, G, F> RefUnwindSafe for LBFGS<L, P, G, F> where
F: RefUnwindSafe,
G: RefUnwindSafe,
L: RefUnwindSafe,
P: RefUnwindSafe,
impl<L, P, G, F> Send for LBFGS<L, P, G, F> where
F: Send,
G: Send,
L: Send,
P: Send,
impl<L, P, G, F> Sync for LBFGS<L, P, G, F> where
F: Sync,
G: Sync,
L: Sync,
P: Sync,
impl<L, P, G, F> Unpin for LBFGS<L, P, G, F> where
F: Unpin,
G: Unpin,
L: Unpin,
P: Unpin,
impl<L, P, G, F> UnwindSafe for LBFGS<L, P, G, F> where
F: UnwindSafe,
G: UnwindSafe,
L: UnwindSafe,
P: UnwindSafe,
Blanket Implementations
impl<T> BorrowMut<T> for T where
T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more