Numerical differentiation.
§Overview
This crate provides generic functions to evaluate various derivatives of the following types of functions:
- Univariate, scalar-valued functions ($f:\mathbb{R}\to\mathbb{R}$)
- Univariate, vector-valued functions ($\mathbf{f}:\mathbb{R}\to\mathbb{R}^{m}$)
- Multivariate, scalar-valued functions ($f:\mathbb{R}^{n}\to\mathbb{R}$)
- Multivariate, vector-valued functions ($\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$)
These functions are generic over the choice of vector representation, as long as the vector type implements the linalg_traits::Vector trait. See the linalg_traits documentation for more information.
§Example
Consider the function
$$f(\mathbf{x})=x_{0}^{5}+\sin^{3}{x_{1}}$$
The numdiff crate provides various functions that can be used to approximate its gradient. Here, we approximate its gradient at $\mathbf{x}=(5,8)^{T}$ using forward_difference::gradient() (i.e. using the forward difference approximation). We perform this gradient approximation three times, each time using a different vector type to define the function $f(\mathbf{x})$.
use nalgebra::SVector;
use ndarray::{array, Array1};
use numtest::*;
use numdiff::forward_difference::gradient;
// f(x) written in terms of a dynamically-sized standard vector (f1), a statically-sized
// nalgebra vector (f2), and a dynamically-sized ndarray vector (f3).
let f1 = |x: &Vec<f64>| x[0].powi(5) + x[1].sin().powi(3);
let f2 = |x: &SVector<f64,2>| x[0].powi(5) + x[1].sin().powi(3);
let f3 = |x: &Array1<f64>| x[0].powi(5) + x[1].sin().powi(3);
// Evaluation points using the three types of vectors.
let x1: Vec<f64> = vec![5.0, 8.0];
let x2: SVector<f64, 2> = SVector::from_row_slice(&[5.0, 8.0]);
let x3: Array1<f64> = array![5.0, 8.0];
// Approximate the gradients.
let grad_f1: Vec<f64> = gradient(&f1, &x1, None);
let grad_f2: SVector<f64, 2> = gradient(&f2, &x2, None);
let grad_f3: Array1<f64> = gradient(&f3, &x3, None);
// Verify that the gradient approximations are all identical.
assert_arrays_equal!(grad_f1, grad_f2);
assert_arrays_equal!(grad_f1, grad_f3);
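As a sanity check, the gradient above can also be computed analytically: $\nabla f(\mathbf{x})=(5x_{0}^{4},\ 3\sin^{2}{x_{1}}\cos{x_{1}})^{T}$. A self-contained sketch using plain f64 arithmetic (no external crates) evaluates this at $\mathbf{x}=(5,8)^{T}$:

```rust
// Analytic gradient of f(x) = x0^5 + sin^3(x1):
//   df/dx0 = 5 * x0^4
//   df/dx1 = 3 * sin^2(x1) * cos(x1)
fn main() {
    let (x0, x1) = (5.0_f64, 8.0_f64);
    let grad = [5.0 * x0.powi(4), 3.0 * x1.sin().powi(2) * x1.cos()];
    // First component is exactly 5 * 5^4 = 3125.
    assert_eq!(grad[0], 3125.0);
    // Second component is approximately -0.42726.
    assert!((grad[1] + 0.42726).abs() < 1e-4);
    println!("{:?}", grad);
}
```

The finite difference approximations above should agree with these values to within the truncation error of the chosen scheme.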
§Automatic Differentiation (Forward-Mode)
| Derivative Type | Function Type | Macro to Generate Derivative Function |
|---|---|---|
| derivative | $f:\mathbb{R}\to\mathbb{R}$ | get_sderivative! |
| derivative | $\mathbf{f}:\mathbb{R}\to\mathbb{R}^{m}$ | get_vderivative! |
| gradient | $f:\mathbb{R}^{n}\to\mathbb{R}$ | get_gradient! |
§Limitations
- These macros only work on functions, not on closures.
  - Currently, this also means we can't "pass extra parameters" to a function.
- Constants (e.g. 5.0_f64) need to be defined using linalg_traits::Scalar::new (e.g. if a function has the generic parameter S: Scalar, then instead of defining a constant number such as 5.0_f64, we need to do S::new(5.0)).
  - This is also the case for some functions that can take constants as arguments, such as num_traits::Float::powf.
- When defining functions that operate on generic scalars (to make them compatible with automatic differentiation), we cannot do an assignment operation such as 1.0 += x if x: S where S: Scalar.
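To illustrate the constant-handling limitation above, here is a minimal sketch of the pattern using a simplified stand-in for the linalg_traits::Scalar trait (hypothetical; the real trait has more methods and bounds). The key point is that every numeric constant goes through S::new rather than appearing as an f64 literal:

```rust
// Simplified, hypothetical stand-in for linalg_traits::Scalar.
trait Scalar: Copy + std::ops::Add<Output = Self> + std::ops::Mul<Output = Self> {
    fn new(val: f64) -> Self;
}

impl Scalar for f64 {
    fn new(val: f64) -> Self {
        val
    }
}

// Generic function compatible with automatic differentiation: constants are
// created via S::new(...) instead of being written as f64 literals, so the
// same body works for f64 and for a dual number type implementing Scalar.
fn f<S: Scalar>(x: S) -> S {
    S::new(5.0) * x * x + S::new(2.0)
}

fn main() {
    // Evaluated with plain f64: f(3) = 5 * 9 + 2 = 47.
    assert_eq!(f(3.0_f64), 47.0);
}
```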
§Alternatives
Some crates in the Rust ecosystem already implement dual numbers. Originally, I intended to implement the automatic differentiation functions in this crate using one of those dual number implementations in the backend. However, each crate had certain shortcomings that ultimately led me to provide a custom implementation of dual numbers in this crate. The alternative crates implementing dual numbers are described below.
§num-dual
- This crate can be used to differentiate functions of generic types that implement the DualNum trait. Since this trait is implemented for f32 and f64, it would allow us to write generic functions that can simply be evaluated using f64s, but can also be automatically differentiated if needed.
- However, there are some notable shortcomings:
  - The Dual struct panics in its implementations of some standard functions that are quite common in engineering.
  - num-dual has a required dependency on nalgebra, which is quite a heavy dependency for those who do not need it.
§autodj
- Can only differentiate functions written using custom types, such as DualF64.
- Multivariate functions, especially those with a dynamic number of variables, can be extremely clunky (see this example).
§autodiff
- Can only differentiate functions written using the custom type FT<T>.
- Incorrect implementation of certain functions, such as num_traits::Float::floor (see the source code).
§Central Difference Approximations
| Derivative Type | Function Type | Function to Approximate Derivative |
|---|---|---|
| derivative | $f:\mathbb{R}\to\mathbb{R}$ | central_difference::sderivative() |
| derivative | $\mathbf{f}:\mathbb{R}\to\mathbb{R}^{m}$ | central_difference::vderivative() |
| partial derivative | $f:\mathbb{R}^{n}\to\mathbb{R}$ | central_difference::spartial_derivative() |
| partial derivative | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | central_difference::vpartial_derivative() |
| gradient | $f:\mathbb{R}^{n}\to\mathbb{R}$ | central_difference::gradient() |
| directional derivative | $f:\mathbb{R}^{n}\to\mathbb{R}$ | central_difference::directional_derivative() |
| Jacobian | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | central_difference::jacobian() |
| Hessian | $f:\mathbb{R}^{n}\to\mathbb{R}$ | central_difference::shessian() |
| Hessian | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | central_difference::vhessian() |
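The central difference functions in the table above are based on the second-order approximation $f'(x)\approx\frac{f(x+h)-f(x-h)}{2h}$. A minimal scalar sketch in plain Rust (an illustration of the formula, not this crate's generic implementation):

```rust
/// Central difference approximation of f'(x) with step size h.
/// Truncation error is O(h^2).
fn central_sderivative(f: impl Fn(f64) -> f64, x: f64, h: f64) -> f64 {
    (f(x + h) - f(x - h)) / (2.0 * h)
}

fn main() {
    // d/dx x^5 at x = 5 is 5 * 5^4 = 3125.
    let deriv = central_sderivative(|x| x.powi(5), 5.0, 1e-5);
    assert!((deriv - 3125.0).abs() < 1e-3);
}
```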
§Forward Difference Approximations
| Derivative Type | Function Type | Function to Approximate Derivative |
|---|---|---|
| derivative | $f:\mathbb{R}\to\mathbb{R}$ | forward_difference::sderivative() |
| derivative | $\mathbf{f}:\mathbb{R}\to\mathbb{R}^{m}$ | forward_difference::vderivative() |
| partial derivative | $f:\mathbb{R}^{n}\to\mathbb{R}$ | forward_difference::spartial_derivative() |
| partial derivative | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | forward_difference::vpartial_derivative() |
| gradient | $f:\mathbb{R}^{n}\to\mathbb{R}$ | forward_difference::gradient() |
| directional derivative | $f:\mathbb{R}^{n}\to\mathbb{R}$ | forward_difference::directional_derivative() |
| Jacobian | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | forward_difference::jacobian() |
| Hessian | $f:\mathbb{R}^{n}\to\mathbb{R}$ | forward_difference::shessian() |
| Hessian | $\mathbf{f}:\mathbb{R}^{n}\to\mathbb{R}^{m}$ | forward_difference::vhessian() |
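Analogously, the forward difference functions use the first-order approximation $f'(x)\approx\frac{f(x+h)-f(x)}{h}$, which requires one fewer function evaluation per derivative than central differences but has O(h) truncation error. A minimal scalar sketch (again an illustration of the formula, not the crate's generic implementation):

```rust
/// Forward difference approximation of f'(x) with step size h.
/// Truncation error is O(h), but only one extra function evaluation is needed.
fn forward_sderivative(f: impl Fn(f64) -> f64, x: f64, h: f64) -> f64 {
    (f(x + h) - f(x)) / h
}

fn main() {
    // d/dx sin(x) at x = 0 is cos(0) = 1.
    let deriv = forward_sderivative(|x| x.sin(), 0.0, 1e-7);
    assert!((deriv - 1.0).abs() < 1e-6);
}
```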
Modules§
- Central difference approximations.
- Constants used for numerical differentiation.
- Forward difference approximations.
Macros§
- Get a function that returns the gradient of the provided multivariate, scalar-valued function.
- Get a function that returns the derivative of the provided univariate, scalar-valued function.
- Get a function that returns the derivative of the provided univariate, vector-valued function.
Structs§
- First-order dual number.
Traits§
- Trait to create a vector of dual numbers.
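To illustrate what the first-order dual number struct above enables, here is a self-contained sketch of forward-mode automatic differentiation with dual numbers (a simplified, hypothetical version; not this crate's actual struct). A value $a+b\varepsilon$ with $\varepsilon^{2}=0$ carries a derivative alongside each computation:

```rust
use std::ops::{Add, Mul};

/// First-order dual number re + eps*ε, where ε² = 0.
/// (Simplified illustration; not this crate's actual Dual struct.)
#[derive(Clone, Copy, Debug, PartialEq)]
struct Dual {
    re: f64,  // function value
    eps: f64, // derivative value
}

impl Add for Dual {
    type Output = Dual;
    fn add(self, rhs: Dual) -> Dual {
        Dual { re: self.re + rhs.re, eps: self.eps + rhs.eps }
    }
}

impl Mul for Dual {
    type Output = Dual;
    fn mul(self, rhs: Dual) -> Dual {
        // (a + bε)(c + dε) = ac + (ad + bc)ε, since ε² = 0 (the product rule).
        Dual { re: self.re * rhs.re, eps: self.re * rhs.eps + self.eps * rhs.re }
    }
}

fn main() {
    // Differentiate f(x) = x² + x at x = 3 by seeding the dual part with 1.
    let x = Dual { re: 3.0, eps: 1.0 };
    let fx = x * x + x;
    assert_eq!(fx.re, 12.0); // f(3)  = 9 + 3 = 12
    assert_eq!(fx.eps, 7.0); // f'(3) = 2*3 + 1 = 7
}
```

Unlike the finite difference approximations above, this evaluates the derivative exactly (to floating-point precision), with no step size to tune.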