Module dfdx::tensor_ops
Operations on tensors like relu(), matmul(), softmax(), and more.
Generic function and struct methods
All functionality is provided in two ways:
- The generic standalone function that takes a generic parameter, e.g. relu().
- The struct method for tensor structs, e.g. crate::tensor::Tensor::relu().
The functions are all just pass-throughs to the tensor methods.
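For example, these two call styles produce the same result (a minimal sketch; the Cpu device setup here is just for illustration):
use dfdx::prelude::*;
let dev: Cpu = Default::default();
let t: Tensor<Rank1<4>, f32, _> = dev.zeros();
// struct method form
let a = t.clone().relu();
// standalone function form
let b = relu(t);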
Fallibility
All tensor methods also have a try_* variant, like crate::tensor::Tensor::relu() and crate::tensor::Tensor::try_relu(). These methods return a Result, where the error in most cases indicates an allocation error.
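For example, inside a function returning a Result, the try_* form lets you propagate the error instead of panicking (a sketch; dev is a device as in the other examples):
let t: Tensor<Rank1<4>, f32, _> = dev.zeros();
// panics if the operation fails
let a = t.clone().relu();
// surfaces the error to the caller instead
let b = t.try_relu()?;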
Axes/Dimensions for broadcasting/reductions/selecting
For the following sections, some traits/functions utilize a const isize to determine the axis to apply the transformation to.
Here are the valid axes for each tensor:
- 0d tensor: Axis<0>
- 1d tensor: Axis<0>
- 2d tensor: Axis<0>, Axis<1>
- 3d tensor: Axis<0>, Axis<1>, Axis<2>
- 4d tensor: Axis<0>, Axis<1>, Axis<2>, Axis<3>
- etc.
To specify multiple axes you can use Axes2, Axes3, and Axes4.
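For example, Axis<1> names the second axis of a 2d tensor (a quick sketch; dev is a device as in the examples below):
let t: Tensor<Rank2<2, 3>, f32, _> = dev.zeros();
// apply softmax across the last axis
let _ = t.softmax::<Axis<1>>();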
Reductions
There are a number of methods that reduce 1 or more axes. Anything that can be reduced can also be broadcast back to the original shape using BroadcastTo.
Each axis reducing function has two generic parameters:
- The target shape
- The axes to reduce along
You only need to specify one of these! Generally it is better practice to specify the target shape, unless it is ambiguous, in which case you should specify the axes.
For example:
let t: Tensor<Rank3<2, 4, 6>, f32, _> = dev.zeros();
// shape version
let _ = t.clone().sum::<Rank1<4>, _>();
// axes version
let _ = t.clone().sum::<_, Axes2<0, 2>>();
// typed version
let _: Tensor<Rank1<4>, _, _> = t.clone().sum();
Complete list of reductions: logsumexp, max, mean, min, stddev, sum, and var (see the reduction traits under Traits below).
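Reducing to a scalar or to a single axis follows the same two-parameter pattern (a short sketch):
let t: Tensor<Rank2<2, 3>, f32, _> = dev.zeros();
// reduce the last axis, keeping the first
let _ = t.clone().mean::<Rank1<2>, _>();
// reduce all axes down to a scalar
let _ = t.max::<Rank0, _>();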
Broadcasts
Broadcasting tensors is provided through the BroadcastTo trait. Similar to reductions there are two generic parameters to broadcast:
- (Required) The target shape
- (usually optional) The axes of the result type to broadcast
You’ll only need to specify axes if the shape makes the broadcasts ambiguous.
For example:
let t: Tensor<Rank1<4>, f32, _> = dev.zeros();
// shape version
let _ = t.clone().broadcast::<Rank3<2, 4, 6>, _>();
// typed version
let _: Tensor<Rank3<2, 4, 6>, _, _> = t.clone().broadcast();
Rust can also infer the output type if you use it in another operation:
let big: Tensor<Rank2<2, 5>, f32, _> = dev.zeros();
let small: Tensor<Rank1<5>, f32, _> = dev.zeros();
let _ = big + small.broadcast();
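A common pattern combines a reduction with BroadcastTo, e.g. subtracting a per-row mean (a sketch; the shapes are illustrative):
let t: Tensor<Rank2<2, 3>, f32, _> = dev.sample_normal();
// mean over the last axis, broadcast back, then subtract
let centered = t.clone() - t.mean::<Rank1<2>, _>().broadcast();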
Permutes
Permuting has an identical interface to broadcasts/reductions:
let t: Tensor<Rank3<2, 3, 4>, f32, _> = dev.zeros();
// shape version
let _ = t.clone().permute::<Rank3<3, 4, 2>, _>();
// axes version
let _ = t.clone().permute::<_, Axes3<1, 2, 0>>();
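Like broadcasts, the output type can also be inferred from an annotation (a sketch mirroring the typed broadcast example above):
// typed version
let _: Tensor<Rank3<3, 4, 2>, _, _> = t.clone().permute();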
Indexing using select and gather
Two traits provide indexing capability: SelectTo and GatherTo. The difference is:
- SelectTo::select allows you to select a single value
- GatherTo::gather allows you to select multiple values from the same axis.
For example you can select from the 0th axis like so:
let t = dev.tensor([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]);
let r: Tensor<Rank1<3>, f32, _> = t.select(dev.tensor(1));
assert_eq!(r.array(), [4.0, 5.0, 6.0]);
Or you can gather from the 0th axis to select multiple entries:
let t = dev.tensor([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]);
let r: Tensor<Rank2<3, 3>, f32, _> = t.gather(dev.tensor([1, 1, 0]));
assert_eq!(r.array(), [
[4.0, 5.0, 6.0],
[4.0, 5.0, 6.0],
[1.0, 2.0, 3.0],
]);
To select from anything after the 0th axis, you need a multi-dimensional index. See GatherTo and SelectTo docstrings for examples of this.
But you can use BroadcastTo to make this easy! In this example we select the same index from the 1st axis of a tensor:
let t = dev.tensor([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]);
let r = t.select::<Rank1<2>, _>(dev.tensor(1).broadcast());
assert_eq!(r.array(), [2.0, 5.0]);
Structs
- Configuration of hyperparameters for crate::optim::Adam.
- Upscales images using bilinear interpolation between a pixel’s neighbors.
- Upscales images using a pixel’s nearest neighbor.
- Configuration of hyperparameters for crate::optim::RMSprop.
- Configuration of hyperparameters for crate::optim::Sgd.
Enums
- Momentum used for crate::optim::Sgd and others
- L2 and decoupled regularization methods
Traits
- Runs backprop algorithm with all operations contained in the tape that t has.
- Broadcast self into a new shape.
- Choose values from two tensors using a boolean mask. Equivalent to torch.where from pytorch.
- A Storage that requires all the tensor ops implementations.
- Select multiple values from a single axis, replacing that dimension with a different one. Equivalent to torch.gather from pytorch.
- Reduction along multiple axes using LogSumExp.
- Reduction along multiple axes using max.
- Reduction along multiple axes using mean.
- Reduction along multiple axes using min.
- Changes order of dimensions/axes in a tensor.
- Realizes the concrete shape of the tensor as another compatible shape, or returns the original tensor if the new shape’s dimensions are incompatible.
- Changes the shape of a tensor without re-ordering axes. If the tensor is contiguous already, then no data movement will occur. If the tensor is not contiguous, the result of this will be contiguous.
- Shifts data along an axis by a specified amount.
- Select a single value from a single dimension, removing that dimension from the shape. Equivalent to torch.select from pytorch.
- Reduction along multiple axes using standard deviation.
- Reduction along multiple axes using sum.
- Fallible version of std::ops::Add. See add.
- Reshapes qkv + past_key + past_value into (q, k, v) as used in attention layers.
- TryConcat (deprecated): Concatenate two tensors along the first dimension.
- Concatenate two tensors along a given axis.
- Apply the 2d convolution to a tensor.
- Fallible version of std::ops::Div. See div.
- Fallible matrix multiplication. See matmul for examples.
- Fallible version of std::ops::Mul. See mul.
- Parametric Rectified Linear Unit (PReLU). max(0, lhs) + rhs*min(0, lhs)
- Stack an array or vec of tensors together along a new dimension.
- Fallible version of std::ops::Sub. See sub.
- Upscales an image to a new shape. Valid methods of upscaling are the bilinear and nearest-neighbor structs listed above.
- Reduction along multiple axes using variance.
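For instance, the boolean-mask trait works like torch.where: calling choose on a boolean tensor picks elements from two other tensors (a sketch based on the trait’s documented behavior):
let mask = dev.tensor([true, false, true]);
let a = dev.tensor([1.0, 2.0, 3.0]);
let b = dev.tensor([-1.0, -2.0, -3.0]);
// take from a where the mask is true, otherwise from b
let c = mask.choose(a, b);
assert_eq!(c.array(), [1.0, -2.0, 3.0]);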
Functions
- Absolute value (abs). |t|
- Accurate Gaussian Linear Unit (GeLU). This is defined as x * Phi(x) where Phi(x) is the cumulative distribution function of a standard normal distribution. This can be calculated via the Error Function erf(x) using 0.5 * x * (1.0 + erf(x / sqrt(2.0))).
- Element wise and scalar addition.
- Elementwise a * alpha + b * beta.
- Binary Cross Entropy With Logits in a numerically stable way.
- Element wise and scalar boolean ‘and’.
- Inverts each value in a boolean tensor.
- Element wise and scalar boolean ‘or’.
- Element wise and scalar boolean ‘xor’.
- Clamp all elements between the provided min and max values.
- Element wise and scalar division.
- Zeros elements with probability p and scales all elements by 1 / (1 - p).
- Element-wise equality comparison. ==
- Fast Gaussian Linear Unit (GeLU). A fast version of the gaussian linear unit, calculated with the tanh approximation 0.5 * x * (1.0 + tanh(sqrt(2.0 / PI) * (x + 0.044715 * x^3))).
- Element-wise greater than or equals comparison. >=
- gelu (deprecated): Use fast_gelu instead.
- Element-wise strictly greater than comparison. >
- Huber Loss uses absolute error when the error is higher than beta, and squared error when the error is lower than beta.
- Element-wise less than or equals comparison. <=
- Computes prelu, but with a scalar value. max(0, t) + a*min(0, t)
- Natural Logarithm (ln). log_e(t).
- log(softmax(t)) in a numerically stable way across Ax. Does t - logsumexp(t) under the hood.
- Applies a 2D lower triangular mask by setting values above the diagonal to E::default().
- Element-wise strictly less than comparison. <
- Matrix * Matrix, Vector * Matrix, Vector * Vector, and broadcasted/batched versions.
- Element wise maximum.
- Element wise minimum.
- Element wise and scalar multiplication.
- Replaces any std::f32::NAN with value.
- Element-wise inequality comparison. !=
- Negates all elements.
- Normalizes t to have mean 0.0 and stddev 1.0 along Ax. epsilon is used during stddev. Computes (t - t.mean(Ax)) / t.std(Ax, epsilon).
- Raises to a float power; t^i.
- Raises to an integer power; t^i.
- Parametric Rectified Linear Unit (PReLU). max(0, lhs) + rhs*min(0, lhs)
- Reciprocal. 1 / x
- Rectified Linear Unit (ReLU). max(0, t)
- Sigmoid. 1 / (1 + exp(-t)).
- Slices all dimensions of a tensor, with the starting and ending indices of each dimension determined by a tuple of ranges.
- Computes the softmax function across Ax.
- Square root. √t or t^0.5
- Square. t^2
- Element wise and scalar subtraction.
- Copies the elements of a tensor, converting its data to a different dtype.
- Applies a 2D upper triangular mask by setting values below the diagonal to E::default().
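As a quick illustration of a few of these functions (a sketch in the same style as the examples above; the try_ variant follows the Fallibility section):
let t: Tensor<Rank2<2, 3>, f32, _> = dev.sample_normal();
// probabilities across the last axis
let p = t.clone().softmax::<Axis<1>>();
// clamp every element into [-1.0, 1.0]
let c = t.clone().clamp(-1.0, 1.0);
// fallible variant of the same op
let c2 = t.try_clamp(-1.0, 1.0)?;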