Crate mdarray

source ·
Expand description

Multidimensional array for Rust

Overview

The mdarray crate provides a multidimensional array for Rust. Its main target is for numeric types, however generic types are supported as well. The purpose is to provide a generic container type that is simple and flexible to use, with interworking to other crates for e.g. BLAS/LAPACK functionality.

Here are the main features of mdarray:

  • Dense array type, where the rank is known at compile time.
  • Subarrays (views) can be created with arbitrary shapes and strides.
  • Standard Rust mechanisms are used for e.g. slices, indexing and iteration.
  • Generic expressions for multidimensional iteration.

The design is inspired from the Rust ndarray, nalgebra and bitvec crates, the proposed C++ mdarray and mdspan types, and multidimensional arrays in Julia and Matlab.

Array types

The base type for multidimensional arrays is Array<B>, where the generic parameter is a buffer for the array storage. The following variants exist:

  • Array<GridBuffer> is a dense array that owns the storage, similar to the Rust Vec type.
  • Array<ViewBuffer> and Array<ViewBufferMut> are arrays that refer to a parent array. They are used for example when creating a view of a larger array without duplicating elements.
  • Array<SpanBuffer> is used as a generic array reference, similar to the Rust slice type. It consists of a pointer to an internal structure that holds the storage and the layout mapping. Arrays and array views can be dereferenced to an array span.

The layout mapping describes how elements are stored in memory. The mapping is parameterized by the rank (i.e. the number of dimensions) and the array layout. It contains the shape (i.e. the size in each dimension), and the strides per dimension if needed.

The array layout is Dense if elements are stored contiguously without gaps. In this case, the strides are calculated from the shape and not stored as part of the layout. The layout is General if each dimension can have arbitrary stride except for the innermost one, which has unit stride. It is compatible with the BLAS/LAPACK general matrix storage.

The layout is Flat if the innermost dimension can have arbitrary stride and the other dimensions must follow in order, allowing for linear indexing. The layout is Strided if all dimensions can have arbitrary strides.

The array elements are stored in column-major order, also known as Fortran order where the first dimension is the innermost one.

The following type aliases are provided:

  • Grid<T, const N: usize, A = Global> for a dense array
  • Span<T, const N: usize, F = Dense> for an array span
  • View<T, const N: usize, F = Dense> for an array view
  • ViewMut<T, const N: usize, F = Dense> for a mutable array view

Prefer using Span instead of array views for function parameters, since they can refer to either an owned array or an array view. Array views are useful for example when lifetimes need to be maintained in function return types.

Indexing and views

Scalar indexing is done using the normal square-bracket index operator and an array of usize per dimension as index.

If the array layout supports linear indexing (i.e. the layout is Dense or Flat), a scalar usize can also be used as index. If the layout is Dense, a range can be used to select a slice.

If linear or slice indexing is possible but the array layout is not known, remap, remap_mut and into_mapping can be used to change layout. Alternatively, flatten, flatten_mut and into_flattened can be used to change to a one-dimensional array.

An array view can be created with the view and view_mut methods, which take indices per dimension as arguments. Each index can be either a range or usize. The resulting array layout depends on both the layout inferred from the indices and the input layout.

For two-dimensional arrays, a view of one column or row can be created with the col, col_mut, row and row_mut methods, and a view of the diagonal with diag and diag_mut.

Iteration

An iterator can be created from an array with the iter, iter_mut and into_iter methods like for Vec and slice.

Expressions are similar to iterators, but have consistency checking of shapes and support multidimensional iteration more efficiently. An expression is created with the expr, expr_mut and into_expr methods.

An expression consists of a base type Expression<P>, where the generic parameter is a tree of producer nodes. The base type has several methods for evaluating expressions or converting into other expressions, such as eval, for_each and map.

Two expressions can be merged to an expression of tuples with the zip method or free function. When merging expressions, if the rank differs the expression with the lower rank is broadcast into the larger shape by adding a number of outer dimensions. It is not possible to broadcast mutable arrays or when moving elements out of an array.

For multidimensional arrays, iteration over a single dimension can be done with outer_expr, outer_expr_mut, axis_expr and axis_expr_mut. The resulting expressions give array views of the remaining dimensions.

It is also possible to iterate over all except one dimension with cols, cols_mut, lanes, lanes_mut, rows and rows_mut.

Operators

Arithmetic, logical, negation, comparison and compound assignment operators are supported for arrays and expressions.

If at least one of the inputs is an array that is passed by value, the input buffer is reused for the result. Otherwise, if all input parameters are array references or expressions, a new array is created for the result. In the latter case, the result may have a different element type.

For comparison operators, the parameters must always be arrays that are passed by reference. For compound assignment operators, the first parameter is always a mutable reference to an array where the result is stored.

Scalar parameters must passed using the fill function that wraps a value in an Expression<Fill<T>> expression. If a type does not implement the Copy trait, the parameter must be passed by reference.

Note that for complex calculations, it can be more efficient to use expressions and element-wise operations to reduce memory accesses and allocations.

Example

This example implements matrix multiplication and addition C = A * B + C. The matrices use column-major ordering, and the inner loop runs over one column in A and C. By using iterator-like expressions the array bounds checking is avoided, and the compiler is able to vectorize the inner loop.

use mdarray::{grid, view, Grid, Span, View};

fn matmul(a: &Span<f64, 2>, b: &Span<f64, 2>, c: &mut Span<f64, 2>) {
    for (mut cj, bj) in c.cols_mut().zip(b.cols()) {
        for (ak, bkj) in a.cols().zip(bj) {
            for (cij, aik) in cj.expr_mut().zip(ak) {
                *cij = aik.mul_add(*bkj, *cij);
            }
        }
    }
}

let a = view![[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]];
let b = view![[0.0, 1.0], [1.0, 1.0]];

let mut c = grid![[0.0; 3]; 2];

matmul(&a, &b, &mut c);

assert_eq!(c, view![[4.0, 5.0, 6.0], [5.0, 7.0, 9.0]]);

Modules

  • Buffer module for array storage.
  • Expression module, for multidimensional iteration.
  • Module for array span and view indexing, and for array axis subarray types.
  • Array layout mapping module.

Macros

  • Creates a dense multidimensional array containing the arguments.
  • Creates a multidimensional array view containing the arguments.

Structs

  • Multidimensional array type with static rank.
  • Type-level constant.
  • Dense array layout type.
  • Expression type, for multidimensional iteration.
  • Flat array layout type.
  • General array layout type.
  • Multidimensional array iterator type.
  • Range constructed from a unit spaced range with the given step size.
  • Strided array layout type.

Traits

  • Trait for applying a closure and returning a new or an existing array.
  • Array dimension trait, for rank and types for shape and strides.
  • Trait for generalization of Clone that can reuse an existing object.
  • Conversion trait into an expression.
  • Array memory layout trait.
  • Array shape trait.
  • Array strides trait.
  • Trait for layout types with uniform stride.
  • Trait for layout types with unit inner stride.

Functions

  • Creates a range with the given step size from a unit spaced range.

Type Aliases

  • Dense multidimensional array.
  • Multidimensional array span.
  • Multidimensional array view.
  • Mutable multidimensional array view.