Minidx helps you implement small to medium-sized neural networks.
§Defining network architecture
In minidx, you define your network using tuples of layers, with the dimensionality of inputs/outputs given as generic constants. For instance, the example below defines a network which takes 2 inputs and produces 3 outputs: the input passes through two hidden layers of width 3 with ReLU activations, followed by a softmax layer.
use minidx::prelude::*;
use layers::*;
type network = (
    (Linear::<2, 3>, Relu), // Fully-connected + bias layer with ReLU activation
    (Linear::<3, 3>, Relu),
    Softmax,
);
// Instantiates our neural network.
let mut network = Buildable::<f32>::build(&network::default());
You can see the full set of implemented layers in the layer_spec module.
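For example, a slightly wider classifier can be composed from the same building blocks. The sketch below is purely illustrative and only assumes the Linear, Relu, and Softmax layers already used above:

// Illustrative only: 4 inputs, two hidden layers of width 8, 2 outputs,
// composed in the same (Linear, activation) pattern as the example above.
type wide_network = (
    (Linear::<4, 8>, Relu), // 4 inputs -> hidden width 8
    (Linear::<8, 8>, Relu),
    (Linear::<8, 2>, Relu), // hidden width 8 -> 2 outputs
    Softmax,
);
let mut wide = Buildable::<f32>::build(&wide_network::default());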
§Random initialization of a network
Before training, you likely want to initialize the parameters of the network to reasonable random values.
use rand::{SeedableRng, rngs::SmallRng};
let mut rng = SmallRng::seed_from_u64(42);
network.rand_params(&mut rng, 0.5).unwrap();
rand_params performs sensible initialization of each layer using the given RNG. The float argument is the maximum magnitude of the random parameters; 0.5 to 1.0 is a good starting value.
§Training
Training a network in minidx requires two things:
- An updater: some object that stores training state and implements the optimizer algorithm you want to use
- A training loop: a loop where you call train_step or train_batch with network inputs and their correct outputs, and a closure that wires up the loss function you want to use.
// initialize training state
let mut updater = network.new_momentum(
    TrainParams::with_lr(1.0e-5).and_l2(1.0e-6), 0.4);

// train the network with 50 examples
for _i in 0..50 {
    // fake training data
    let input = [1.0, 2.0];
    let output = [1.0, 0.0, 0.0];

    // train on an individual input/output pair, using the
    // mean-square error (MSE) loss function.
    use loss::DiffLoss;
    train_step(
        &mut updater,
        &mut network,
        |got, want| (got.mse(want), got.mse_input_grads(want)),
        input,
        output,
    );
}
Everything is fairly self-explanatory except for the closure you pass as your loss function. That closure receives the network's output and the expected output, and must return both the loss and the gradient of the loss with respect to the network's output. The prelude::loss module contains implemented loss functions and corresponding methods to compute their gradients.
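To make that contract concrete, the standalone sketch below shows what an MSE loss and its gradient look like for a 3-element output. It is purely illustrative; in practice got.mse(want) and got.mse_input_grads(want) from the DiffLoss trait compute these quantities for you, as in the loop above.

// Illustrative re-implementation of MSE and its gradient; not the library's code.
fn mse(got: &[f32; 3], want: &[f32; 3]) -> f32 {
    got.iter()
        .zip(want)
        .map(|(g, w)| (g - w).powi(2))
        .sum::<f32>()
        / 3.0
}

fn mse_grads(got: &[f32; 3], want: &[f32; 3]) -> [f32; 3] {
    let mut grads = [0.0f32; 3];
    for (i, (g, w)) in got.iter().zip(want).enumerate() {
        // d(MSE)/d(got_i) = 2 * (got_i - want_i) / N
        grads[i] = 2.0 * (g - w) / 3.0;
    }
    grads
}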
It's also worth noting that there are batch and threaded-batch variants of train_step, namely train_batch and train_batch_parallel. Both batch training methods return the average loss over the samples.
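If train_batch follows the same argument order as train_step but takes slices of input/output pairs, a minibatch update might look like the sketch below. That argument layout is an assumption; check the function's documentation for the exact signature.

// Sketch only: the slice-based argument layout of train_batch is assumed,
// not taken from its documentation.
use loss::DiffLoss;
let inputs = [[1.0, 2.0]; 4];        // hypothetical minibatch of 4 inputs
let outputs = [[1.0, 0.0, 0.0]; 4];  // matching expected outputs
let avg_loss = train_batch(
    &mut updater,
    &mut network,
    |got, want| (got.mse(want), got.mse_input_grads(want)),
    &inputs,
    &outputs,
);
println!("average loss over the minibatch: {avg_loss}");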
§Inference
You can run inference over a trained network using forward():
let output = network.forward(&[1.0, 2.0]).unwrap(); // outputs [f32; 3]
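For a classifier like the one above, you often want the index of the largest softmax output. That is plain Rust over the returned array, nothing minidx-specific:

// Find the highest-probability class in the [f32; 3] output.
let (class, prob) = output
    .iter()
    .enumerate()
    .max_by(|a, b| a.1.partial_cmp(b.1).unwrap())
    .unwrap();
println!("predicted class {class} with probability {prob}");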
Networks can be loaded and stored using LoadableModule.
Re-exports§
pub use minidx_core as core;
Modules§
- layer_spec - Descriptors of different neural layers which can be composed into a network.
- prelude - Common types and traits needed when using minidx.
- problem - Toy functions that neural networks can learn.
- recorder
Structs§
- OneHotEncoder - OneHotEncoder describes the encoding of some integer value modulo N into a vector where exactly one value is set.
Traits§
- Buildable - A layer or composition of layers that can be constructed, using some Dtype as the element type.
Functions§
- train_batch - Does a training minibatch, updating a network based on averaged gradients from computing N input-output pairs.
- train_batch_parallel - Parallel version of train_batch. More threads is not necessarily faster.
- train_step - Does a training step, updating a network using a pair of inputs and outputs.