Intricate
A GPU accelerated library that creates/trains/runs neural networks in safe Rust code.
Architecture overview
Intricate's layout is very similar to that of popular libraries such as Keras.
At the surface it consists of a Model, which is made up of Layers. The Layers are adjusted according to a Loss Function, with the help of an Optimizer.
Models
As said before, and similar to Keras, Intricate defines a Model as basically a list of Layers.
A Model does not have much logic of its own and delegates most of the work to its Layers. All it does is orchestrate how the Layers work together and how data flows from one Layer to the next.
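The orchestration described above can be sketched in plain Rust. The `SketchLayer` trait, the `Scale` layer and the `SketchModel` struct are hypothetical names used only for illustration. They are not Intricate's actual API, which works on OpenCL buffers rather than on `Vec<f32>`.

```rust
// Hypothetical stand-in for a layer: NOT Intricate's real trait.
trait SketchLayer {
    fn propagate(&self, inputs: &[f32]) -> Vec<f32>;
}

// A toy layer that multiplies every input by a constant factor.
struct Scale {
    factor: f32,
}

impl SketchLayer for Scale {
    fn propagate(&self, inputs: &[f32]) -> Vec<f32> {
        inputs.iter().map(|x| x * self.factor).collect()
    }
}

// The model only owns the layers and passes data from one to the next.
struct SketchModel {
    layers: Vec<Box<dyn SketchLayer>>,
}

impl SketchModel {
    fn predict(&self, inputs: &[f32]) -> Vec<f32> {
        let mut current = inputs.to_vec();
        for layer in &self.layers {
            current = layer.propagate(&current);
        }
        current
    }
}
```

Note that `SketchModel` contains no math of its own: every computation lives in a layer, which is the division of labour the paragraph above describes.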
Layers
Every layer receives inputs and returns outputs following some rule that it must define.
Each layer must also implement four methods that together constitute backpropagation:
- optimize_parameters
- compute_gradients
- apply_gradients
- compute_loss_to_input_derivatives
In most cases optimize_parameters relies on an Optimizer, which tries to improve the parameters that the Layer allows it to optimize.
These methods are called in sequence to do backpropagation in the Model. The results of
compute_loss_to_input_derivatives are then used to do the same for the previous layer,
and so on until the first layer is reached.
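As a rough illustration of that backward walk, here is a CPU-only sketch with a one-weight layer. The method names mirror the ones listed above, but the signatures, the `ScalarLayer` type, the `back_propagate` function and the exact call order are assumptions made for this example. The optimize_parameters step is left out for brevity, and Intricate's real methods operate on OpenCL buffers.

```rust
// Hypothetical layer computing output = weight * input for a single value.
struct ScalarLayer {
    weight: f32,
    last_input: f32,
    gradient: f32,
}

impl ScalarLayer {
    // Forward pass; remembers the input because the gradient needs it.
    fn propagate(&mut self, input: f32) -> f32 {
        self.last_input = input;
        self.weight * input
    }

    // d(loss)/d(weight) = d(loss)/d(output) * input
    fn compute_gradients(&mut self, loss_to_output: f32) {
        self.gradient = loss_to_output * self.last_input;
    }

    // d(loss)/d(input) = d(loss)/d(output) * weight
    fn compute_loss_to_input_derivatives(&self, loss_to_output: f32) -> f32 {
        loss_to_output * self.weight
    }

    // Plain gradient descent step on the single weight.
    fn apply_gradients(&mut self, learning_rate: f32) {
        self.weight -= learning_rate * self.gradient;
    }
}

// Walk the layers from last to first, handing each layer's
// loss-to-input derivative to the layer before it.
fn back_propagate(layers: &mut [ScalarLayer], loss_to_output: f32, learning_rate: f32) {
    let mut derivative = loss_to_output;
    for layer in layers.iter_mut().rev() {
        layer.compute_gradients(derivative);
        // Computed before the weight changes, so it uses the old weight.
        let next = layer.compute_loss_to_input_derivatives(derivative);
        layer.apply_gradients(learning_rate);
        derivative = next;
    }
}
```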
These layers can be really any type of transformation from inputs to outputs. For example, the activation functions in Intricate are layers of their own instead of being fused into other layers, which simplifies the calculations considerably.
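To show why a stand-alone activation layer keeps the math simple, here is a small CPU sketch of a tanh layer. `TanHSketch` and both method signatures are hypothetical and are not taken from Intricate's source. The point is only that the layer needs nothing but its own inputs and the derivatives handed back to it.

```rust
// Hypothetical sketch: a stand-alone tanh activation layer on the CPU.
struct TanHSketch;

impl TanHSketch {
    // Forward pass: apply tanh to every input.
    fn propagate(&self, inputs: &[f32]) -> Vec<f32> {
        inputs.iter().map(|x| x.tanh()).collect()
    }

    // Backward pass, using the chain rule:
    // d(loss)/d(input) = d(loss)/d(output) * (1 - tanh(input)^2).
    fn compute_loss_to_input_derivatives(
        &self,
        inputs: &[f32],
        loss_to_output_derivatives: &[f32],
    ) -> Vec<f32> {
        inputs
            .iter()
            .zip(loss_to_output_derivatives)
            .map(|(x, d)| d * (1.0 - x.tanh().powi(2)))
            .collect()
    }
}
```

Because the activation has no parameters, a layer like this has nothing to optimize. It only transforms values on the way forward and derivatives on the way back.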
Optimizers
Optimizers do just what you might think: they optimize.
Specifically, they optimize both the parameters a Layer allows them to optimize and the Layer's gradients, so that the Layer can then apply the optimized gradients to itself.
This is useful because anyone using Intricate can develop, and perhaps debug, an Optimizer to see how well it does
for certain use cases, which fits where I want Intricate to go. All you have to do is create a struct
that implements the Optimizer trait.
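The idea of plugging in your own optimizer can be sketched as follows. The `SketchOptimizer` trait, its `compute_update` method, the `FixedRate` struct and the `apply_update` helper are all invented for this example. Intricate's real Optimizer trait has a different signature and works on OpenCL buffers.

```rust
// Hypothetical trait: NOT the signature of Intricate's `Optimizer` trait.
trait SketchOptimizer {
    // Turn raw gradients into the update that should be applied.
    fn compute_update(&self, gradients: &[f32]) -> Vec<f32>;
}

// Plain gradient descent: scale every gradient by a fixed learning rate.
struct FixedRate {
    learning_rate: f32,
}

impl SketchOptimizer for FixedRate {
    fn compute_update(&self, gradients: &[f32]) -> Vec<f32> {
        gradients.iter().map(|g| g * self.learning_rate).collect()
    }
}

// A layer would then subtract the optimized update from its parameters.
fn apply_update(parameters: &mut [f32], update: &[f32]) {
    for (p, u) in parameters.iter_mut().zip(update) {
        *p -= u;
    }
}
```

Swapping in a different strategy, for example one that keeps a running momentum, would only mean writing another struct that implements the same trait.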
Loss Functions
Loss Functions are simply implementations of a certain trait that are used to determine how bad a Model is.
Loss Functions are NOT used inside a Layer; they are used by the Model itself. Even though a Layer uses derivatives with respect to the loss, it does not communicate with the Loss Function directly.
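As a concrete example of "how bad a Model is", here is a minimal mean squared error sketch on the CPU. The two function names are hypothetical and the code is not taken from Intricate. It shows the two things a loss function has to provide: a loss value for the Model, and the derivatives that are handed to the last Layer to start backpropagation.

```rust
// Mean squared error over one sample: the average of (output - expected)^2.
fn mean_squared_loss(outputs: &[f32], expected: &[f32]) -> f32 {
    let sum: f32 = outputs
        .iter()
        .zip(expected)
        .map(|(o, e)| (o - e).powi(2))
        .sum();
    sum / outputs.len() as f32
}

// Derivative of the loss with respect to each output: 2 * (output - expected) / n.
fn mean_squared_loss_derivatives(outputs: &[f32], expected: &[f32]) -> Vec<f32> {
    let n = outputs.len() as f32;
    outputs
        .iter()
        .zip(expected)
        .map(|(o, e)| 2.0 * (o - e) / n)
        .collect()
}
```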
XoR using Intricate
If you look at the examples/ in the repository
you will find XoR implemented using Intricate.
The following is basically just that example with some separate explanation.
Setting up the training data
let training_inputs = vec!;
let expected_outputs = vec!;
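For reference, the XOR truth table that this training data encodes can be written out as below. The nested `Vec<f32>` layout (one inner vector per sample) and the `xor_training_data` helper are assumptions made for illustration and are not necessarily the exact shapes Intricate expects.

```rust
// XOR truth table: one inner Vec per sample (an assumed layout).
fn xor_training_data() -> (Vec<Vec<f32>>, Vec<Vec<f32>>) {
    let training_inputs = vec![
        vec![0.0, 0.0],
        vec![0.0, 1.0],
        vec![1.0, 0.0],
        vec![1.0, 1.0],
    ];
    // The output is 1.0 only when exactly one of the two inputs is 1.0.
    let expected_outputs = vec![vec![0.0], vec![1.0], vec![1.0], vec![0.0]];
    (training_inputs, expected_outputs)
}
```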
Setting up the layers
use ;
let mut layers: = vec!;
Creating the model with the layers
use Model;
// Instantiate our model using the layers
let mut xor_model = new;
We make the model mut because we will call fit to train it,
which tunes each of the layers when necessary.
Setting up OpenCL's state
Since Intricate uses OpenCL under the hood for its calculations,
we need to initialize an OpenCLState, which is just a struct
holding the OpenCL state that Intricate needs:
use
// you can change this device type to GPU if you want
let opencl_state = setup_opencl.unwrap;
For our Model to be able to actually do computations, we need to pass the OpenCL state
into the init method inside of the Model as follows:
xor_model.init.unwrap;
Fitting our model
For training our Model we just need to call the fit
method and pass in some parameters as follows:
use ;
let mut loss = new;
let mut optimizer = new;
// Fit the model however many times we want
xor_model
.fit
.unwrap;
As you can see, creating and training these models takes only a few lines of code, and since the calculations run through OpenCL, training is fast as well.
How to save and load models
For saving and loading models Intricate uses the savefile crate which makes it very simple and fast to save models.
Saving the model
As an example let's try saving and loading our XoR model.
To do that, we first need to sync all of the relevant layer information
of the Model from the OpenCL buffers to the host (that is, to the CPU side), and then
call the save_file method as follows:
xor_model.sync_data_from_buffers_to_host.unwrap; // sends the weights and biases from
// OpenCL buffers to Rust Vec's
save_file.unwrap;
Loading the model
As for loading our XoR model, we just need to call the
counterpart of the save_file method: load_file.
let mut loaded_xor_model: Model = load_file.unwrap;
Now of course, the savefile crate cannot load the data onto the GPU, so if you want
to use the Model after loading it, you must call the init method on the loaded_xor_model
(done in examples/xor.rs).
Things to be done still
- separate Intricate into more than one crate so as to make development more lightweight with rust-analyzer
- implement convolutional layers and perhaps even solve some image classification problems in an example
- add an optional feature to Intricate that would contain preloaded datasets, such as MNIST and others
- add a way to pass a callback closure into the training process that would be called every time an epoch finishes, or even every step, with some useful info
- make an example after doing the thing above ^, that uses that same callback to plot the loss in real time using a crate like textplots
- add embedding layers for text such as bag of words with an expected vocabulary size
- add optimizers to make Intricate actually be able to solve some problems