Crate juice

Juice is an open, modular and clearly designed Machine Intelligence Framework providing state-of-the-art performance for distributed (Deep|Machine) Learning, sharing concepts from TensorFlow and Caffe.

An important module in Juice is the backend-agnostic, high-performance computation framework Coaster, which combines performance and usability for Juice Networks. It allows you to run and deploy Juice Networks on servers, desktops or even mobile devices, using the full available computation power of GPUs or other CUDA/OpenCL-supported devices to train your Networks. If your machine does not have a GPU, or you do not want to install CUDA/OpenCL locally, Juice gracefully falls back to the native host CPU.
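
A minimal sketch of backend selection, assuming Coaster's prelude re-exports Backend, Native and Cuda and that the crate is built with the usual native/cuda Cargo features (both are assumptions based on the Juice examples and may differ between versions):

```rust
use std::rc::Rc;

use coaster::prelude::*;

// Prefer a CUDA backend when the crate is built with CUDA support...
#[cfg(feature = "cuda")]
fn backend() -> Rc<Backend<Cuda>> {
    Rc::new(Backend::<Cuda>::default().unwrap())
}

// ...and fall back to the native host CPU otherwise.
#[cfg(not(feature = "cuda"))]
fn backend() -> Rc<Backend<Native>> {
    Rc::new(Backend::<Native>::default().unwrap())
}

fn main() {
    let _backend = backend();
}
```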

Architecture

Juice’s Networks follow a compositional model: a collection of connected Layers performing operations on numerical data.

Layers, the building blocks of a Juice Network, are small units describing computation over numerical input data. Generally speaking, a Layer takes input and produces output, but it can describe any functionality, e.g. logging, as long as it obeys the general behaviour specification of a Layer. A Layer falls into one of four Layer types, which are defined in more detail on the Layers page. Every Layer serves a specific purpose and can occur zero, one or many times inside a Network.
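
For illustration, here is a hedged sketch of how individual Layers are described as plain configuration values, using LayerConfig, LayerType and LinearConfig as they appear in the Juice examples (exact field names may differ between versions):

```rust
use juice::layer::{LayerConfig, LayerType};
use juice::layers::LinearConfig;

fn main() {
    // A fully connected layer with 10 outputs. At this point it is only a
    // description of the computation; no memory or device is involved yet.
    let linear = LayerConfig::new("linear1", LinearConfig { output_size: 10 });

    // A parameter-free activation layer: input in, activated output out.
    let sigmoid = LayerConfig::new("sigmoid", LayerType::Sigmoid);

    println!("configured layers: {} and {}", linear.name, sigmoid.name);
}
```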

Juice uses Coaster’s SharedTensor, an N-dimensional array providing a unified memory interface over the actual data, with automatic synchronization between different devices (CUDA, OpenCL, host CPU). A SharedTensor stores the actual data flowing through the system as well as the weights required by some Layers. The data in a SharedTensor can be copied from backend to backend and used for computations on CUDA, OpenCL and the native host CPU. It provides performance optimizations and automatically takes care of memory management and synchronization.
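
A rough sketch of the SharedTensor workflow; the names follow coaster and the write_to_memory helper from juice::util, but exact signatures and memory types vary between versions:

```rust
use coaster::prelude::*;
use juice::util::write_to_memory;

fn main() {
    let backend = Backend::<Native>::default().unwrap();

    // A 2 x 3 tensor of f32. Device memory is allocated lazily, on demand.
    let mut x = SharedTensor::<f32>::new(&[2, 3]);

    // Request writable memory on the native device and fill it. Reading the
    // tensor later on another device (e.g. a CUDA GPU) triggers an automatic
    // copy between the two memories.
    write_to_memory(x.write_only(backend.device()).unwrap(), &[1.0f32; 6]);
}
```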

A neural network is created by combining container layers such as the Sequential layer. Containers can be nested, allowing larger neural networks to be constructed while still retaining the interface of a Layer.
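
Sketched with the Sequential container as used in the Juice examples (the names below are assumptions based on those examples), nesting might look like this:

```rust
use juice::layer::{LayerConfig, LayerType};
use juice::layers::{LinearConfig, SequentialConfig};

fn main() {
    // An inner block of layers...
    let mut block = SequentialConfig::default();
    block.add_layer(LayerConfig::new("linear", LinearConfig { output_size: 10 }));
    block.add_layer(LayerConfig::new("sigmoid", LayerType::Sigmoid));

    // ...nested inside an outer network. The container is itself a Layer,
    // so it can be added wherever a LayerConfig is expected.
    let mut net = SequentialConfig::default();
    net.add_input("data", &[32, 784]);
    net.add_layer(LayerConfig::new("block", block));
}
```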

The learning and optimization of the Network happens in the Solver, which is decoupled from the Network, keeping the setup clean and flexible. One of the four layer types is the Loss Layer, which handles the interaction between Network and Solver: the Network produces the loss and gradients, which the Solver uses to optimize the Network through parameter updates. Besides that, the Solver provides housekeeping and other evaluations of the Network. All operations on the Solver happen through Coaster and can therefore be executed on CUDA, OpenCL or the native host CPU as well.
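
A condensed sketch of the Solver setup, following the structure of the Juice MNIST example; field names such as minibatch_size, base_lr and momentum are taken from that example and may differ between versions, and some layers may require the CUDA backend depending on the version:

```rust
use std::rc::Rc;

use coaster::prelude::*;
use juice::layer::{LayerConfig, LayerType};
use juice::layers::{LinearConfig, NegativeLogLikelihoodConfig, SequentialConfig};
use juice::solver::{Solver, SolverConfig};

fn main() {
    let backend = Rc::new(Backend::<Native>::default().unwrap());
    let batch_size = 32;

    // The network: a single fully connected layer over 784 inputs.
    let mut net_cfg = SequentialConfig::default();
    net_cfg.add_input("data", &[batch_size, 784]);
    net_cfg.add_layer(LayerConfig::new("linear", LinearConfig { output_size: 10 }));
    net_cfg.add_layer(LayerConfig::new("log_softmax", LayerType::LogSoftmax));

    // The objective (loss) is itself a layer, connecting network output and labels.
    let mut loss_cfg = SequentialConfig::default();
    loss_cfg.add_input("network_out", &[batch_size, 10]);
    loss_cfg.add_input("label", &[batch_size, 1]);
    loss_cfg.add_layer(LayerConfig::new(
        "nll",
        LayerType::NegativeLogLikelihood(NegativeLogLikelihoodConfig { num_classes: 10 }),
    ));

    // The Solver owns the optimization settings, decoupled from the network.
    let mut solver_cfg = SolverConfig {
        minibatch_size: batch_size,
        base_lr: 0.01,
        momentum: 0.9,
        ..SolverConfig::default()
    };
    solver_cfg.network = LayerConfig::new("network", net_cfg);
    solver_cfg.objective = LayerConfig::new("loss", loss_cfg);

    let _solver = Solver::from_config(backend.clone(), backend.clone(), &solver_cfg);
    // Calling train_minibatch with data and label tensors would run one
    // forward/backward pass and apply the parameter update.
}
```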

Juice provides a robust and modular design that allows expressing almost any numerical computation, including SVMs, RNNs and other popular learning algorithms. We hope that Juice can help future research and production development alike, as it combines expressiveness, performance and usability.

Philosophy

We are strong believers in the technology of Machine Learning.

We put our experience in software engineering into Juice to solve our own need for a modern, performant and easy-to-use Deep Learning Framework. The following principles direct our decisions on Juice and related projects.

  • Cutting Edge Performance:
    For research and industry, speed and efficiency are crucial for state-of-the-art machine learning over massive data and networks.
  • Open and Expressive Architecture:
    Designing an open architecture that follows best practices and engineering concepts such as modularity, flexibility and expressiveness is critical to stimulate future innovation.
  • Clear and Transparent Documentation:
    Well-written documentation that addresses both concepts and implementation empowers developers and researchers to contribute their unique experience to the project for the benefit of everyone.

Development

The implementation of the various Layers is still fairly sparse at the moment.
Around a dozen missing Layers are particularly important and would increase the value and functionality of Juice tremendously.
Progress is tracked at

Modules

  • layer:
    Provides the generics and interfaces for the specific Layers.
  • layers:
    Provides the fundamental units of computation in a Neural Network.
  • solver:
    Provides the generics and interfaces for the specific Solvers.
  • solvers:
    Provides the trainers for the Layers.
  • util:
    Provides common utility functions.
  • weight:
    Provides configuration of weights and their initialization.

Macros

  • Macro helper to implement the activation trait (TODO: see common).
  • Macro to implement the common ILayer functionality (TODO: use Some(1) as a default implementation for this trait).
  • Macro helper for the default loss implementation.
  • Implements ISolver for SGD solvers.