Skip to main content

Module network

Module network 

Source
Expand description

Neural network layers and network types (FP32, INT8, INT4).

Structs§

Conv2dLayer
2D convolution layer (FP32).
Conv2dLayerQ
INT8 quantized 2D convolution layer.
Conv2dLayerQ4
INT4 quantized 2D convolution layer.
FlattenLayer
Flatten layer that reshapes 4D tensors into 2D (FP32).
FlattenLayerQ
INT8 quantized flatten layer.
FlattenLayerQ4
INT4 quantized flatten layer.
LinearLayer
Fully-connected (linear) layer (FP32).
LinearLayerQ
INT8 quantized fully-connected (linear) layer.
LinearLayerQ4
INT4 quantized fully-connected (linear) layer.
MaxPool2dLayer
2D max pooling layer (FP32).
MaxPool2dLayerQ
INT8 quantized 2D max pooling layer.
MaxPool2dLayerQ4
INT4 quantized 2D max pooling layer.
NeuralNetwork
A FP32 neural network composed of sequential layers.
QuantizedNeuralNetwork
An INT8 quantized neural network.
QuantizedNeuralNetworkI4
An INT4 quantized neural network.
ReLuLayer
ReLU activation layer (FP32).
ReLuLayerQ
INT8 quantized ReLU activation layer.
ReLuLayerQ4
INT4 quantized ReLU activation layer.
SoftMaxLayer
Softmax activation layer (FP32).

Enums§

LayerType
Identifies the type of a neural network layer.

Traits§

Layer
Trait for FP32 neural network layers.
QuantizedLayer
Trait for INT8 quantized layers.
QuantizedLayerI4
Trait for INT4 quantized layers.