
Hextral

A high-performance neural network library for Rust with comprehensive async/await support, advanced activation functions, multiple optimizers, and flexible architecture design.


Features

Core Architecture

  • Multi-layer perceptrons with configurable hidden layers
  • Batch normalization for improved training stability and convergence
  • Xavier weight initialization for stable gradient flow
  • Flexible network topology - specify any number of hidden layers and neurons
  • Full async/await support with intelligent yielding for non-blocking operations
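
The Xavier (Glorot) scheme mentioned above scales initial weights to each layer's fan-in and fan-out. As a reference, here is a minimal sketch of the standard uniform variant, using the rand crate (an assumption for this sketch, not a hextral dependency):

use rand::Rng;

// Xavier/Glorot uniform initialization (textbook form, for reference only;
// hextral's internal scheme may differ in detail):
// w ~ U(-sqrt(6 / (fan_in + fan_out)), +sqrt(6 / (fan_in + fan_out)))
fn xavier_uniform(fan_in: usize, fan_out: usize) -> f64 {
    let limit = (6.0 / (fan_in + fan_out) as f64).sqrt();
    rand::thread_rng().gen_range(-limit..limit)
}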

Activation Functions (9 Available)

  • ReLU - Rectified Linear Unit (good for most cases)
  • Sigmoid - Smooth activation for binary classification
  • Tanh - Hyperbolic tangent for centered outputs
  • Leaky ReLU - Prevents dying ReLU problem
  • ELU - Exponential Linear Unit for smoother gradients
  • Linear - For regression output layers
  • Swish - Modern activation with smooth derivatives
  • GELU - Gaussian Error Linear Unit used in transformers
  • Mish - Self-regularizing activation function
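
For reference, the three newest activations follow these standard definitions (textbook forms, not necessarily hextral's exact internals):

// Reference definitions only; hextral exposes these via ActivationFunction.
fn swish(x: f64, beta: f64) -> f64 {
    x / (1.0 + (-beta * x).exp()) // x * sigmoid(beta * x)
}

fn gelu(x: f64) -> f64 {
    // common tanh approximation used in transformer implementations
    0.5 * x * (1.0 + ((2.0 / std::f64::consts::PI).sqrt() * (x + 0.044715 * x.powi(3))).tanh())
}

fn mish(x: f64) -> f64 {
    x * (1.0 + x.exp()).ln().tanh() // x * tanh(softplus(x))
}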

Loss Functions (5 Available)

  • Mean Squared Error (MSE) - Standard regression loss
  • Mean Absolute Error (MAE) - Robust to outliers
  • Binary Cross-Entropy - Binary classification
  • Categorical Cross-Entropy - Multi-class classification
  • Huber Loss - Robust hybrid of MSE and MAE
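
Huber loss deserves a note since it blends the two regression losses: quadratic for small residuals, linear for large ones. A sketch of the standard per-element form (hextral's implementation may differ in detail):

// Standard Huber loss for a single residual (reference only).
fn huber(residual: f64, delta: f64) -> f64 {
    let a = residual.abs();
    if a <= delta {
        0.5 * residual * residual // quadratic near zero, like MSE
    } else {
        delta * (a - 0.5 * delta) // linear in the tails, like MAE
    }
}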

Optimization Algorithms

  • Adam - Adaptive moment estimation (recommended for most cases)
  • SGD - Stochastic Gradient Descent (simple and reliable)
  • SGD with Momentum - Accelerated gradient descent
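
For reference, the classical momentum update is what distinguishes the third option from plain SGD (a sketch of the textbook rule, not hextral internals):

// v <- momentum * v - learning_rate * grad;  w <- w + v
fn momentum_step(weight: &mut f64, velocity: &mut f64, grad: f64, lr: f64, momentum: f64) {
    *velocity = momentum * *velocity - lr * grad;
    *weight += *velocity;
}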

Regularization Techniques

  • L2 Regularization - Prevents overfitting by penalizing large weights
  • L1 Regularization - Encourages sparse networks and feature selection
  • Dropout - Randomly deactivates neurons during training
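
For reference, L1 and L2 add the standard penalty terms to the training loss (textbook forms; hextral applies them for you via set_regularization):

// Reference penalty terms (hextral computes these internally).
fn l2_penalty(weights: &[f64], lambda: f64) -> f64 {
    lambda * weights.iter().map(|w| w * w).sum::<f64>()
}

fn l1_penalty(weights: &[f64], lambda: f64) -> f64 {
    lambda * weights.iter().map(|w| w.abs()).sum::<f64>()
}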

Training & Evaluation

  • Flexible loss computation with configurable loss functions
  • Batch normalization with training/inference modes
  • Training progress tracking with loss history
  • Batch and single-sample prediction
  • Model evaluation metrics and loss computation
  • Dual sync/async API for both blocking and non-blocking operations

Async/Concurrent Processing

  • Async training methods with cooperative multitasking
  • Parallel batch prediction using futures
  • Intelligent yielding - only yields for large workloads (>1000 elements)
  • Concurrent activation function processing
  • Performance-optimized async implementation alongside synchronous methods

Quick Start

Add this to your Cargo.toml:

[dependencies]
hextral = "0.6.0"
nalgebra = "0.33"
tokio = { version = "1.0", features = ["full"] }  # For async features

Basic Usage

use hextral::{Hextral, ActivationFunction, Optimizer};
use nalgebra::DVector;

fn main() {
    // Create a neural network: 2 inputs -> [4, 3] hidden -> 1 output
    let mut nn = Hextral::new(
        2,                                    // Input features
        &[4, 3],                             // Hidden layer sizes  
        1,                                    // Output size
        ActivationFunction::ReLU,             // Activation function
        Optimizer::Adam { learning_rate: 0.01 }, // Optimizer
    );

    // Training data for XOR problem
    let inputs = vec![
        DVector::from_vec(vec![0.0, 0.0]),
        DVector::from_vec(vec![0.0, 1.0]),
        DVector::from_vec(vec![1.0, 0.0]),
        DVector::from_vec(vec![1.0, 1.0]),
    ];
    let targets = vec![
        DVector::from_vec(vec![0.0]),
        DVector::from_vec(vec![1.0]),
        DVector::from_vec(vec![1.0]),
        DVector::from_vec(vec![0.0]),
    ];

    // Train the network
    println!("Training network...");
    let _loss_history = nn.train(&inputs, &targets, 1.0, 100);

    // Make predictions
    for (input, expected) in inputs.iter().zip(targets.iter()) {
        let prediction = nn.predict(input);
        println!("Input: {:?} | Expected: {:.1} | Predicted: {:.3}", 
                 input.data.as_vec(), expected[0], prediction[0]);
    }

    // Evaluate performance
    let final_loss = nn.evaluate(&inputs, &targets);
    println!("Final loss: {:.6}", final_loss);
}

Async Usage

For high-performance applications requiring concurrent processing:

use hextral::{Hextral, ActivationFunction, Optimizer};
use nalgebra::DVector;

#[tokio::main]
async fn main() {
    // Create the same neural network
    let mut nn = Hextral::new(
        2, &[4, 3], 1,
        ActivationFunction::ReLU,
        Optimizer::Adam { learning_rate: 0.01 },
    );

    let inputs = vec![/* ... training data ... */];
    let targets = vec![/* ... target data ... */];

    // Async training with batch processing
    let _loss_history = nn.train_async(&inputs, &targets, 1.0, 100, Some(32)).await;

    // Async batch predictions (parallel processing)
    let _predictions = nn.predict_batch_async(&inputs).await;
    
    // Async evaluation
    let final_loss = nn.evaluate_async(&inputs, &targets).await;
    
    println!("Async training completed with final loss: {:.6}", final_loss);
}
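
Because training yields cooperatively, it can overlap with other tasks on the same runtime. A minimal sketch reusing nn, inputs, and targets from the example above (the second branch is arbitrary placeholder work):

// Run training concurrently with unrelated async work.
let (_loss_history, _) = tokio::join!(
    nn.train_async(&inputs, &targets, 1.0, 100, Some(32)),
    async {
        // other tasks keep making progress while training yields
        tokio::time::sleep(std::time::Duration::from_millis(100)).await;
    }
);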

Key Benefits of Async API:

  • Non-blocking operations - Other tasks can run during training/inference
  • Intelligent yielding - Only yields for large workloads (>1000 elements or >10 batches)
  • Better resource utilization - Cooperative multitasking optimized for performance
  • Scalable architecture - Ideal for web services and concurrent applications
  • Parallel batch processing - Multiple predictions processed concurrently using futures

Loss Functions

Configure different loss functions for your specific task:

use hextral::{Hextral, LossFunction, ActivationFunction, Optimizer};

let mut nn = Hextral::new(2, &[4], 1, ActivationFunction::ReLU, Optimizer::default());

// For regression tasks
nn.set_loss_function(LossFunction::MeanSquaredError);
nn.set_loss_function(LossFunction::MeanAbsoluteError);
nn.set_loss_function(LossFunction::Huber { delta: 1.0 });

// For classification tasks
nn.set_loss_function(LossFunction::BinaryCrossEntropy);
nn.set_loss_function(LossFunction::CategoricalCrossEntropy);

Batch Normalization

Enable batch normalization for improved training stability:

use hextral::{Hextral, ActivationFunction, Optimizer};

let mut nn = Hextral::new(10, &[64, 32], 1, ActivationFunction::ReLU, Optimizer::default());

// Enable batch normalization
nn.enable_batch_norm();

// Set training mode
nn.set_training_mode(true);

// Train your network...
let loss_history = nn.train(&inputs, &targets, 0.01, 100);

// Switch to inference mode
nn.set_training_mode(false);

// Make predictions...
let prediction = nn.predict(&input);

Modern Activation Functions

Use state-of-the-art activation functions:

use hextral::{Hextral, ActivationFunction, Optimizer};

// Swish activation (used in EfficientNet)
let mut nn = Hextral::new(2, &[4], 1, 
    ActivationFunction::Swish { beta: 1.0 }, Optimizer::default());

// GELU activation (used in BERT, GPT)
let mut nn = Hextral::new(2, &[4], 1, 
    ActivationFunction::GELU, Optimizer::default());

// Mish activation (self-regularizing)
let mut nn = Hextral::new(2, &[4], 1, 
    ActivationFunction::Mish, Optimizer::default());

Regularization

Prevent overfitting with built-in regularization techniques:

use hextral::{Hextral, Regularization, ActivationFunction, Optimizer};

let mut nn = Hextral::new(3, &[16, 8], 1, ActivationFunction::ReLU, 
                          Optimizer::Adam { learning_rate: 0.01 });

// L2 regularization (Ridge)
nn.set_regularization(Regularization::L2(0.01));

// L1 regularization (Lasso) 
nn.set_regularization(Regularization::L1(0.005));

// Dropout regularization
nn.set_regularization(Regularization::Dropout(0.3));

Different Optimizers

Choose the optimizer that works best for your problem:

// Adam: Good default choice, adaptive learning rates
let optimizer = Optimizer::Adam { learning_rate: 0.001 };

// SGD: Simple and interpretable
let optimizer = Optimizer::SGD { learning_rate: 0.1 };

// SGD with Momentum: Accelerated convergence
let optimizer = Optimizer::SGDMomentum { 
    learning_rate: 0.1, 
    momentum: 0.9 
};

Network Introspection

Get insights into your network:

// Network architecture
println!("Architecture: {:?}", nn.architecture()); // [2, 4, 3, 1]

// Parameter count  
println!("Total parameters: {}", nn.parameter_count()); // 25

// Save/load weights
let weights = nn.get_weights();
nn.set_weights(weights);
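
For the [2, 4, 3, 1] network above, counting weights plus biases gives (2×4 + 4) + (4×3 + 3) + (3×1 + 1) = 12 + 15 + 4 = 31 parameters, which is where the number in the comment comes from (assuming parameter_count tallies weights and biases).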

API Reference

Core Types

  • Hextral - Main neural network struct
  • ActivationFunction - Enum for activation functions
  • Optimizer - Enum for optimization algorithms
  • LossFunction - Enum for loss functions
  • Regularization - Enum for regularization techniques

Key Methods

  • new() - Create a new neural network
  • train() - Train the network for multiple epochs
  • predict() - Make a single prediction
  • evaluate() - Compute loss on a dataset
  • set_regularization() - Configure regularization
  • set_loss_function() - Configure the loss function
  • train_async(), predict_batch_async(), evaluate_async() - Async counterparts of the core methods

Performance Tips

  1. Use ReLU activation for hidden layers in most cases
  2. Start with Adam optimizer - it adapts learning rates automatically
  3. Apply L2 regularization if you see overfitting (test loss climbing well above training loss)
  4. Use dropout for large networks to prevent co-adaptation
  5. Normalize your input data to [0,1] or [-1,1] range for better training stability
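
For tip 5, a minimal min-max scaler (a hypothetical helper for illustration, not part of the hextral API):

use nalgebra::DVector;

// Scale each feature across the dataset into [0, 1] (sketch, not hextral API).
fn min_max_normalize(inputs: &mut [DVector<f64>]) {
    if inputs.is_empty() {
        return;
    }
    for d in 0..inputs[0].len() {
        let min = inputs.iter().map(|v| v[d]).fold(f64::INFINITY, f64::min);
        let max = inputs.iter().map(|v| v[d]).fold(f64::NEG_INFINITY, f64::max);
        let range = (max - min).max(f64::EPSILON); // guard constant features
        for v in inputs.iter_mut() {
            v[d] = (v[d] - min) / range;
        }
    }
}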

Architecture Decisions

  • Built on nalgebra for efficient linear algebra operations
  • Xavier initialization for stable gradient flow from the start
  • Proper error handling throughout the API
  • Modular design allowing easy extension of activation functions and optimizers
  • Zero-copy predictions where possible for performance

Contributing

We welcome contributions! Please feel free to:

  • Report bugs by opening an issue
  • Suggest new features or improvements
  • Submit pull requests with enhancements
  • Improve documentation
  • Add more test cases

License

This project is licensed under the MIT License - see the LICENSE file for details.

Changelog

v0.6.0 (Latest)

  • Full Async/Await Support: Complete async API alongside synchronous methods
  • Intelligent Yielding: Performance-optimized async with yielding only for large workloads (>1000 elements)
  • Concurrent Processing: Parallel batch predictions using futures and join_all
  • Async Training: Non-blocking training with cooperative multitasking
  • Code Optimization: Removed verbose AI-generated patterns for cleaner, more professional code
  • Performance Improvements: Smart async yielding prevents unnecessary overhead
  • Enhanced Documentation: Updated examples and API documentation

v0.5.1

  • Improved Documentation: Enhanced README with comprehensive examples of all new features
  • Better Crates.io Presentation: Updated documentation to properly showcase library capabilities

v0.5.0

  • Major Feature Expansion: Added comprehensive loss functions, batch normalization, and modern activation functions
  • 5 Loss Functions: MSE, MAE, Binary Cross-Entropy, Categorical Cross-Entropy, Huber Loss
  • Batch Normalization: Full implementation with training/inference modes
  • 3 New Activation Functions: Swish, GELU, Mish (total of 9 activation functions)
  • Code Organization: Separated tests into dedicated files for cleaner library structure
  • Enhanced API: Flexible loss function configuration and batch normalization controls

v0.4.0

  • Complete rewrite with proper error handling and fixed implementations
  • Implemented all documented features - train(), predict(), evaluate() methods
  • Fixed critical bugs in batch normalization and backward pass
  • Added regularization support - L1, L2, and Dropout
  • Improved documentation with usage examples and API reference