# hill_descent_lib

A Rust genetic algorithm library for n-dimensional optimization problems.
## Quick Start

```rust
// Sketch reconstructed from the API Overview below; exact module paths
// and constructor signatures may differ - see docs.rs.
use hill_descent_lib::{setup_world, GlobalConstants, SingleValuedFunction};
use std::ops::RangeInclusive;

// Define your fitness function (lower scores are better)
struct Parabola;

impl SingleValuedFunction for Parabola {
    fn single_run(&self, params: &[f64]) -> f64 {
        params.iter().map(|p| p * p).sum()
    }
}

fn main() {
    // Search bounds for each parameter.
    let bounds: Vec<RangeInclusive<f64>> = vec![-10.0..=10.0, -10.0..=10.0];
    // Population size and region count.
    let constants = GlobalConstants::new(100, 10);
    let mut world = setup_world(&bounds, constants, Box::new(Parabola));
    for _ in 0..100 {
        world.training_run(); // one generation
    }
    println!("Best score: {}", world.get_best_score());
}
```
## Features
- N-dimensional optimization - Handles any number of parameters
- Spatial regions - Divides search space into adaptive regions for efficient exploration
- Genetic algorithm - Uses crossover, mutation, and selection for evolution
- Deterministic - Seeded RNG ensures reproducible results
- Zero-copy parallelism - Leverages Rayon for efficient multi-core processing
- Flexible fitness functions - Easy-to-implement trait for custom optimization problems
- Optional tracing - Built-in logging support for debugging (feature: `enable-tracing`)
## Installation

Add this to your Cargo.toml:

```toml
[dependencies]
hill_descent_lib = "0.1.0"
```
## Usage Examples

### Basic 2D Optimization

```rust
// Sketch based on the API Overview below; exact signatures may differ.
use hill_descent_lib::{setup_world, GlobalConstants, SingleValuedFunction};
use std::ops::RangeInclusive;

// Minimum of 0.0 at (3.0, -1.0).
struct Shifted;

impl SingleValuedFunction for Shifted {
    fn single_run(&self, params: &[f64]) -> f64 {
        (params[0] - 3.0).powi(2) + (params[1] + 1.0).powi(2)
    }
}

fn main() {
    let bounds: Vec<RangeInclusive<f64>> = vec![-10.0..=10.0, -10.0..=10.0];
    let constants = GlobalConstants::new(200, 15); // population, regions
    let mut world = setup_world(&bounds, constants, Box::new(Shifted));
    for _ in 0..500 {
        world.training_run();
    }
    println!("Best score: {}", world.get_best_score());
}
```
### Higher Dimensional Problems

```rust
use hill_descent_lib::{setup_world, GlobalConstants, SingleValuedFunction};
use std::ops::RangeInclusive;

// 10-dimensional sphere function; setup mirrors the Quick Start example.
struct Sphere10;

impl SingleValuedFunction for Sphere10 {
    fn single_run(&self, params: &[f64]) -> f64 {
        params.iter().map(|p| p * p).sum()
    }
}

fn main() {
    let bounds: Vec<RangeInclusive<f64>> = (0..10).map(|_| -5.0..=5.0).collect();
    // Scale the population with dimensionality (see Scaling Guidelines below).
    let constants = GlobalConstants::new(1_000, 30);
    let mut world = setup_world(&bounds, constants, Box::new(Sphere10));
    for _ in 0..1_000 {
        world.training_run();
    }
    println!("Best score: {}", world.get_best_score());
}
```
### Custom Function Floor

By default, the algorithm assumes the optimal fitness is 0. For functions with different optima, override `function_floor`:

```rust
use hill_descent_lib::SingleValuedFunction;

// A function whose known optimum is -1.0 rather than 0.0.
struct ShiftedCosine;

impl SingleValuedFunction for ShiftedCosine {
    fn single_run(&self, params: &[f64]) -> f64 {
        params[0].cos() // minimum value is -1.0
    }

    // Tell the algorithm the best achievable score.
    fn function_floor(&self) -> f64 {
        -1.0
    }
}
```
### Machine Learning / Neural Network Optimization

For large-scale parameter optimization (e.g., neural network weights), hill_descent_lib excels at finding good initializations or tuning thousands of parameters:

```rust
use hill_descent_lib::SingleValuedFunction;

// Simulate a neural network with training data managed internally:
// the data lives inside the fitness struct, so it is never passed
// through the library API.
struct NetFitness {
    inputs: Vec<Vec<f64>>,
    targets: Vec<f64>,
}

impl SingleValuedFunction for NetFitness {
    fn single_run(&self, weights: &[f64]) -> f64 {
        // Mean squared error of a single linear layer
        // (a stand-in for a real network's forward pass).
        let mut loss = 0.0;
        for (x, t) in self.inputs.iter().zip(&self.targets) {
            let y: f64 = x.iter().zip(weights).map(|(a, w)| a * w).sum();
            loss += (y - t).powi(2);
        }
        loss / self.inputs.len() as f64
    }
}
```
Key considerations for large-scale optimization:
- Parameter count: This library has been tested with 50,000+ parameters. For neural networks, this is suitable for smaller architectures or as an initialization strategy before gradient descent.
- Population scaling: Use `10-20x` the number of parameters for population size, but cap at a reasonable limit (e.g., 10,000) for memory and performance.
- Region count: A good heuristic is `sqrt(population_size)` for balanced exploration/exploitation.
- Hybrid approach: Use hill_descent_lib to find a good initialization, then switch to gradient-based methods (SGD, Adam, etc.) for fine-tuning. Genetic algorithms excel at escaping local minima.
- Data management: Keep training data inside your fitness function struct to avoid passing large datasets through the API.
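These sizing heuristics are easy to encode. The sketch below is illustrative only - `suggest_config` is not part of hill_descent_lib, and the exact caps are judgment calls taken from the guidelines above:

```rust
/// Suggest a population size and region count for a given parameter count,
/// following the heuristics above: 10-20x the parameter count (15x used here
/// as a midpoint), capped at 10,000, and sqrt(population) regions clamped
/// to the 10..=100 range. Illustrative only - not a library API.
fn suggest_config(param_count: usize) -> (usize, usize) {
    let population = (param_count * 15).clamp(100, 10_000);
    let regions = ((population as f64).sqrt() as usize).clamp(10, 100);
    (population, regions)
}

fn main() {
    // 10 parameters: 150 organisms, 12 regions.
    assert_eq!(suggest_config(10), (150, 12));
    // 1,000 parameters hits the population cap.
    assert_eq!(suggest_config(1_000), (10_000, 100));
    println!("{:?}", suggest_config(100)); // (1500, 38)
}
```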
## Scaling Guidelines
Understanding how hill_descent_lib scales helps you configure it effectively for your problem size:
### Parameter Count vs Performance
| Parameters | Recommended Pop Size | Recommended Regions | Memory (approx) | Time per Epoch |
|---|---|---|---|---|
| 2-10 | 100-200 | 10-15 | < 1 MB | < 1ms |
| 10-100 | 500-1,000 | 20-30 | < 10 MB | 10-50ms |
| 100-1,000 | 1,000-5,000 | 30-70 | 10-100 MB | 50-500ms |
| 1,000-10,000 | 5,000-10,000 | 70-100 | 100 MB - 1 GB | 0.5-5s |
| 10,000+ | 10,000 (capped) | 100 | 1-10 GB | 5-30s |
Times measured on modern multi-core CPU (Ryzen/Intel i7+). Actual performance varies with fitness function complexity.
### Configuration Guidelines

**Population Size:**

- Small problems (< 100 params): `10-20x` parameter count works well
- Medium problems (100-1,000 params): Use `5-10x` parameter count
- Large problems (1,000+ params): Cap at 10,000 for practical memory/time constraints
- Rule of thumb: More population = better exploration, but diminishing returns past 10,000
**Region Count:**

- Formula: `sqrt(population_size)` provides a good balance
- Minimum: At least 10 regions for any problem
- Maximum: Diminishing returns past 100 regions
- Trade-off: More regions = finer spatial resolution but more overhead
**Epochs (Generations):**

- Simple functions: 100-500 epochs usually sufficient
- Complex landscapes: 1,000-5,000 epochs may be needed
- Early stopping: Monitor `get_best_score()` and stop if no improvement for 100+ epochs
- Convergence: Expect 70-90% of final quality within the first 20% of epochs
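The early-stopping advice can be sketched as a small driver loop. In this self-contained sketch, the `run_epoch` closure is a placeholder standing in for a call to `world.training_run()` followed by `world.get_best_score()`:

```rust
/// Run epochs until `max_epochs` is reached or the best score has not
/// improved for `patience` consecutive epochs. Lower scores are better,
/// matching the library's convention.
fn train_with_early_stopping<F>(mut run_epoch: F, max_epochs: usize, patience: usize) -> f64
where
    F: FnMut() -> f64,
{
    let mut best = f64::INFINITY;
    let mut stale = 0;
    for _ in 0..max_epochs {
        let score = run_epoch();
        if score < best {
            best = score;
            stale = 0;
        } else {
            stale += 1;
            if stale >= patience {
                break; // no improvement for `patience` epochs
            }
        }
    }
    best
}

fn main() {
    // Synthetic score sequence standing in for real training: improvement
    // stalls once the score reaches 1.0, so the loop stops early.
    let mut scores = [5.0, 4.0, 3.0, 1.0, 1.0, 1.0, 1.0].into_iter();
    let best = train_with_early_stopping(|| scores.next().unwrap_or(1.0), 1_000, 3);
    assert_eq!(best, 1.0);
}
```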
### Memory Usage

Memory consumption is primarily driven by:

```text
Memory ≈ population_size × parameter_count × 24 bytes
       + region_count × 1 KB overhead
       + function-specific data
```
Example calculations:
- 1,000 params, 5,000 pop: ~120 MB
- 10,000 params, 10,000 pop: ~2.4 GB
- 50,000 params, 10,000 pop: ~12 GB
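The formula translates directly into a small estimator (illustrative only - `estimate_memory_bytes` is not a library function, and function-specific data is excluded):

```rust
/// Estimate memory use in bytes from the formula above:
/// population × parameters × 24 bytes, plus ~1 KB per region.
fn estimate_memory_bytes(population: usize, params: usize, regions: usize) -> usize {
    population * params * 24 + regions * 1024
}

fn main() {
    // 1,000 params with a population of 5,000: ~120 MB
    assert_eq!(estimate_memory_bytes(5_000, 1_000, 70) / 1_000_000, 120);
    // 50,000 params with a population of 10,000: ~12 GB
    assert_eq!(estimate_memory_bytes(10_000, 50_000, 100) / 1_000_000_000, 12);
}
```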
Tips for large problems:

- Use `cargo build --release` - debug builds use significantly more memory
- Monitor with `get_state()` sparingly (serialization copies data)
- Keep fitness function data on disk/database if > 1 GB
### When to Use vs When Not to Use
✅ Good use cases:
- No gradient information available (black-box optimization)
- Multimodal landscapes with many local minima
- Discrete or mixed parameter spaces (with custom encoding)
- Initial parameter search before fine-tuning with gradient methods
- Robust optimization where avoiding poor local minima is critical
- Small to medium problems (< 10,000 parameters)
❌ Consider alternatives when:
- Smooth, unimodal functions: Use gradient descent (much faster)
- Very large parameter spaces (> 50,000 params): Consider specialized methods
- Real-time constraints: Genetic algorithms need many evaluations
- Analytical solutions exist: Don't optimize if you can solve directly
- Limited compute budget: Each epoch evaluates entire population
### Parallel Performance

hill_descent_lib uses Rayon for parallel region processing:

- Scaling: Near-linear speedup up to `min(num_regions, num_cores)`
- Bottlenecks: Fitness function evaluation time dominates
- Sweet spot: 4-16 cores see excellent utilization
- Diminishing returns: Beyond 32 cores, overhead increases
Optimization tips:
- Ensure the fitness function is computationally intensive (> 1 μs per call)
- Avoid I/O or locks in the fitness function (they break parallelism)
- Use `release` builds - parallelism overhead matters more in debug
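Conceptually, per-region work parallelizes like the following self-contained sketch using scoped threads (the library itself uses Rayon, and the sphere-function evaluation here is only a stand-in for a real fitness function):

```rust
use std::thread;

/// Evaluate each region's organisms on its own thread - a conceptual
/// stand-in for the Rayon-based region parallelism inside hill_descent_lib.
/// Each inner Vec holds one region's parameter values; the per-region
/// "fitness" here is simply the sum of squares.
fn evaluate_regions(regions: &[Vec<f64>]) -> Vec<f64> {
    thread::scope(|s| {
        let handles: Vec<_> = regions
            .iter()
            .map(|region| s.spawn(move || region.iter().map(|p| p * p).sum::<f64>()))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}

fn main() {
    let regions = vec![vec![1.0, 2.0], vec![3.0]];
    assert_eq!(evaluate_regions(&regions), vec![5.0, 9.0]);
}
```

Because regions are evaluated independently, speedup is bounded by the smaller of the region count and the core count, as noted above.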
## Using the Tracing Feature

For debugging and analysis, enable the optional tracing feature:

```toml
[dependencies]
hill_descent_lib = { version = "0.1.0", features = ["enable-tracing"] }
```

```rust
// Install any `tracing` subscriber to capture the library's output,
// e.g. with the tracing_subscriber crate:
tracing_subscriber::fmt::init();
```
## How It Works
The library implements a genetic algorithm with spatial partitioning:
- Initialization - Creates a population of random organisms within specified bounds
- Regions - Divides the search space into adaptive regions based on organism distribution
- Evaluation - Each organism is scored using your fitness function
- Selection - Better-performing organisms in each region get more reproduction opportunities
- Reproduction - Sexual reproduction with crossover and mutation creates offspring
- Adaptation - Regions dynamically adjust based on population distribution and fitness landscape
The algorithm automatically manages:
- Region boundaries and subdivision
- Carrying capacity per region based on fitness
- Organism aging and death
- Mutation rates and adjustment parameters
- Search space expansion when needed
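To make the loop concrete, here is a deliberately tiny, self-contained genetic algorithm over a 2-D sphere function. It omits the library's spatial regions, aging, and adaptive mutation, and shares no code with hill_descent_lib - it only illustrates the initialization / evaluation / selection / reproduction cycle described above:

```rust
/// Minimal seeded LCG (Knuth's MMIX constants) so runs are reproducible,
/// mirroring the library's deterministic, seeded-RNG design.
struct Lcg(u64);
impl Lcg {
    fn next_f64(&mut self) -> f64 {
        self.0 = self.0.wrapping_mul(6364136223846793005).wrapping_add(1442695040888963407);
        (self.0 >> 11) as f64 / (1u64 << 53) as f64 // in [0, 1)
    }
}

/// 2-D sphere function: minimum 0.0 at the origin (lower is better).
fn fitness(p: &[f64; 2]) -> f64 {
    p[0] * p[0] + p[1] * p[1]
}

/// Run the toy GA and return the best fitness found.
fn run_ga(seed: u64, generations: usize) -> f64 {
    let mut rng = Lcg(seed);
    // 1. Initialization: random organisms within [-5, 5] on each axis.
    let mut pop: Vec<[f64; 2]> = (0..40)
        .map(|_| [rng.next_f64() * 10.0 - 5.0, rng.next_f64() * 10.0 - 5.0])
        .collect();
    for _ in 0..generations {
        // 2-3. Evaluation and selection: the better half become parents.
        pop.sort_by(|a, b| fitness(a).partial_cmp(&fitness(b)).unwrap());
        pop.truncate(20);
        // 4. Reproduction: crossover (averaging) plus a small mutation.
        let mut offspring = Vec::with_capacity(20);
        for i in 0..20 {
            let (a, b) = (pop[i], pop[(i + 1) % 20]);
            let mut child = [(a[0] + b[0]) / 2.0, (a[1] + b[1]) / 2.0];
            child[0] += (rng.next_f64() - 0.5) * 0.2;
            child[1] += (rng.next_f64() - 0.5) * 0.2;
            offspring.push(child);
        }
        pop.extend(offspring); // parents survive: best-so-far never regresses
    }
    pop.sort_by(|a, b| fitness(a).partial_cmp(&fitness(b)).unwrap());
    fitness(&pop[0])
}

fn main() {
    let best = run_ga(42, 300);
    assert!(best < 0.05, "expected near-optimal fitness, got {best}");
}
```

The same seed always produces the same result, which is the property the library's seeded RNG guarantees at full scale.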
## API Overview

### Core Types

- `setup_world()` - Initialize the optimization environment
- `GlobalConstants` - Configuration (population size, region count, seed)
- `World` - Main optimization container with methods:
  - `training_run()` - Run one generation
  - `get_best_score()` - Get current best fitness
  - `get_best_organism()` - Get the best organism
  - `get_state()` - Get JSON representation of world state
### Traits to Implement

- `SingleValuedFunction` - Define your fitness function
  - `single_run(&self, params: &[f64]) -> f64` - Required
  - `function_floor(&self) -> f64` - Optional (default: 0.0)
## Performance Characteristics
- Parallelism: Region processing uses Rayon for multi-core efficiency
- Memory: Allocates based on population size and dimensionality
- Determinism: Seeded RNG ensures reproducible runs with same configuration
- Scalability: Tested with 100+ dimensions and populations of 10,000+
## Common Use Cases
- Parameter optimization for machine learning models
- Neural network weight initialization and tuning
- Function minimization/maximization problems
- Engineering design optimization
- Scientific computing and simulation tuning
- Hyperparameter search
## Comparison to Other Libraries
Unlike gradient-based optimizers, hill_descent_lib:
- ✅ Doesn't require differentiable functions
- ✅ Handles discrete and continuous parameters
- ✅ Explores multiple regions simultaneously
- ✅ Less likely to get stuck in local minima
- ❌ Generally requires more function evaluations
- ❌ Not as fast for smooth, convex problems
## Documentation
Full API documentation is available on docs.rs.
## Examples

See the examples/ directory for more usage patterns:

- `simple_optimization.rs` - Basic 2D example
- `custom_function.rs` - Implementing custom fitness functions
- `multi_dimensional.rs` - High-dimensional optimization

Run examples with:

```shell
cargo run --release --example simple_optimization
```
## Minimum Supported Rust Version (MSRV)

Rust 2024 edition (Rust 1.85.0+)
## Contributing
Contributions are welcome! Please feel free to submit issues or pull requests.
## License
Licensed under either of:
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)
at your option.
### Contribution
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.
## Acknowledgments
Developed on Windows with cross-platform compatibility in mind.
## Repository

Source code: https://github.com/cainem/hill_descent