RunNX

A minimal, mathematically verifiable ONNX runtime implementation in Rust.

Overview

Fast, fearless, and formally verified ONNX in Rust.

This project provides a minimal, educational ONNX runtime implementation focused on:

  • Simplicity: Easy to understand and modify
  • Verifiability: Formal mathematical verification using Why3 and property-based testing
  • Performance: Efficient operations using ndarray
  • Safety: Memory-safe Rust implementation with mathematical guarantees

Features

  • ✅ Dual Format Support: JSON and binary ONNX protobuf formats
  • ✅ Auto-detection: Automatic format detection based on file extension
  • ✅ Basic tensor operations (Add, Mul, MatMul, Conv, Relu, Sigmoid, Reshape, Transpose)
  • ✅ Formal mathematical specifications with Why3
  • ✅ Property-based testing for mathematical correctness
  • ✅ Runtime invariant verification
  • ✅ Model loading and validation
  • ✅ Inference execution
  • ✅ Error handling and logging
  • ✅ Benchmarking support
  • ✅ Async support (optional)
  • ✅ Command-line runner
  • ✅ Comprehensive examples

Quick Start

Prerequisites

RunNX requires the Protocol Buffers compiler (protoc) to build:

# Ubuntu/Debian
sudo apt-get install protobuf-compiler

# macOS  
brew install protobuf

# Windows
choco install protoc

Installation

Add this to your Cargo.toml:

[dependencies]
runnx = "0.1.1"
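
To enable the optional async support shown later in this README (the feature name matches the --features async flag used with the CLI examples), turn on the async feature:

[dependencies]
runnx = { version = "0.1.1", features = ["async"] }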

Basic Usage

use runnx::{Model, Tensor};

// Load a model (supports both JSON and ONNX binary formats)  
let model = Model::from_file("model.onnx")?;  // Auto-detects format
// Or explicitly:
// let model = Model::from_onnx_file("model.onnx")?;  // Binary ONNX
// let model = Model::from_json_file("model.json")?;  // JSON format

// Create input tensor
let input = Tensor::from_array(ndarray::array![[1.0, 2.0, 3.0]]);

// Run inference
let outputs = model.run(&[("input", input)])?;

// Get results
let result = outputs.get("output").unwrap();
println!("Result: {:?}", result.data());

Saving Models

use runnx::Model;

let model = Model::from_file("model.onnx")?;  // ... or build a model programmatically

// Save in different formats
model.to_file("output.onnx")?;    // Auto-detects format from extension  
model.to_onnx_file("binary.onnx")?;  // Explicit binary ONNX format
model.to_json_file("readable.json")?;  // Explicit JSON format

Command Line Usage

# Run inference on a model (supports both .onnx and .json files)
cargo run --bin runnx-runner -- --model model.onnx --input input.json
cargo run --bin runnx-runner -- --model model.json --input input.json

# Run with async support
cargo run --features async --bin runnx-runner -- --model model.onnx --input input.json

Architecture

The runtime is organized into several key components:

Core Components

  • Model: ONNX model representation and loading
  • Graph: Computational graph with nodes and edges
  • Tensor: N-dimensional array wrapper with type safety
  • Operators: Implementation of ONNX operations
  • Runtime: Execution engine with optimizations
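
As a rough sketch of how these components fit together (a minimal example using only calls that appear elsewhere in this README; the composition itself is illustrative):

use runnx::Tensor;

fn demo() -> runnx::Result<()> {
    // Tensors wrap ndarray values with shape information...
    let a = Tensor::from_array(ndarray::array![[1.0, 2.0]]);
    let b = Tensor::from_array(ndarray::array![[3.0, 4.0]]);

    // ...and operators consume and produce tensors
    let sum = a.add(&b)?;
    println!("shape: {:?}, data: {:?}", sum.shape(), sum.data());
    Ok(())
}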

File Format Support

RunNX supports both JSON and binary ONNX protobuf formats:

📄 JSON Format

  • Human-readable: Easy to inspect and debug
  • Text-based: Can be viewed and edited in any text editor
  • Larger file size: More verbose due to text representation
  • Extension: .json

🔧 Binary ONNX Format

  • Standard format: Official ONNX protobuf serialization
  • Compact: Smaller file sizes due to binary encoding
  • Interoperable: Compatible with other ONNX runtime implementations
  • Extension: .onnx

🎯 Auto-Detection

The Model::from_file() method automatically detects the format based on file extension:

  • .onnx files → Binary ONNX protobuf format
  • .json files → JSON format
  • Other extensions → Attempts JSON parsing as fallback

For explicit control, use:

  • Model::from_onnx_file() for binary ONNX files
  • Model::from_json_file() for JSON files
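
A minimal sketch of the difference (file names are illustrative):

use runnx::Model;

// Auto-detection by extension
let binary_model = Model::from_file("net.onnx")?;  // parsed as binary protobuf
let json_model = Model::from_file("net.json")?;    // parsed as JSON

// Explicit loader when the extension is non-standard
let explicit_model = Model::from_json_file("exported.model")?;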

Supported Operators

Operator    Status   Notes
Add         ✅       Element-wise addition
Mul         ✅       Element-wise multiplication
MatMul      ✅       Matrix multiplication
Conv        ✅       2D Convolution
Relu        ✅       Rectified Linear Unit
Sigmoid     ✅       Sigmoid activation
Reshape     ✅       Tensor reshaping
Transpose   ✅       Tensor transposition

Examples

Format Compatibility Demo

use runnx::*;

fn main() -> runnx::Result<()> {
    // Create a simple model
    let mut graph = graph::Graph::new("demo_graph".to_string());
    
    // Add input/output specifications
    let input_spec = graph::TensorSpec::new("input".to_string(), vec![Some(1), Some(4)]);
    let output_spec = graph::TensorSpec::new("output".to_string(), vec![Some(1), Some(4)]);
    graph.add_input(input_spec);
    graph.add_output(output_spec);
    
    // Add a ReLU node
    let relu_node = graph::Node::new(
        "relu_1".to_string(),
        "Relu".to_string(), 
        vec!["input".to_string()],
        vec!["output".to_string()],
    );
    graph.add_node(relu_node);
    
    let model = model::Model::with_metadata(
        model::ModelMetadata {
            name: "demo_model".to_string(),
            version: "1.0".to_string(),
            description: "A simple ReLU demo model".to_string(),
            producer: "RunNX Demo".to_string(),
            onnx_version: "1.9.0".to_string(),
            domain: "".to_string(),
        },
        graph,
    );

    // Save in both formats
    model.to_json_file("demo_model.json")?;
    model.to_onnx_file("demo_model.onnx")?;
    
    // Load from both formats
    let json_model = model::Model::from_json_file("demo_model.json")?;
    let onnx_model = model::Model::from_onnx_file("demo_model.onnx")?;
    
    // Auto-detection also works
    let auto_json = model::Model::from_file("demo_model.json")?;
    let auto_onnx = model::Model::from_file("demo_model.onnx")?;
    
    println!("✅ All formats loaded successfully!");
    println!("Original: {}", model.name());
    println!("JSON: {}", json_model.name());
    println!("ONNX: {}", onnx_model.name());
    
    Ok(())
}

Simple Linear Model

use runnx::{Model, Tensor};
use ndarray::Array2;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Initialize logging
    env_logger::init();

    // Create a simple linear transformation: y = x * w + b
    let weights = Array2::from_shape_vec((3, 2), vec![0.5, 0.3, 0.2, 0.4, 0.1, 0.6])?;
    let bias = Array2::from_shape_vec((1, 2), vec![0.1, 0.2])?;
    
    let input = Tensor::from_array(Array2::from_shape_vec((1, 3), vec![1.0, 2.0, 3.0])?);
    let w_tensor = Tensor::from_array(weights);
    let b_tensor = Tensor::from_array(bias);
    
    // Manual computation for verification
    let result1 = input.matmul(&w_tensor)?;
    let result2 = result1.add(&b_tensor)?;
    
    println!("Linear transformation result: {:?}", result2.data());
    Ok(())
}

Running the Examples

# Format compatibility demonstration
cargo run --example onnx_demo

# Format conversion between JSON and ONNX binary
cargo run --example format_conversion

# Simple model operations
cargo run --example simple_model

# Formal verification examples
cargo run --example formal_verification

# Tensor operations
cargo run --example tensor_ops

Model Loading and Inference

use runnx::{Model, Tensor};
use std::collections::HashMap;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Load model from file
    let model = Model::from_file("path/to/model.onnx")?;
    
    // Print model information
    println!("Model: {}", model.name());
    println!("Inputs: {:?}", model.input_names());
    println!("Outputs: {:?}", model.output_names());
    
    // Prepare inputs
    let mut inputs = HashMap::new();
    inputs.insert("input", Tensor::zeros(&[1, 3, 224, 224]));
    
    // Run inference
    let outputs = model.run(&inputs)?;
    
    // Process outputs
    for (name, tensor) in outputs {
        println!("Output '{}': shape {:?}", name, tensor.shape());
    }
    
    Ok(())
}

Performance

The runtime includes benchmarking capabilities:

# Run benchmarks
cargo bench

# Generate HTML reports
cargo bench -- --output-format html

Example benchmark results:

  • Basic operations: ~10-50 µs
  • Small model inference: ~100-500 µs
  • Medium model inference: ~1-10 ms
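
As an illustration, a micro-benchmark for element-wise addition could look like the sketch below. It assumes the criterion harness (the crate commonly behind cargo bench HTML reports); it is not a listing of RunNX's actual benchmark code, and the benchmark name is made up:

use criterion::{criterion_group, criterion_main, Criterion};
use runnx::Tensor;

fn bench_add(c: &mut Criterion) {
    // Two 64x64 zero tensors as a fixed workload
    let a = Tensor::from_array(ndarray::Array2::<f32>::zeros((64, 64)));
    let b = Tensor::from_array(ndarray::Array2::<f32>::zeros((64, 64)));
    c.bench_function("tensor_add_64x64", |bencher| {
        bencher.iter(|| a.add(&b).unwrap())
    });
}

criterion_group!(benches, bench_add);
criterion_main!(benches);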

Formal Verification

RunNX includes comprehensive formal verification capabilities to ensure mathematical correctness:

🔬 Mathematical Specifications

The runtime includes formal specifications for all tensor operations using Why3:

(** Addition operation specification *)
function add_spec (a b: tensor) : tensor
  requires { valid_tensor a /\ valid_tensor b }
  requires { a.shape = b.shape }
  ensures  { valid_tensor result }
  ensures  { result.shape = a.shape }
  ensures  { forall i. 0 <= i < length result.data ->
             result.data[i] = a.data[i] + b.data[i] }

🧪 Property-Based Testing

Automatic verification of mathematical properties:

use runnx::formal::contracts::{AdditionContracts, ActivationContracts};

// Test addition commutativity: a + b = b + a
let result1 = tensor_a.add_with_contracts(&tensor_b)?;
let result2 = tensor_b.add_with_contracts(&tensor_a)?;
assert_eq!(result1.data(), result2.data());

// Test ReLU idempotency: ReLU(ReLU(x)) = ReLU(x)  
let relu_once = tensor.relu_with_contracts()?;
let relu_twice = relu_once.relu_with_contracts()?;
assert_eq!(relu_once.data(), relu_twice.data());
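
The same contracts can be driven with randomized inputs. A minimal sketch assuming the proptest crate (shapes and value ranges are illustrative):

use proptest::prelude::*;
use runnx::formal::contracts::AdditionContracts;
use runnx::Tensor;

proptest! {
    #[test]
    fn add_commutes(xs in proptest::collection::vec(-1e3f32..1e3f32, 8),
                    ys in proptest::collection::vec(-1e3f32..1e3f32, 8)) {
        let a = Tensor::from_array(ndarray::Array2::from_shape_vec((1, 8), xs).unwrap());
        let b = Tensor::from_array(ndarray::Array2::from_shape_vec((1, 8), ys).unwrap());
        // Element-wise f32 addition is exactly commutative, so both orders must agree
        let ab = a.add_with_contracts(&b).unwrap();
        let ba = b.add_with_contracts(&a).unwrap();
        prop_assert_eq!(ab.data(), ba.data());
    }
}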

🔍 Runtime Verification

Dynamic checking of invariants during execution:

use runnx::formal::runtime_verification::InvariantMonitor;

let monitor = InvariantMonitor::new();
let result = tensor.add(&other)?;

// Verify numerical stability and bounds
assert!(monitor.verify_operation(&[&tensor, &other], &[&result]));

🎯 Verified Properties

The formal verification system proves:

  • Addition: Commutativity, associativity, identity
  • Matrix Multiplication: Associativity, distributivity
  • ReLU: Idempotency, monotonicity, non-negativity
  • Sigmoid: Boundedness (0, 1), monotonicity, symmetry
  • Numerical Stability: Overflow/underflow prevention
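
For instance, the Sigmoid boundedness property can be spot-checked directly at runtime. Note that the method name below is an assumption by analogy with relu_with_contracts above, not confirmed API:

use runnx::formal::contracts::ActivationContracts;
use runnx::Tensor;

// Hypothetical: `sigmoid_with_contracts` is assumed by analogy with `relu_with_contracts`
let x = Tensor::from_array(ndarray::array![[-2.0, 0.0, 3.0]]);
let y = x.sigmoid_with_contracts()?;
assert!(y.data().iter().all(|&v| v > 0.0 && v < 1.0));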

📊 Running Formal Verification

# Install Why3 (optional, for complete formal proofs)
make -C formal install-why3

# Run all verification (tests + proofs)
make -C formal all

# Run only property-based tests (no Why3 required)
cargo test formal --lib

# Run verification example
cargo run --example formal_verification

# Generate verification report  
make -C formal report

Development

Quick Development with Justfile

RunNX includes a Justfile with convenient shortcuts for common development tasks:

# Install just command runner (one time setup)
cargo install just

# Show all available commands
just --list

# Quick development cycle
just dev          # Format, lint, and test
just test         # Run all tests
just build        # Build the project
just examples     # Run all examples

# Code quality
just format       # Format code
just lint         # Run clippy
just quality      # Run quality check script

# Documentation  
just docs-open    # Build and open docs

# Benchmarks
just bench        # Run benchmarks

# Formal verification
just formal-test  # Test formal verification setup

# CI simulation
just ci          # Simulate CI checks locally

Alternatively, if you don't have just installed, use the included shell script:

# Show all available commands
./dev.sh help

# Quick development cycle
./dev.sh dev      # Format, lint, and test
./dev.sh test     # Run all tests  
./dev.sh examples # Run all examples

Running Tests

# Run all tests
cargo test
# or with just
just test

# Run tests with logging
RUST_LOG=debug cargo test

# Run specific test
cargo test test_tensor_operations

Building Documentation

# Build and open documentation
cargo doc --open
# or with just  
just docs-open

# Build with private items
cargo doc --document-private-items

Contributing

We welcome contributions! Please follow our development quality standards:

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes following our Development QA Guidelines
  4. Add tests and documentation
  5. Run quality checks: ./scripts/quality-check.sh
  6. Commit your changes (pre-commit hooks will run automatically)
  7. Submit a pull request

Development Quality Assurance

RunNX uses automated quality assurance tools to maintain code quality:

  • Pre-commit hooks: Automatically run formatting, linting, and tests before each commit
  • Code formatting: Consistent style enforced by rustfmt
  • Linting: Comprehensive checks with clippy (warnings treated as errors)
  • Comprehensive testing: Unit tests, integration tests, property-based tests, and doc tests
  • Build verification: Ensures all code compiles successfully

For detailed information, see Development QA Guidelines.

To run quality checks manually:

# Run all quality checks with auto-fixes
./scripts/quality-check.sh

# Or run individual checks
cargo fmt           # Format code
cargo clippy        # Run linting
cargo test          # Run all tests

License

This project is licensed under the terms given in the LICENSE file in the repository.

Acknowledgments

  • ONNX - Open Neural Network Exchange format
  • ndarray - N-dimensional arrays for Rust
  • Candle - Inspiration for some design patterns

Roadmap

✅ Completed

  • Dual Format Support: Both JSON and binary ONNX protobuf formats
  • Auto-detection: Automatic format detection based on file extension
  • Core Operators: Add, Mul, MatMul, Conv, ReLU, Sigmoid, Reshape, Transpose
  • Formal Verification: Mathematical specifications with Why3
  • CLI Tool: Command-line runner for model inference

🚧 Planned

  • Add more operators (Softmax, BatchNorm, etc.)
  • GPU acceleration support
  • Quantization support
  • Model optimization passes
  • WASM compilation target
  • Python bindings