scematica-nn

A pure-Rust Double / Dueling Deep Q* reinforcement-learning agent with no external ML-framework dependency — the network, replay buffer, and training loop are all implemented from scratch in safe Rust.

Originally built to drive trade decisions in the Scematica Solana trading stack, it is general enough for any discrete-action RL task.

Features

Dueling DQN head: Q(s,a) = V(s) + A(s,a) − mean(A)
Double DQN target: online net selects actions, target net evaluates them
Prioritized experience replay with N-step returns
Epsilon-greedy exploration with decay; periodic hard target-net copy
Per-regime network branching and a parallel agent tournament
JSON checkpointing with dimension-mismatch–safe loading
Serde-serializable state/stats for external dashboards

Quick start

use scematica_nn::{DQNAgent, TradeState};

let mut agent = DQNAgent::default();
let state = TradeState::default();
let (action, q_values) = agent.advise(&state);
println!("chose {} — Q={:?}", action.label(), q_values);

Live viewer

Install the crate and run the bundled scema-ddqn viewer — a terminal dashboard that trains the agent on a synthetic task in real time so you can watch it learn: epsilon annealing, falling loss, separating Q-values, the chosen-action distribution, and a policy-accuracy curve climbing toward the optimal policy.

cargo install scematica-nn

scema-ddqn          # space = pause · s = step · +/- = speed · q = quit

The viewer ships behind a default cli feature. Depending on the crate as a library and want just the agent? Skip the TUI stack:

scematica-nn = { version = "1.12", default-features = false }

Core types

Type	Purpose
`DQNAgent`	The agent: `advise`, `observe`, `save`/`load`, `stats`
`TradeState`	24-feature state vector, normalized to `[0,1]`
`TradeAction`	`Hold` · `BuyStandard` · `BuyAggressive` · `SellPartial` · `SellAll`
`AgentTournament`	Runs conservative/balanced/aggressive variants in parallel

License