entropy

entropy is a pure Rust statistical test suite for pseudorandom number generators.

It aims to provide a readable, hackable implementation of the major classic batteries:

NIST SP 800-22 Rev. 1a
DIEHARD
DIEHARDER

This is a serious audit tool, but it is not a magical oracle. Some tests are fully faithful to the published or reference implementations, some are close ports of the Dieharder source, and a small number are still approximate. The project is strongest when it is explicit about which is which.

Dependency Note

This repository depends on Darrell Long's cryptography repository via a local Cargo path dependency. It supplies:

Block ciphers used by the CTR-mode RNGs: Camellia-128, Twofish-128, Serpent-128, SM4, Grasshopper-256, CAST-128, SEED.
Stream ciphers: Rabbit, Salsa20, Snow3G, ZUC-128.
Elliptic-curve primitives: P-256 scalar multiplication used by Dual_EC_DRBG.
DRBG: AES-256-CTR DRBG (CtrDrbgAes256).

What This Repository Is For

Use this repository when you want to:

run a broad battery of statistical checks against RNGs implemented in Rust
compare obviously bad generators against stronger ones
inspect the test code directly instead of treating a binary as a black box
experiment with classic randomness batteries in one codebase

Do not use it as the sole basis for claiming a generator is cryptographically secure.

Current State

The crate builds and tests cleanly:

cargo build
cargo test

The test runner lives in src/main.rs and the library entrypoints are split across:

The current external audit is in PEERREVIEW.md.

Running

Full audit (canonical)

tests/run_all.sh

Runs the complete audit path — NIST/DIEHARD/DIEHARDER battery plus all five auxiliary probes — and saves a timestamped log to logs/run_all-<host>-<date>.log. Feed that log to scripts/parse_battery.py to regenerate TESTS.md.

Main battery only

tests/run_battery.sh
# or with options:
tests/run_battery.sh --suite nist
tests/run_battery.sh --suite diehard --quick
tests/run_battery.sh --test nist::spectral

Auxiliary probes only

tests/run_aux.sh

Runs the five standalone research probes (Knuth + ApEn, TestU01 Hamming, TestU01 Lempel-Ziv, Webster-Tavares, Gorilla) with their default parameters. Use the individual binaries for filtered or resized runs:

cargo run --release --bin bib_tests    -- --rng AES
cargo run --release --bin upstream_tests -- --rng AES
cargo run --release --bin testu01_lz   -- --rng AES --k 27
cargo run --release --bin webster_tavares -- --samples 2048
cargo run --release --bin gorilla      -- --rng AES

Throughput benchmarks

Throughput is measured with pilot-bench, a statistical benchmarking harness that reports MW/s (10⁶ u32 words/second) with confidence intervals. Build pilot_rng and run the benchmark script:

tests/run_benchmarks.sh                        # quick preset, skip already-measured
tests/run_benchmarks.sh --preset normal        # tighter CIs (takes longer)
tests/run_benchmarks.sh --force aes_ctr rabbit # re-measure specific generators

Results land in stats/<machine>/*.bench. After measuring, regenerate the radar charts with:

python3 scripts/make_radar.py

See BENCHMARKS.md for the full results table and radar charts.

What The Runner Exercises

The default runner compares 43 built-in generators across six categories:

OS entropy

OsRng (/dev/urandom)

Degenerate (must fail everything)

ConstantRng, CounterRng

Historical broken generators (negative controls)

Unix libc: System V rand(), mrand48(), BSD random(), Linux glibc rand()/random(), FreeBSD rand_r() compat
Windows: CRT rand(), VB6/VBA Rnd(), .NET Random compat
Classic LCGs: ANSI C, MINSTD, Borland C++

Quality simulation generators

MT19937, Xorshift32, Xorshift64
PCG32, PCG64, Xoshiro256, Xoroshiro128
WyRand, SFC64, JSF64

Cipher-based CSPRNGs (block-CTR mode, from the cryptography crate)

AES-128-CTR, Camellia-128-CTR, Twofish-128-CTR, Serpent-128-CTR
SM4-CTR, Grasshopper-CTR, CAST-128-CTR, SEED-CTR
Stream ciphers: Rabbit, Salsa20, Snow3G, ZUC-128

Cryptographic DRBGs

ChaCha20, SpongeBob (SHA3-512), Squidward (SHA-256)
HmacDrbg, HashDrbg, CtrDrbgAes256 (AES-256-CTR DRBG)
DualEcDrbg (P-256, known-backdoored — negative control)

That mix makes output useful both for regression testing and for verifying that the batteries correctly punish weak and broken constructions while passing strong ones.

Implementation Status

Status here means "how comfortable this repository should be claiming fidelity," not "whether the test compiles."

Area	Status
NIST SP 800-22: frequency, block_frequency, runs, longest_run, matrix_rank, spectral, serial, approximate_entropy, cumulative_sums, universal, linear_complexity	Faithful or close faithful implementations
Maurer (1992): parametric universal family `L=6..16`	Added alongside the NIST-locked single setting; emits every parameter set that fits the available sample
NIST SP 800-22: non_overlapping_template	Faithful for all 148 aperiodic 9-bit templates with the standard `N = 8` block setup
NIST SP 800-22: random_excursions, random_excursions_variant	Faithful family outputs; runner emits all per-state results
DIEHARD: runs_float, binary_rank, birthday_spacings, bitstream, monkey tests, count_ones_stream, craps	Faithful or close to the Dieharder reference implementation
Removed on purpose	See the explicit removed-test list below
DIEHARDER: fill_tree, gcd	Faithful; runner emits both underlying sub-results
DIEHARDER: bit_distribution	Faithful `rgb_bitdist` core statistic with explicit per-width, per-pattern Vtest outputs instead of Brown's random one-pattern collapse
Several geometric / higher-level Dieharder-style tests	Plausible and useful, but still best treated as implementation-reviewed rather than externally validated
Webster–Tavares (1985): strict avalanche / bit-independence probe over seeded RNG families	Implemented as a research binary (`webster_tavares`); computes the dependence matrix and avalanche-variable correlations from the paper
Knuth TAOCP Vol. 2 §3.3.2: permutation, gap, and Wald-Wolfowitz runs-above/below-median tests	Implemented as a research binary (`bib_tests`) over uniform `[0,1)` streams
NIST SP 800-22 §2.12 ApEn statistic swept over multiple embedding dimensions `m=2..6`	Implemented as part of `bib_tests`; reveals at which pattern lengths a sequence departs from randomness beyond the single fixed NIST setting
TestU01 (2009): `scomp_LempelZiv` core statistic and official empirical calibration table	Implemented as a research binary (`testu01_lz`); exact per-replication `LZ78` phrase count and TestU01 `μ/σ` normalization, but not yet the full TestU01 goodness-of-fit reporting stack
TestU01 (2009): `sstring_HammingCorr` and `sstring_HammingIndep` core statistics	Implemented as part of `upstream_tests`; faithful TestU01 bit extraction, asymptotic normal `HammingCorr`, and TestU01-style `gofs_MinExpected=10` lumping for the main `HammingIndep` chi-square
PractRand pre-0.95: `FPF(4,14,6)` core statistic	Implemented as part of `upstream_tests`; faithful stride-spaced windowing and exponent/significand bucket counts, but without PractRand's empirical calibration tables/suspicion scores

Important Caveats

Passing these tests does not prove unpredictability, backtracking resistance, or cryptographic suitability.
A single low p-value is not automatically evidence that a generator is broken.
Some tests naturally emit families of p-values; the runner now preserves many of those families instead of flattening them into one fake verdict.
A few historically famous tests are themselves weak. In particular, Dieharder explicitly calls out some classic tests as poor discriminators.

Removed On Purpose

These are not accidental omissions. They were removed because the Dieharder reference source or documentation says they are broken, deprecated, or effectively obsolete.

DIEHARD removed: operm5 Dieharder describes the original overlapping Diehard OPERM5 as the broken/defunct test that rgb_operm was meant to replace.
DIEHARD removed: overlapping_sums Dieharder says this test is completely useless, broken, and not worth fixing, and explicitly says not to use it.
DIEHARD removed: count_ones_specific_bytes Dieharder says this byte-lane variant is effectively obsolete compared to the stream variant and rgb_bitdist.
DIEHARDER removed: none currently Deprecated internals such as the Kuiper KS path are intentionally not exposed as active tests in this crate.

Project Layout

src/math.rs: special functions, KS helper, FFT support
src/result.rs: shared result type and display logic
src/rng: RNG implementations used by the harness
src/nist: NIST SP 800-22 tests
src/diehard: DIEHARD tests
src/dieharder: DIEHARDER tests

Attribution

Functions adapted from DIEHARD or DIEHARDER include # Author citations in their doc comments. The goal is not to erase provenance behind a Rust rewrite.

Reference Corpus

This repository keeps a local reference shelf under pubs/ so people can check the implementation work against the actual standards, manuals, source releases, and papers instead of trusting summaries.

Included now:

standards: NIST-SP-800-22r1a.pdf, NIST-SP-800-90Ar1.pdf, NIST-SP-800-90B.pdf, NIST-SP-800-90C.pdf, NIST-FIPS-140-3.pdf
classic source and docs: Diehard.zip, diehard-doc.txt, diehard-tests.txt, dieharder-3.31.1.tgz, dieharder-manual.pdf, dieharder-tests.txt
core survey and extension papers: lecuyer-simard-2007-testu01.pdf, maurer-1992-universal-test.pdf, marsaglia-tsang-2002-difficult-tests.pdf, webster-tavares-1985-sbox-design.pdf, hughes-2022-badrandom-the-effect-and-mitigations-for-low-entropy-random-numbers-in-tls.pdf

When the code claims fidelity to a published test, these are the documents the project is expected to match.

References

Primary references used by the code and audit:

NIST SP 800-22 Rev. 1a
George Marsaglia, DIEHARD: A Battery of Tests of Randomness (1995)
Robert G. Brown, Dieharder 3.31.x source
Marsaglia and Tsang, "Some Difficult-to-pass Tests of Randomness," Journal of Statistical Software 7(3), 2002

Additional suites and tests surveyed (candidates for future implementation):

L'Ecuyer and Simard, "TestU01: A C Library for Empirical Testing of Random Number Generators," ACM TOMS 33(4), 2007 — the current gold standard; BigCrush contains ~106 tests including BirthdaySpacings, Gap, CouponCollector, MaxOft, LempelZiv, HammingCorr, RandomWalk, and LinearComplexity profile tests, many of which catch defects invisible to all three batteries here.
Chris Doty-Humphrey (Crow), PractRand 0.95, 2018 — streaming suite; BCFN, DC6, FPF, and TMFn tests are designed specifically for small-state generators (xorshift*, PCG) that pass all classic batteries.
Knuth, The Art of Computer Programming Vol. 2 §3.3.2 — classical tests not in NIST/Diehard: Gap, Poker (hand-type), Permutation, Wald-Wolfowitz runs above/below median, and the Serial Correlation Coefficient with exact variance.
Maurer, "A Universal Statistical Test for Random Bit Generators," Journal of Cryptology 5(2), 1992 — the full parametric form (L=10–16) is substantially more sensitive than the fixed NIST implementation.
Hellekalek and Wegenkittl, "Empirical Evidence Concerning AES," ACM Trans. Modeling and Computer Simulation 13(4), 2003 — Walsh-Hadamard spectral test; sensitive to nonlinear Boolean structure in keystream generators.
Golić, "On the Linear Complexity and Multidimensional Distribution of Decimated m-Sequences," IEEE Trans. Inf. Theory 43(3), 1997 — decimated linear complexity; directly relevant to stream ciphers and LFSR-based generators.
Doganaksoy and Göloglu, "On the Weakness of Non-Dual Bent Functions," SAC 2005, LNCS 3897 — L1-norm DFT variant; catches diffuse periodic structure missed by NIST's peak-count statistic.
Webster and Tavares, "On the Design of S-Boxes," CRYPTO 1985 — Strict Avalanche Criterion and Bit Independence Criterion; applicable to seeded PRNGs to test differential output behavior.

The source PDFs, manuals, and source archives live under pubs/. Full BibTeX entries and implementation notes are in BIB.md.

rng-entropy 0.5.0