PRISM-Q
██████╗ ██████╗ ██╗███████╗███╗ ███╗ ██████╗
██╔══██╗██╔══██╗██║██╔════╝████╗ ████║ ██╔═══██╗
██████╔╝██████╔╝██║███████╗██╔████╔██║█████╗██║ ██║
██╔═══╝ ██╔══██╗██║╚════██║██║╚██╔╝██║╚════╝██║▄▄ ██║
██║ ██║ ██║██║███████║██║ ╚═╝ ██║ ╚██████╔╝
╚═╝ ╚═╝ ╚═╝╚═╝╚══════╝╚═╝ ╚═╝ ╚══▀▀═╝
PRISM-Q is a Rust quantum circuit simulator built for speed. It dispatches across multiple specialized backends, runs a multi pass fusion pipeline, and uses AVX2, FMA, and BMI2 SIMD kernels in the inner loop. CPU kernels are the default path, with optional CUDA support for statevector and experimental stabilizer workloads. Input is OpenQASM 3.0 with backward compatible 2.0 syntax.
Installation
Add PRISM-Q to a Rust project:
Enable Rayon parallelism for larger circuits:
For CUDA support, install CUDA Toolkit 12.x or newer, then build with:
Building from source or pinning to a git revision is covered in
CONTRIBUTING.md.
Quick start
use run_qasm;
let qasm = r#"
OPENQASM 3.0;
include "stdgates.inc";
qubit[2] q;
bit[2] c;
h q[0];
cx q[0], q[1];
c[0] = measure q[0];
c[1] = measure q[1];
"#;
let result = run_qasm.unwrap;
println!;
// Bell state: ~50% |00⟩, ~50% |11⟩
Shot-based sampling
use ;
let circuit = parse.unwrap;
let result = simulate.seed.shots.unwrap;
println!;
// 00: 512
// 11: 512
let counts = simulate
.seed
.sample_counts
.unwrap;
for in counts.into_counts
Backend dispatch
use ;
let circuit = parse.unwrap;
// Auto picks the optimal backend based on circuit properties.
let auto = simulate.seed.run.unwrap;
// Or choose explicitly.
let stab = simulate
.backend
.seed
.run
.unwrap;
let mps = simulate
.backend
.seed
.run
.unwrap;
let sparse = simulate
.backend
.seed
.run
.unwrap;
Programmatic circuit construction
use CircuitBuilder;
let result = new
.h
.cx
.cx
.run
.unwrap;
CircuitBuilder chains gate, control, and execution methods. For lower-level access,
use Circuit directly:
use ;
let mut c = new;
c.add_gate;
c.add_gate;
c.add_gate;
let result = simulate.seed.run.unwrap;
Backends
| Backend | Best for | Scaling | Key property |
|---|---|---|---|
| Statevector | General circuits | O(2ⁿ) | Full SIMD, tiled L2/L3 kernels, optional CUDA path |
| Stabilizer | Clifford only | O(n²) | SIMD optimized, scales to thousands of qubits |
| Sparse | Few live amplitudes | O(k) | HashMap with parallel measurement |
| MPS | Low entanglement or 1D | O(nχ²) | Hybrid faer / Jacobi SVD |
| Product State | No entanglement | O(n) | Per qubit, instant |
| Tensor Network | Low treewidth | Depends on contraction order | Greedy min size heuristic |
| Factored | Partial entanglement | Dynamic | Tracks independent sub-states |
BackendKind::Auto selects at dispatch time. Non-entangling circuits go to Product
State, all-Clifford circuits go to Stabilizer, large circuits fall through to MPS with
bond dimension 256 once they exceed the statevector memory budget, and everything else
runs on Statevector. The memory budget is dynamic, derived from available RAM at
dispatch time, and can be overridden with PRISM_MAX_SV_QUBITS.
Gates and OpenQASM support
Covers the standard OpenQASM stdgates.inc set, common controlled and multi-controlled
variants, Qiskit exporter gates, IonQ and Google/Cirq native gate names, decomposed
multi-instruction gates, and IBM legacy u1/u2/u3 syntax. Modifiers inv @, ctrl @,
pow(k) @ chain arbitrarily for direct gates, and user-defined gate declarations are
supported.
The authoritative list of supported gate keywords, language features, and modifiers
lives in the parser at src/circuit/openqasm.rs. See
resolve_gate() and resolve_decomposed_gate(). Smoke tests in
tests/smoke_openqasm.rs exercise each feature end to end.
Build and test
For Rayon parallelism on larger circuits:
Thread count defaults to logical cores. Set RAYON_NUM_THREADS to override.
GPU backend (optional)
The gpu feature enables CUDA paths for the statevector and (experimentally) the
stabilizer backend. Requires CUDA Toolkit 12.x or newer and a CUDA capable device.
PTX is compiled at runtime via NVRTC against the device's compute capability.
Opt in through the simulation builder. The circuit still goes through fusion and independent subsystem decomposition, and a size aware crossover keeps small sub circuits on the CPU:
use ;
let ctx = new?;
let result = simulate.gpu.seed.run?;
BackendKind::StabilizerGpu runs Clifford circuits on the device, and
CompiledSampler::with_gpu(ctx) accelerates large shot counts for compiled BTS
sampling. Crossover thresholds are conservative by default and can be tuned through
PRISM_GPU_MIN_QUBITS, PRISM_STABILIZER_GPU_MIN_QUBITS, and
PRISM_GPU_BTS_MIN_SHOTS.
BackendKind::Auto does not yet route to GPU. See
docs/architecture.md for kernel design, crossover analysis,
and the full set of tuning knobs.
Coverage
Requires rustup component add llvm-tools-preview and cargo install cargo-llvm-cov.
CI generates coverage on every push and PR, and updates the badge automatically.
Benchmarks
Always use --features parallel; baselines were taken with Rayon enabled. Do not run
two cargo bench invocations concurrently on the same machine — Rayon thread pools
contend for cores and skew results.
Baseline capture, regression checks, and the markdown table workflow used in PRs live
in CONTRIBUTING.md.
Profiling
Needs cargo install flamegraph:
SVGs land in bench_results/ (gitignored).
Roadmap
- Density matrix backend: mixed state simulation for noise and decoherence modeling.
- GPU auto dispatch: thread a GPU context into
BackendKind::Autoso large circuits route to GPU without an explicitBackendKind::StatevectorGpu. Crossover and decomposition already work through the explicit variant. - Expanded classical control: mid circuit branching beyond the current
ifform, and parameterized circuit reuse for variational workloads. - Distributed statevector: multi node sharding for circuits beyond single host memory.
Architecture
See docs/architecture.md for the full picture: layered design,
backend trait contract, SIMD strategy, fusion pipeline, compiled samplers, and how to
add a new backend.
Contributing
See CONTRIBUTING.md for the build, test, and benchmark workflow.
The pull request template at
.github/PULL_REQUEST_TEMPLATE.md captures the
required checklist.