rsfgsea 0.3.2

High-performance fgsea-compatible preranked Gene Set Enrichment Analysis in Rust
Documentation

rsfgsea

High-performance Rust implementation of preranked Gene Set Enrichment Analysis (GSEA), designed as a drop-in, optimized alternative to the R fgsea package.

What it focuses on:

  • fgsea-compatible simple and multilevel workflows
  • library and CLI use from one Rust crate
  • deterministic parity-focused CPU path
  • optional hybrid GPU acceleration for large simple-stage screening

Headline results from the main project benchmarks:

  • multilevel, small workload, 1 worker: 2 ms vs R 43 ms (21.5x faster)
  • multilevel, large workload, 16 workers: 104 ms vs R 939 ms (9.0x faster)
  • simple, small workload, 1 worker: 814 ms vs R 2544 ms (3.1x faster)
  • simple, large workload, 16 workers: 687 ms vs R 774 ms (1.13x faster)

Current CPU multilevel parity vs R is near floating-point noise in this repo's validation setup:

  • max |ES| diff: 4.988e-09
  • max |NES| diff: 4.983e-09
  • max |pval| diff: 4.975e-09
  • max |padj| diff: 4.965e-09

Install

Library:

[dependencies]
rsfgsea = "0.3.2"

CLI:

cargo install rsfgsea

Minimal Rust Example

use rsfgsea::prelude::*;

let ranks = RankedList::new(
    vec!["g1".into(), "g2".into(), "g3".into(), "g4".into()],
    vec![2.0, 1.0, -1.0, -2.0],
);

let pathways = vec![
    Pathway {
        name: "PW_A".into(),
        description: None,
        genes: vec!["g1".into(), "g2".into()],
    },
];

let results = fgsea(
    &ranks,
    &pathways,
    None,
    1000,
    42,
    1,
    ranks.len() - 1,
    1e-50,
    ScoreType::Std,
    1.0,
);

println!("{}", results[0].pathway_name);

CLI

rsfgsea \
  --ranks data/example.rnk \
  --gmt data/pathways.gmt \
  --output results.tsv

For deeper usage, benchmarks, parity notes, and reproducibility details, see the main repository: https://github.com/deminden/rsfgsea