# eegpt-rs

Pure-Rust inference for the **EEGPT** (Pretrained Transformer for Universal and Reliable Representation of EEG Signals) foundation model, built on [Burn 0.20](https://burn.dev).

EEGPT is pretrained on large-scale EEG data with a dual self-supervised objective (spatio-temporal representation alignment plus masked reconstruction). It uses learned channel embeddings for spatial flexibility across montages and summary tokens for a compact global representation.

## Architecture

```
EEG [B, C, T]
    ├─ Conv2d Patch Embedding (patch_size=64, stride=32)
    │  → [B, n_patches, C, embed_dim]
    ├─ Channel Embedding (learned per-channel vectors)
    │  → [B, n_patches, C, embed_dim]
    ├─ Per-patch Transformer (concat summary tokens, self-attention)
    │  [B*n_patches, C+embed_num, embed_dim] → blocks → extract summary
    │  → [B, n_patches, embed_num, embed_dim]
    └─ LinearConstraintProbe (flatten → Linear → flatten → Linear)
       → [B, n_outputs]
```
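The tensor shapes in the diagram follow from plain patch arithmetic. The sketch below walks through them for one configuration; `patch_size=64` and `stride=32` come from the diagram, while `embed_num=4` is an assumed summary-token count for illustration only.

```rust
// Shape walk-through of the EEGPT pipeline above (illustrative only).
// patch_size=64 and stride=32 match the diagram; embed_num=4 is an
// assumed value for this sketch, not taken from the crate.
fn n_patches(t: usize, patch_size: usize, stride: usize) -> usize {
    (t - patch_size) / stride + 1
}

fn main() {
    let (b, c, t) = (1, 22, 1000); // batch, channels, time samples
    let (patch_size, stride) = (64, 32);
    let (embed_dim, embed_num) = (512, 4);

    let p = n_patches(t, patch_size, stride);
    // Conv2d patch embedding: [B, C, T] -> [B, n_patches, C, embed_dim]
    println!("patch embed:    [{b}, {p}, {c}, {embed_dim}]");
    // Per-patch transformer input: channels plus summary tokens, per patch
    println!("transformer in: [{}, {}, {embed_dim}]", b * p, c + embed_num);
    // Extracted summary tokens: [B, n_patches, embed_num, embed_dim]
    println!("summary out:    [{b}, {p}, {embed_num}, {embed_dim}]");
}
```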

## Benchmarks

Benchmarked on Apple M3 Max (arm64, macOS 26.3.1) with `embed_dim=512`, `depth=2`, `heads=8`.
Python baseline: PyTorch 2.8.0. Rust backends: NdArray, Accelerate BLAS, Metal GPU.

### Inference Latency

![Inference Latency](figures/inference_latency.png)

### Speedup vs Python

![Speedup](figures/speedup.png)

The **Metal GPU** backend achieves up to **3.3x speedup** over PyTorch, with consistent 2–3x improvement across all configurations.

### Channel Scaling

![Channel Scaling](figures/channel_scaling.png)

### Time Scaling

![Time Scaling](figures/time_scaling.png)

### Summary Table

| Configuration | Python (PyTorch) | Rust (Accelerate) | Rust (Metal GPU) |
|---------------|:----------------:|:-----------------:|:----------------:|
| 4ch × 1000t   | 5.46 ms          | 12.91 ms          | **2.61 ms** (2.1x) |
| 8ch × 1000t   | 6.93 ms          | 15.15 ms          | **2.59 ms** (2.7x) |
| 16ch × 1000t  | 10.13 ms         | 20.84 ms          | **4.38 ms** (2.3x) |
| 22ch × 1000t  | 13.33 ms         | 24.43 ms          | **4.05 ms** (3.3x) |
| 32ch × 1000t  | 16.27 ms         | 32.62 ms          | **6.18 ms** (2.6x) |
| 64ch × 1000t  | 28.68 ms         | –                 | **9.43 ms** (3.0x) |
| 22ch × 2000t  | 22.18 ms         | 44.03 ms          | **8.77 ms** (2.5x) |
| 22ch × 4000t  | 41.82 ms         | 85.27 ms          | **17.01 ms** (2.5x) |

> **Note:** EEGPT processes each temporal patch through the full transformer independently (B×n_patches forward passes), making it more compute-intensive per sample than models like REVE. The Metal GPU backend excels by parallelizing these independent patch computations.
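The per-patch forward-pass count also explains the near-linear time scaling above: the number of independent transformer passes grows roughly in proportion to `T`. A quick estimate (using the `patch_size=64`, `stride=32` values from the architecture section):

```rust
// Independent per-patch transformer forward passes per sample (B * n_patches),
// using patch_size=64 and stride=32 from the architecture section.
fn forward_passes(batch: usize, t: usize) -> usize {
    batch * ((t - 64) / 32 + 1)
}

fn main() {
    for t in [1000, 2000, 4000] {
        // Patch count roughly doubles with T, matching the ~2x latency
        // growth per doubling seen in the summary table.
        println!("T={t}: {} transformer passes", forward_passes(1, t));
    }
}
```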

### Numerical Parity

Python ↔ Rust output difference: **< 7.3×10⁻⁷** (f32 precision limit).
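A parity check of this kind reduces to a maximum absolute element-wise difference over the two output vectors. A minimal sketch (the actual loading of Python and Rust outputs is up to the caller; the values below are placeholders):

```rust
// Max absolute element-wise difference between two f32 output vectors,
// the metric behind the parity figure above (sketch only).
fn max_abs_diff(a: &[f32], b: &[f32]) -> f32 {
    a.iter()
        .zip(b)
        .map(|(x, y)| (x - y).abs())
        .fold(0.0f32, f32::max)
}

fn main() {
    // Placeholder outputs; in practice these come from the two runtimes.
    let py = [0.123_456_7f32, -0.765_432_1];
    let rs = [0.123_456_8f32, -0.765_432_0];
    println!("max |diff| = {:e}", max_abs_diff(&py, &rs));
}
```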

## Build

```bash
cargo build --release                                         # CPU (NdArray)
cargo build --release --features blas-accelerate              # macOS Accelerate
cargo build --release --no-default-features --features metal  # Metal GPU
```

## Pretrained Weights

Available on [HuggingFace](https://huggingface.co/braindecode/eegpt-pretrained).

## Citation

```bibtex
@inproceedings{wang2024eegpt,
    title     = {{EEGPT}: Pretrained Transformer for Universal and Reliable Representation of {EEG} Signals},
    author    = {Wang, Guangyi and Liu, Wei and He, Yifan and Xu, Cheng and Ma, Liangyu and Li, He},
    booktitle = {Advances in Neural Information Processing Systems},
    volume    = {37},
    pages     = {39249--39280},
    year      = {2024}
}

@software{hauptmann2025eegptrustinference,
    title     = {eegpt-rs: {EEGPT} {EEG} Foundation Model Inference in Rust},
    author    = {Hauptmann, Eugene},
    year      = {2025},
    url       = {https://github.com/eugenehp/eegpt-rs},
    version   = {0.0.1}
}
```

## Author

[Eugene Hauptmann](https://github.com/eugenehp)

## License

Apache-2.0