GPU-First Embedded Analytics with SIMD Fallback
GPU-first embedded analytics database with graceful degradation: GPU → SIMD → Scalar
Features
- Cost-based dispatch: GPU only when compute > 5x transfer time
- Morsel-based paging: Out-of-core execution (128MB chunks)
- JIT WGSL compiler: Kernel fusion for single-pass execution
- GPU kernels: SUM, MIN, MAX, COUNT, AVG, fused filter+sum
- SIMD fallback: Trueno integration (AVX-512/AVX2/SSE2)
- SQL interface: SELECT, WHERE, aggregations, ORDER BY, LIMIT
Installation
[]
= "0.3"
# Optional: GPU acceleration
= { = "0.3", = ["gpu"] }
Quick Start
use ;
use StorageEngine;
async
Performance
SIMD Aggregation (1M rows, AMD Threadripper 7960X):
| Operation | SIMD | Scalar | Speedup |
|---|---|---|---|
| SUM | 228µs | 634µs | 2.78x |
| MIN | 228µs | 1,048µs | 4.60x |
| MAX | 228µs | 257µs | 1.13x |
| AVG | 228µs | 634µs | 2.78x |
Examples
Development
License
MIT