moeflux 0.1.0-pre.3

Pure-Rust streaming-experts MoE inference on Metal. Forked from flash-moe; only the Metal kernels remain from upstream.
1
2
3
4
5
6
7
8
9
pub mod embedding;
pub mod expert_io_mode;
pub mod expert_io;
pub mod gpu_lm_head;
pub mod layer_weight_cache;
pub mod lm_head;
pub mod mtl_weight_buf;
pub mod prefetch;
pub mod weight_file;