boostr 0.1.0

ML framework built on numr - attention, quantization, model architectures
Documentation
1
2
3
4
5
6
7
pub mod decode_attention;
pub mod flash;
pub mod flash_helpers;
pub mod fused_qkv;
pub mod mla;
pub mod paged_attention;
pub mod varlen_attention;