oxibonsai-kernels 0.1.4

1-bit Q1_0_g128 compute kernels (dequant, GEMV, GEMM) for OxiBonsai
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
//! Batched prefill (multi-token) GPU dispatch for OxiBonsai.
//!
//! Split into:
//! - `types`     — `PrefillBuffers`, `LayerWeightRefs`, `LayerConfig` (`pub(crate)`)
//! - `functions` — `MetalGraph` impl: encoder helpers + `encode_full_forward_prefill*`
//! - `functions_2` — public `try_metal_full_forward_prefill*` entry points

pub(crate) mod functions;
pub(crate) mod functions_2;
pub(crate) mod types;

pub(crate) use types::*;

pub use functions_2::*;