wasmicro 0.3.0

Tiny transformer inference for the web. BERT, GPT-2 and T5 in a 199 KB WASM bundle.
//! Forward-only tensor operations.
//!
//! Every op is a free function. Inputs are borrowed (`&Tensor`), outputs are
//! either returned by value or written into a caller-provided `&mut Tensor`.
//! This gives callers full control over allocations, which is critical for
//! low-latency inference.
//!
//! Optimized paths (SIMD, blocked matmul, int8) will live alongside the
//! reference implementations and be selected at compile time via features.

pub mod activations;
pub mod attention;
pub mod elementwise;
pub mod embedding;
pub mod layernorm;
pub mod linear;
pub mod matmul;
pub mod quantized;
pub mod rms_norm;
pub mod softmax;
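
The calling convention described in the module docs (free functions, borrowed inputs, output either returned by value or written into a caller-provided buffer) can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Tensor` struct, its fields, and the `relu` / `relu_into` names are hypothetical stand-ins, not the crate's actual types or signatures.

```rust
/// Hypothetical stand-in for the crate's `Tensor`; the real type is not shown here.
#[derive(Debug, Clone, PartialEq)]
pub struct Tensor {
    pub data: Vec<f32>,
    pub shape: Vec<usize>,
}

impl Tensor {
    /// Allocates a zero-filled tensor with the given shape.
    pub fn zeros(shape: &[usize]) -> Self {
        let len: usize = shape.iter().product();
        Tensor { data: vec![0.0; len], shape: shape.to_vec() }
    }
}

/// Assumed "write into" variant: the caller owns `out`, so the op itself
/// performs no allocation and the buffer can be reused across calls.
pub fn relu_into(x: &Tensor, out: &mut Tensor) {
    assert_eq!(x.shape, out.shape, "shape mismatch");
    for (o, &v) in out.data.iter_mut().zip(x.data.iter()) {
        *o = v.max(0.0);
    }
}

/// Assumed "return by value" variant: allocates the output, then delegates.
pub fn relu(x: &Tensor) -> Tensor {
    let mut out = Tensor::zeros(&x.shape);
    relu_into(x, &mut out);
    out
}

fn main() {
    let x = Tensor { data: vec![-1.0, 0.5, 2.0, -3.0], shape: vec![2, 2] };

    // Convenience path: one allocation per call.
    let y = relu(&x);

    // Low-latency path: allocate once, reuse `buf` on every call.
    let mut buf = Tensor::zeros(&[2, 2]);
    relu_into(&x, &mut buf);

    assert_eq!(y, buf);
}
```

In a hot inference loop, the `_into` form lets the caller allocate scratch buffers once up front and reuse them for every token or batch, which is the allocation control the module docs refer to.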