Expand description
Interpreter — executes IR graphs directly on CPU.
Uses optimized kernels from the kernels module for compute-heavy
operations (matmul, rms_norm). Validates correctness and serves as
the primary inference path until the AOT codegen is ready.
Functions§
- forward
- Run a single forward pass for one token through the model.