1 2 3 4 5 6
//! LLM-specific code like tokenization and kv-cache logic which (currently) has to live outside //! the model graph. pub mod models; pub mod run; pub mod serve; pub mod utils;