ai_tokenopt 0.5.8

Adaptive token optimization engine for LLM inference pipelines — compresses prompts, conversation history, tool schemas, and output streams to minimize token usage while preserving response quality.

Documentation

//! Prompt optimization — system prompt and RAG context

pub mod rag_cross_turn_dedup;
pub mod rag_dedup;
pub mod rag_ranker;
pub mod structured;
pub mod system_prompt;
pub mod template_loader;