ai_tokenopt 0.5.8

Adaptive token optimization engine for LLM inference pipelines — compresses prompts, conversation history, tool schemas, and output streams to minimize token usage while preserving response quality.
Documentation
1
2
3
4
5
6
7
8
//! Prompt optimization — system prompt and RAG context

pub mod rag_cross_turn_dedup;
pub mod rag_dedup;
pub mod rag_ranker;
pub mod structured;
pub mod system_prompt;
pub mod template_loader;