Expand description
Statistical token importance filtering (LLMLingua-inspired, model-free)
This module implements a compression strategy similar to LLMLingua but using pure statistical heuristics instead of model-based perplexity scoring.
Enhanced with token-aware semantic preservation:
- Protects code blocks, JSON, paths, identifiers
- Contextual stopword filtering
- Preserves negations, comparators, domain terms
Structsยง
- Statistical
Filter - Statistical token filter (model-free alternative to LLMLingua)
- Statistical
Filter Config - Configuration for statistical filtering
- Word
Importance - Importance score for a word based on statistical features