Expand description
Deterministic hashed bag-of-words embedder.
Not semantic (no synonyms) — surface-token overlap only. It exists so the whole pipeline builds, runs, and tests with zero network/model, and as a fallback on machines without the fastembed feature. The hybrid keyword boost in the ranker compensates for its lack of semantics on exact terms.