MetaEmbed-specific search helpers.
MetaEmbed (ICLR 2026) stores K pre-computed Meta Token vectors per document
instead of one vector per token. At query time, the Matryoshka ordering of
the query Meta Tokens lets a budget parameter trade recall for latency:
budget=K gives full accuracy; budget=1 is fastest.
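To make the budget trade-off concrete, here is a minimal sketch of budgeted MaxSim scoring. It is not this crate's implementation: the function names `dot` and `budgeted_maxsim`, the `Vec<f32>` representation, and the choice to apply the budget by keeping only the first `budget` query Meta Tokens (while comparing against all document vectors) are assumptions made for illustration.

```rust
/// Dot product of two vectors (assumed to have equal length).
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Hypothetical budgeted MaxSim: for each of the first `budget` query
/// Meta Tokens, take the best dot product against any document vector,
/// then sum those maxima. Assumes `doc` is non-empty.
fn budgeted_maxsim(query: &[Vec<f32>], doc: &[Vec<f32>], budget: usize) -> f32 {
    // Clamp the budget into 1..=K so out-of-range values stay well defined.
    let b = budget.clamp(1, query.len().max(1));
    query
        .iter()
        .take(b)
        .map(|q| {
            doc.iter()
                .map(|d| dot(q, d))
                .fold(f32::NEG_INFINITY, f32::max)
        })
        .sum()
}
```

Under these assumptions the cost grows linearly with the budget, since each additional query Meta Token adds one more pass over the document's vectors.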
meta_embed_search wires together:
- Optional PLAID candidate pruning.
- Budgeted MaxSim scoring over the remaining candidates.
- Top-k selection.
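The three stages above can be sketched as a single function. This is an illustrative outline only, not the signature of meta_embed_search: `search_sketch`, its parameters, and the in-memory `docs` slice are hypothetical, and the optional `candidates` list merely stands in for the output of PLAID pruning, which is not implemented here.

```rust
/// MaxSim over the first `budget` query vectors (illustrative helper).
fn maxsim(query: &[Vec<f32>], doc: &[Vec<f32>], budget: usize) -> f32 {
    query
        .iter()
        .take(budget.max(1))
        .map(|q| {
            doc.iter()
                .map(|d| q.iter().zip(d).map(|(x, y)| x * y).sum::<f32>())
                .fold(f32::NEG_INFINITY, f32::max)
        })
        .sum()
}

/// Hypothetical pipeline: optional candidate list, budgeted scoring, top-k.
fn search_sketch(
    query: &[Vec<f32>],
    docs: &[Vec<Vec<f32>>],
    candidates: Option<&[usize]>, // stand-in for PLAID pruning output
    budget: usize,
    top_k: usize,
) -> Vec<(usize, f32)> {
    // Stage 1: use the pruned candidates if given, else scan every document.
    let ids: Vec<usize> = match candidates {
        Some(c) => c.to_vec(),
        None => (0..docs.len()).collect(),
    };
    // Stage 2: score each remaining candidate, skipping out-of-range ids.
    let mut scored: Vec<(usize, f32)> = ids
        .into_iter()
        .filter(|&id| id < docs.len())
        .map(|id| (id, maxsim(query, &docs[id], budget)))
        .collect();
    // Stage 3: sort by descending score and keep the best `top_k`.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(top_k);
    scored
}
```

A full sort is used here for brevity; a bounded heap would avoid sorting every candidate when `top_k` is small relative to the candidate set.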
Functions
- meta_embed_search: Search a MultiVectorStore using budgeted MaxSim with optional PLAID candidate pruning.