v0.7.0 H7 — dimension-aware outcome of a recall-time cosine comparison
between a live query embedding and a stored embedding whose producing
model may have changed since the row was written.
Retrieval role of a text handed to the embedder. Drives the
asymmetric task-instruction prefix for backends that require one
(Ollama nomic-embed-text-v1.5); symmetric backends (the in-process
candle MiniLM-L6-v2) ignore it. See #1520.
v0.7.0 F6 — soft cap on the input size handed to the embedder.
64 KiB matches the F10 store-path threshold so a single content
blob that the embedder can’t realistically process is reported as
Skipped("content > 65536 bytes") rather than blowing up the
chat/embed RPC. Operators who want larger embeddings can grow this
constant alongside the F10 HTTP threshold.
Number of f32 elements encoded in bytes, regardless of header presence.
Used by the dim_violations stats path to compute per-row dim without
allocating a Vec<f32>.
#1595 — single source of the EMBED_MAX_BYTES oversize check +
its human-readable skip reason. Some(reason) when byte_len
exceeds the cap, None otherwise. Shared by
Embedder::embed_with_status (store path) and the backfill /
reembed sweeps so the client-side guard and its WARN text can never
drift between the write-time and batch paths.