Skip to main content

SearchIndex

Struct SearchIndex 

Source
pub struct SearchIndex { /* private fields */ }
Expand description

Inverted word index for fast entity search.

For each entity, we tokenize its name, type, and observations, store each token → set of matching entity indices.

Uses a flat Vec<(StrId, u32)> sorted by (token, entity_idx) for cache-friendly lookups via binary search.

Implementations§

Source§

impl SearchIndex

Source

pub fn new() -> Self

Source

pub fn clear(&mut self)

Source

pub const fn len(&self) -> usize

Source

pub const fn is_empty(&self) -> bool

Source

pub fn index_entity( &mut self, interner: &mut StringInterner, entity_idx: u32, name: StrId, entity_type: StrId, observations: &[StrId], )

Index a single entity by its name, type, and observations. All strings must already be interned. entity_idx is the position in the entity storage vec.

Source

pub fn index_additional( &mut self, interner: &mut StringInterner, entity_idx: u32, texts: &[StrId], )

Incrementally index additional strings (e.g. newly added observations) for an entity that is already indexed, without removing and rebuilding its existing entries (P3). Token entries that already exist are deduped during the merge, so calling this with text that overlaps existing tokens is safe.

Source

pub fn remove_entity(&mut self, entity_idx: u32)

Remove all entries for a given entity (before re-indexing).

Source

pub fn search(&self, query: &str, interner: &StringInterner) -> Vec<u32>

Search for entities whose name/type/observation tokens match query case-insensitively by prefix ("cof" matches "coffee").

Note: this is an O(n) scan over every index entry. The binary-search step below only narrows exact-token hits, but the subsequent prefix scan already covers those (an exact match is also a prefix match), so the scan dominates — do not read the binary search as making this sublinear.

Source

pub fn search_ranked( &self, query: &str, interner: &StringInterner, ) -> Vec<(u32, u32)>

Like [search], but returns (entity_idx, score) pairs sorted by descending score (then ascending idx for stability). score is the number of indexed-token hits the entity accumulated for the query — a cheap relevance proxy so callers can surface the best matches first.

The scan is a single linear pass over the flat entries vec (no per-entity allocation until the final compaction), keeping it cache-friendly. A small Vec<(idx, score)> is gathered then sorted.

Trait Implementations§

Source§

impl Clone for SearchIndex

Source§

fn clone(&self) -> SearchIndex

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Default for SearchIndex

Source§

fn default() -> Self

Returns the “default value” for a type. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> FromRef<T> for T
where T: Clone,

Source§

fn from_ref(input: &T) -> T

Converts to this type from a reference to the input type.
Source§

impl<A, B, T> HttpServerConnExec<A, B> for T
where B: Body,

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more