Struct BuiltInEmbedder 

pub struct BuiltInEmbedder { /* private fields */ }

A TF-IDF vectorizer that works entirely offline with no external dependencies. Call BuiltInEmbedder::fit on a corpus to build the vocabulary and IDF weights, then call Embedder::embed to compute vectors.

Implementations

impl BuiltInEmbedder

pub fn new() -> Self

Create an empty embedder. You must call Self::fit before embedding.

pub fn fit(&mut self, documents: &[&str])

Build the vocabulary and IDF weights from a corpus of documents.

The vocabulary is capped at 1024 terms. Terms are ranked by document frequency, so the terms that appear in the most documents are selected first.
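
The selection rule can be illustrated with a small standalone sketch. This is not the crate's implementation: the helper names `tokenize` and `build_vocabulary`, the tokenization rule, the alphabetical tie-break, and the smoothed IDF formula are all assumptions made for illustration.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical tokenizer: lowercase, split on non-alphanumeric characters.
fn tokenize(text: &str) -> Vec<String> {
    text.split(|c: char| !c.is_alphanumeric())
        .filter(|t| !t.is_empty())
        .map(|t| t.to_lowercase())
        .collect()
}

/// Build a vocabulary of at most `max_terms` terms, ordered by descending
/// document frequency, paired with an IDF weight for each term.
fn build_vocabulary(documents: &[&str], max_terms: usize) -> Vec<(String, f32)> {
    // For each term, count the number of documents that contain it.
    let mut doc_freq: HashMap<String, usize> = HashMap::new();
    for doc in documents {
        let unique: HashSet<String> = tokenize(doc).into_iter().collect();
        for term in unique {
            *doc_freq.entry(term).or_insert(0) += 1;
        }
    }

    // Most widely occurring terms first. Ties are broken alphabetically
    // (an assumption) so the result is deterministic.
    let mut terms: Vec<(String, usize)> = doc_freq.into_iter().collect();
    terms.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    terms.truncate(max_terms);

    let n = documents.len() as f32;
    terms
        .into_iter()
        .map(|(term, df)| {
            // Smoothed IDF (an assumption): ln((1 + N) / (1 + df)) + 1.
            let idf = ((1.0 + n) / (1.0 + df as f32)).ln() + 1.0;
            (term, idf)
        })
        .collect()
}
```

With a cap of 1024, `build_vocabulary(&docs, 1024)` would keep every term of a small corpus. A term that appears in every document receives the lowest weight under this formula.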

Trait Implementations

impl Debug for BuiltInEmbedder

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for BuiltInEmbedder

fn default() -> Self

Returns the “default value” for a type.

impl Embedder for BuiltInEmbedder

fn embed(&self, text: &str) -> PunchResult<Vec<f32>>

Compute an embedding vector for a single piece of text.
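
As a rough illustration of what a TF-IDF embedding step involves, the sketch below turns a text into a weighted vector given a fitted vocabulary. It is a standalone example, not the crate's code: the function name `tfidf_vector`, the whitespace tokenization, the length-normalized term frequency, and the final L2 normalization are assumptions.

```rust
use std::collections::HashMap;

/// Hypothetical TF-IDF vectorization. `vocab` maps each known term to its
/// vector index, and `idf[i]` is the weight of the term stored at index `i`.
fn tfidf_vector(text: &str, vocab: &HashMap<String, usize>, idf: &[f32]) -> Vec<f32> {
    let mut vector = vec![0.0_f32; idf.len()];

    // Count occurrences of vocabulary terms; unknown terms are ignored.
    let mut total_tokens = 0usize;
    for token in text.split_whitespace() {
        total_tokens += 1;
        if let Some(&index) = vocab.get(&token.to_lowercase()) {
            if let Some(slot) = vector.get_mut(index) {
                *slot += 1.0;
            }
        }
    }
    if total_tokens == 0 {
        return vector; // empty input yields the zero vector
    }

    // Term frequency (count / total tokens) multiplied by the IDF weight.
    for (value, weight) in vector.iter_mut().zip(idf) {
        *value = (*value / total_tokens as f32) * *weight;
    }

    // L2-normalize (an assumption) so that cosine similarity between two
    // vectors reduces to a dot product.
    let norm = vector.iter().map(|v| v * v).sum::<f32>().sqrt();
    if norm > 0.0 {
        for value in vector.iter_mut() {
            *value /= norm;
        }
    }
    vector
}
```

In this sketch a text made up only of out-of-vocabulary terms maps to the zero vector. Whether BuiltInEmbedder behaves the same way is not stated here.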

fn embed_batch(&self, texts: &[&str]) -> PunchResult<Vec<Vec<f32>>>

Compute embedding vectors for a batch of texts.
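
One common way to build a batch method on top of a fallible single-text method is to map over the inputs and collect into a `Result`, which stops at the first error. The sketch below shows that pattern in isolation. `SketchError`, `SketchResult`, `embed_one`, and `embed_many` are hypothetical stand-ins rather than the crate's types, and the toy "embedding" and its failure condition exist only to make the example run.

```rust
/// Hypothetical stand-ins for the crate's error and result types.
#[derive(Debug, PartialEq)]
struct SketchError(String);
type SketchResult<T> = Result<T, SketchError>;

/// Toy single-text embedder: returns [character count, word count].
/// Rejecting blank input is an arbitrary choice made for this sketch.
fn embed_one(text: &str) -> SketchResult<Vec<f32>> {
    if text.trim().is_empty() {
        return Err(SketchError("blank input".to_string()));
    }
    Ok(vec![
        text.chars().count() as f32,
        text.split_whitespace().count() as f32,
    ])
}

/// Batch variant: embeds each text in order and returns the first error,
/// if any, instead of a partial result.
fn embed_many(texts: &[&str]) -> SketchResult<Vec<Vec<f32>>> {
    texts.iter().map(|text| embed_one(text)).collect()
}
```

Collecting into `Result<Vec<_>, _>` keeps the output order aligned with the input order, so the vector at position `i` corresponds to `texts[i]`.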

fn dimensions(&self) -> usize

The dimensionality of vectors produced by this embedder.

Auto Trait Implementations

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> PolicyExt for T
where T: ?Sized,

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow.

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow.

impl<T> Same for T

type Output = T

Should always be Self

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.