Skip to main content

EnsembleNER

Struct EnsembleNER 

Source
pub struct EnsembleNER { /* private fields */ }
Expand description

Ensemble NER that runs ALL backends and resolves conflicts via weighted voting.

Unlike StackedNER (priority-based cascade), EnsembleNER:

  1. Runs ALL backends in parallel (conceptually)
  2. Groups overlapping spans into conflict clusters
  3. Resolves via weighted voting with type-conditioned weights
  4. Applies agreement bonus when multiple backends agree

§When to Use

  • EnsembleNER: Maximum accuracy, latency not critical
  • StackedNER: Production, predictable latency, early exit

§Example

use anno::{EnsembleNER, Model, RegexNER, HeuristicNER};

// Default: uses all available backends
let ensemble = EnsembleNER::new();

// Custom: specific backends
let custom = EnsembleNER::with_backends(vec![
    Box::new(RegexNER::new()),
    Box::new(HeuristicNER::new()),
]);

let entities = custom.extract_entities("Contact us at test@example.com", None)?;

§Algorithm

  1. Collect candidates: Run each backend, tag results with provenance
  2. Cluster overlaps: Group candidates with >50% span overlap
  3. Weighted vote: Score each candidate by backend_weight * confidence
  4. Agreement bonus: Add +0.10 when 2+ backends agree on type
  5. Select winner: Highest weighted score wins the cluster

Implementations§

Source§

impl EnsembleNER

Source

pub fn new() -> Self

Create ensemble with all available backends.

Source

pub fn with_backends(backends: Vec<Box<dyn Model + Send + Sync>>) -> Self

Create with custom backends.

Source

pub fn with_weights(self, weights: HashMap<String, BackendWeight>) -> Self

Set custom backend weights.

Source

pub fn with_agreement_bonus(self, bonus: f64) -> Self

Set the agreement bonus (added when multiple backends agree).

Source

pub fn with_min_confidence(self, min: f64) -> Self

Set minimum confidence threshold.

Trait Implementations§

Source§

impl BatchCapable for EnsembleNER

Source§

fn optimal_batch_size(&self) -> Option<usize>

Get the optimal batch size for this model, if applicable. Read more
Source§

fn extract_entities_batch( &self, texts: &[&str], language: Option<&str>, ) -> Result<Vec<Vec<Entity>>>

Extract entities from multiple texts in a batch. Read more
Source§

impl Default for EnsembleNER

Source§

fn default() -> Self

Returns the “default value” for a type. Read more
Source§

impl Model for EnsembleNER

Source§

fn extract_entities( &self, text: &str, language: Option<&str>, ) -> Result<Vec<Entity>>

Extract entities from text.
Source§

fn supported_types(&self) -> Vec<EntityType>

Get supported entity types.
Source§

fn is_available(&self) -> bool

Check if model is available and ready.
Source§

fn name(&self) -> &'static str

Get the model name/identifier.
Source§

fn description(&self) -> &'static str

Get a description of the model.
Source§

fn capabilities(&self) -> ModelCapabilities

Get capability summary for this model. Read more
Source§

fn version(&self) -> String

Get a version identifier for the model configuration/weights. Read more
Source§

impl StreamingCapable for EnsembleNER

Source§

fn recommended_chunk_size(&self) -> usize

Get the recommended chunk size for streaming extraction. Read more
Source§

fn extract_entities_streaming( &self, chunk: &str, offset: usize, ) -> Result<Vec<Entity>>

Extract entities from a chunk of text, adjusting offsets by the chunk’s position. Read more
Source§

impl NamedEntityCapable for EnsembleNER

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> PolicyExt for T
where T: ?Sized,

Source§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow. Read more
Source§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more