Module ner

Named Entity Recognition (NER) engine — CE-4 GLiNER zero-shot NER.

Two-layer extraction pipeline:

  1. Rule-based pre-pass: regex extraction of dates, URLs, UUIDs, emails, and IP addresses. Always on, negligible latency, no model download required.
  2. GLiNER ONNX engine: zero-shot NER via the INT8-quantized GLiNER-medium ONNX model (52 MB). Opt-in per namespace, lazy-loaded on first use.
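
The two layers above can be sketched roughly as follows. This is an illustrative sketch only, not this module's implementation: `Entity`, `DummyModel`, `Pipeline`, and `rule_pass` are hypothetical names, the rule pass uses simple token checks as a stand-in for the real regex patterns, and the "model" is a placeholder that is constructed lazily through `std::sync::OnceLock` the first time an opted-in namespace is processed.

```rust
use std::collections::HashSet;
use std::sync::OnceLock;

/// Hypothetical entity record; the fields of the real `ExtractedEntity`
/// are not shown on this page.
#[derive(Debug, Clone, PartialEq)]
pub struct Entity {
    pub kind: String,
    pub text: String,
}

/// Layer 1 stand-in: a cheap single pass over whitespace-separated tokens.
/// The real pre-pass uses regexes; this only recognizes URLs and emails.
pub fn rule_pass(text: &str) -> Vec<Entity> {
    let mut found = Vec::new();
    for raw in text.split_whitespace() {
        // Strip common surrounding punctuation, then any trailing periods.
        let token = raw
            .trim_matches(|c: char| matches!(c, ',' | ';' | '(' | ')' | '<' | '>'))
            .trim_end_matches('.');
        if token.starts_with("http://") || token.starts_with("https://") {
            found.push(Entity { kind: "url".into(), text: token.into() });
        } else if let Some((user, domain)) = token.split_once('@') {
            if !user.is_empty() && domain.contains('.') && !domain.contains('@') {
                found.push(Entity { kind: "email".into(), text: token.into() });
            }
        }
    }
    found
}

/// Layer 2 stand-in: a placeholder for a model-backed extractor.
pub struct DummyModel;

impl DummyModel {
    /// In a real engine this is where the model file would be loaded.
    fn load() -> Self {
        DummyModel
    }

    /// Placeholder logic: tags the literal token "Alice" as a person.
    fn extract(&self, text: &str) -> Vec<Entity> {
        text.split_whitespace()
            .filter(|w| *w == "Alice")
            .map(|w| Entity { kind: "person".into(), text: w.into() })
            .collect()
    }
}

/// Combines both layers; the model layer runs only for opted-in namespaces.
pub struct Pipeline {
    model_namespaces: HashSet<String>,
    model: OnceLock<DummyModel>,
}

impl Pipeline {
    pub fn new(opted_in: &[&str]) -> Self {
        Pipeline {
            model_namespaces: opted_in.iter().map(|s| s.to_string()).collect(),
            model: OnceLock::new(),
        }
    }

    /// True once the model layer has been initialized.
    pub fn model_loaded(&self) -> bool {
        self.model.get().is_some()
    }

    pub fn extract(&self, namespace: &str, text: &str) -> Vec<Entity> {
        // The rule-based layer always runs.
        let mut out = rule_pass(text);
        if self.model_namespaces.contains(namespace) {
            // Lazy load: the model is constructed on first use only.
            let model = self.model.get_or_init(DummyModel::load);
            out.extend(model.extract(text));
        }
        out
    }
}
```

With this shape, a namespace that never opts in pays only for the rule pass, and the model is never loaded for it.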

Extracted entities are stored as tags: entity:person:Alice, entity:org:Anthropic.
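
As a small illustration of the `entity:<type>:<value>` tag layout shown above, the following sketch builds and parses such tags. The helper names `entity_tag` and `parse_entity_tag` are hypothetical and not part of this module's API. Splitting only on the first colon after the `entity:` prefix is an assumption made here so that values containing colons (for example URLs) survive a round trip; how the real module handles that case is not documented on this page.

```rust
/// Build a tag of the form `entity:<kind>:<value>` (hypothetical helper).
pub fn entity_tag(kind: &str, value: &str) -> String {
    format!("entity:{kind}:{value}")
}

/// Parse a tag back into `(kind, value)`.
/// Returns `None` if the prefix is missing or either part is empty.
pub fn parse_entity_tag(tag: &str) -> Option<(&str, &str)> {
    let rest = tag.strip_prefix("entity:")?;
    // Split on the first colon only, so the value may itself contain colons.
    let (kind, value) = rest.split_once(':')?;
    if kind.is_empty() || value.is_empty() {
        return None;
    }
    Some((kind, value))
}
```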

Structs

ExtractedEntity
A single extracted entity.
GlinerEngine
GLiNER zero-shot NER engine backed by ONNX Runtime.
NerEngine
Unified NER engine combining rule-based and GLiNER extraction.

Functions

rule_based_extract
Run the rule-based pre-pass — O(n) regex scan, zero model overhead.