anno 0.9.0

NER, coreference resolution, relation extraction, PII detection, and zero-shot entity types
1
2
3
4
5
6
7
8
9
//! Document ingestion and text preparation.
//!
//! Text preprocessing (normalization, sanitization) lives here.
//! Format conversion (HTML, PDF) lives in the `deformat` crate.
//! URL resolution lives in the CLI crate (`anno-cli`).

pub mod preprocessor;

pub use preprocessor::{DocumentPreprocessor, PreparedDocument};