pub fn clean(raw_html: &str, max_chars: usize) -> CleanResult
Clean raw HTML into LLM-ready plain text.
Standalone and synchronous: no network access or configuration is required. Uses the two-stage pipeline: HTML noise removal, then text sterilization.
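
To make the two-stage idea concrete, here is a minimal std-only sketch of what such a pipeline could look like. It is not this crate's implementation: `strip_html_noise`, `sterilize_text`, and `sketch_clean` are hypothetical names, the sketch returns a plain `String` because the fields of `CleanResult` are not shown here, and treating `max_chars` as a cap on the number of output characters is an assumption.

```rust
/// Stage 1 (assumed behaviour): drop tags and the contents of
/// <script>/<style> elements, leaving a space where each tag was.
fn strip_html_noise(raw_html: &str) -> String {
    // ASCII lowercasing keeps byte offsets identical to the original.
    let lower = raw_html.to_ascii_lowercase();
    let mut out = String::new();
    let mut pos = 0;
    while let Some(rel) = raw_html[pos..].find('<') {
        let open = pos + rel;
        out.push_str(&raw_html[pos..open]);
        out.push(' ');
        // Where to look for the '>' that ends the span being skipped.
        let mut scan_from = open;
        for name in ["script", "style"].iter() {
            if lower[open + 1..].starts_with(name) {
                // Skip the whole element, up to its closing tag.
                let close_tag = format!("</{}", name);
                scan_from = match lower[open..].find(&close_tag) {
                    Some(c) => open + c,
                    None => raw_html.len(),
                };
                break;
            }
        }
        pos = match raw_html[scan_from..].find('>') {
            Some(g) => scan_from + g + 1,
            None => raw_html.len(),
        };
    }
    out.push_str(&raw_html[pos..]);
    out
}

/// Stage 2 (assumed behaviour): remove control characters, collapse
/// whitespace runs to single spaces, and cap the length in characters.
fn sterilize_text(text: &str, max_chars: usize) -> String {
    let collapsed = text
        .split_whitespace()
        .map(|w| w.chars().filter(|c| !c.is_control()).collect::<String>())
        .filter(|w| !w.is_empty())
        .collect::<Vec<_>>()
        .join(" ");
    // Truncate on character boundaries, never in the middle of a code point.
    collapsed.chars().take(max_chars).collect()
}

/// Hypothetical stand-in for the pipeline: stage 1, then stage 2.
pub fn sketch_clean(raw_html: &str, max_chars: usize) -> String {
    sterilize_text(&strip_html_noise(raw_html), max_chars)
}
```

Under these assumptions, a call such as `sketch_clean("<h1>Title</h1><p>Hello,\n   world</p>", 100)` yields `"Title Hello, world"`. A real HTML cleaner would normally use a proper parser rather than scanning for `<` and `>`, and would also decode entities; the sketch only illustrates how the two stages compose.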