servo-fetch 0.2.0

A browser engine in a binary. Fetch, render, and extract web content powered by Servo.
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
//! Web content extraction library powered by Servo and Readability.
//!
//! This crate provides utilities for extracting readable content from HTML:
//!
//! - [`extract`] — Convert HTML into Markdown or structured JSON using
//!   Mozilla's Readability algorithm.
//! - [`layout`] — CSS layout heuristics to detect and strip navbars,
//!   sidebars, and footers before extraction.
//! - [`sanitize`] — Strip ANSI escape sequences and control characters
//!   from output strings.

#![forbid(unsafe_code)]
#![warn(missing_docs)]

pub mod extract;
pub mod layout;
pub mod sanitize;