Crate readabilityrs

Crate readabilityrs 

Source
Expand description

§ReadabilityRS

A Rust port of Mozilla’s Readability library for extracting article content from web pages.

This library is a faithful port of the Mozilla Readability JavaScript library, used in Firefox Reader View.

§Example

use readabilityrs::{Readability, ReadabilityOptions};

let html = r#"<html><body><article><h1>Title</h1><p>Content...</p></article></body></html>"#;
let url = "https://example.com/article";

let options = ReadabilityOptions::default();
let readability = Readability::new(html, Some(url), Some(options)).unwrap();

if let Some(article) = readability.parse() {
    println!("Title: {:?}", article.title);
    println!("Content: {:?}", article.content);
}

Structs§

Article
Represents a successfully parsed article
Readability
The main Readability parser
ReadabilityOptions
Configuration options for the Readability parser

Enums§

ReadabilityError
Errors that can occur during readability parsing

Functions§

is_probably_readerable
Quick check to determine if a document is likely to be readerable Returns true if Readability.parse() is likely to succeed

Type Aliases§

Result
Result type alias for readability operations