ReadabilityError

Enum ReadabilityError 

Source
pub enum ReadabilityError {
    HtmlParseError(String),
    ReadabilityCheckFailed,
    ExtractionError(String),
    JsEvaluation {
        context: String,
        source: Error,
    },
    InvalidOptions(String),
}
Expand description

Errors that can occur during content extraction.

Variants§

§

HtmlParseError(String)

HTML could not be parsed (malformed, empty, etc.)

This typically occurs when:

  • HTML is severely malformed or incomplete
  • Empty or whitespace-only input
  • Input contains non-HTML content

§Examples

let reader = Readability::new()?;
// This will likely fail with HtmlParseError
let result = reader.parse("<not valid html>");
§

ReadabilityCheckFailed

Content failed internal readability checks

This usually means:

  • Page has too little readable content (< 140 characters by default)
  • Content couldn’t be reliably distinguished from navigation/ads
  • Page is mostly navigation, ads, or other non-content elements
  • Content has too high link density (likely navigation)

§What to do

Try lowering the char_threshold in ReadabilityOptions or check if the HTML actually contains substantial article content:

let options = ReadabilityOptions::new().char_threshold(50);
let reader = Readability::new()?;
let article = reader.parse_with_options(&html, None, Some(options))?;
§

ExtractionError(String)

Content extraction failed for other reasons

This is a catch-all error for unexpected extraction failures that don’t fit into other categories. Often indicates issues with the JavaScript execution environment or unexpected content structures.

§Examples

let reader = Readability::new()?;
match reader.parse(&html) {
    Err(ReadabilityError::ExtractionError(msg)) => {
        eprintln!("Extraction failed: {}", msg);
        // Maybe try with different options or fallback processing
    }
    Ok(article) => println!("Success: {}", article.title),
    Err(e) => eprintln!("Other error: {}", e),
}
§

JsEvaluation

JavaScript engine evaluation error

Occurs when the embedded JavaScript engine fails to execute Readability.js code. This could indicate:

  • Memory constraints
  • JavaScript syntax errors in the bundled code
  • Runtime exceptions in the JavaScript environment

§Examples

let reader = Readability::new()?;
match reader.parse(&html) {
    Err(ReadabilityError::JsEvaluation { context, source }) => {
        eprintln!("JavaScript error in {}: {}", context, source);
        // This usually indicates a bug - please report it!
    }
    Ok(article) => println!("Success: {}", article.title),
    Err(e) => eprintln!("Other error: {}", e),
}

Fields

§context: String
§source: Error
§

InvalidOptions(String)

Invalid input parameters (usually base URL)

This error occurs when:

  • Base URL has invalid format or unsupported scheme
  • URL uses dangerous schemes like javascript: or data:
  • URL is not HTTP(S) when validation is enabled

§Examples

let reader = Readability::new()?;
// This will fail with InvalidOptions
let result = reader.parse_with_url(&html, "javascript:alert('xss')");
assert!(matches!(result, Err(ReadabilityError::InvalidOptions(_))));

Trait Implementations§

Source§

impl Debug for ReadabilityError

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Display for ReadabilityError

Source§

fn fmt(&self, __formatter: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Error for ReadabilityError

Source§

fn source(&self) -> Option<&(dyn Error + 'static)>

Returns the lower-level source of this error, if any. Read more
1.0.0 · Source§

fn description(&self) -> &str

👎Deprecated since 1.42.0: use the Display impl or to_string()
1.0.0 · Source§

fn cause(&self) -> Option<&dyn Error>

👎Deprecated since 1.33.0: replaced by Error::source, which can support downcasting
Source§

fn provide<'a>(&'a self, request: &mut Request<'a>)

🔬This is a nightly-only experimental API. (error_generic_member_access)
Provides type-based access to context intended for error reports. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> ErasedDestructor for T
where T: 'static,

Source§

impl<T> ParallelSend for T