Parsing, filtering, selecting and serializing HTML/XML markup.
See the project ../README for a feature overview.
Mutating visitor support for
Support for HTML5 parsing to
Support for XML parsing to
Compose a new filter closure by chaining a list of one or more closures or
function paths. Each is executed in order, while the returned action remains
A tag attribute, e.g.
A depth-first iterator returned by
A DOM-like container for a tree of markup elements and text.
Document type definition details.
A markup element with name and attributes.
A set of confidence-weighted evidence that a text document is in a particular encoding.
A typed node (e.g. text, element, etc.) within a
Processing instruction details.
A fully qualified name (with a namespace), used to represent the names of tags and attributes.
A selecting iterator returned by
The node kind and payload data associated with that kind.
Recommended confidence for hints based on a byte-order mark (BOM) at the start of a document stream.
Recommended confidence for an initial default encoding.
Recommended confidence for the sum of all hints from meta elements within an HTML head.
Recommended confidence for a hint from an HTTP Content-Type header with a charset parameter.
Initial parse buffer size in which encoding hints are considered, possibly triggering reparse.
Subsequent parse buffer size used for reading and parsing, after
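
The confidence-weighted encoding hints and the recommended confidence constants listed above can be pictured with a small, standalone sketch. This is not the crate's own API: the `Hints` type, its `add` and `top` methods, the constant names and the numeric values are all hypothetical, chosen only to show how weights from several sources might accumulate per encoding and how the strongest candidate is then picked.

```rust
use std::collections::HashMap;

// Hypothetical confidence values, for illustration only.
// They are NOT the values defined by the crate.
const DEFAULT_CONF: f32 = 0.01; // initial default encoding
const HTTP_CONF: f32 = 0.10; // HTTP Content-Type header with charset
const META_CONF: f32 = 0.20; // sum of hints from meta elements in the head
const BOM_CONF: f32 = 0.30; // byte-order mark at the start of the stream

/// Hypothetical container that accumulates confidence per encoding label.
#[derive(Default)]
struct Hints {
    weights: HashMap<String, f32>,
}

impl Hints {
    /// Add `conf` to the running total for `encoding` (case-insensitive).
    fn add(&mut self, encoding: &str, conf: f32) {
        *self
            .weights
            .entry(encoding.to_ascii_lowercase())
            .or_insert(0.0) += conf;
    }

    /// Return the encoding with the highest total confidence, if any.
    fn top(&self) -> Option<&str> {
        self.weights
            .iter()
            .max_by(|a, b| a.1.total_cmp(b.1))
            .map(|(name, _)| name.as_str())
    }
}

/// Combine hints from four sources and report the winning encoding.
fn example_top() -> Option<String> {
    let mut hints = Hints::default();
    hints.add("utf-8", DEFAULT_CONF);
    hints.add("windows-1252", HTTP_CONF);
    hints.add("windows-1252", META_CONF);
    hints.add("UTF-8", BOM_CONF);
    hints.top().map(str::to_owned)
}
```

In this sketch the header and meta hints together give windows-1252 a total of 0.30, while the default plus the BOM hint give UTF-8 a total of 0.31, so `example_top` reports `utf-8`. Summing weights rather than keeping only the latest hint lets several weak, agreeing sources outweigh a single stronger one.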