Struct html5ever::driver::Parser [] [src]

pub struct Parser<Sink> where Sink: TreeSink {
    pub tokenizer: Tokenizer<TreeBuilder<Sink::Handle, Sink>>,
    pub input_buffer: BufferQueue,
}

An HTML parser, ready to recieve Unicode input through the tendril::TendrilSink trait’s methods.

Fields

Methods

impl<Sink: TreeSink> Parser<Sink>
[src]

Wrap this parser into a TendrilSink that accepts UTF-8 bytes.

Use this when your input is bytes that are known to be in the UTF-8 encoding. Decoding is lossy, like String::from_utf8_lossy.

Wrap this parser into a TendrilSink that accepts bytes and tries to detect the correct character encoding.

Currently this looks for a Byte Order Mark, then uses BytesOpts::transport_layer_encoding, then falls back to UTF-8.

FIXME(https://github.com/servo/html5ever/issues/18): this should look for <meta> elements and other data per https://html.spec.whatwg.org/multipage/syntax.html#determining-the-character-encoding

Trait Implementations

impl<Sink: TreeSink> TendrilSink<UTF8> for Parser<Sink>
[src]

Process this tendril.

Indicates that an error has occurred.

What the overall result of processing is.

Indicates the end of the stream.

Process one tendril and finish.

Consume an iterator of tendrils, processing each item, then finish.

Read from the given stream of bytes until exhaustion and process incrementally, then finish. Return Err at the first I/O error. Read more

Read from the file at the given path and process incrementally, then finish. Return Err at the first I/O error. Read more