Expand description
Structural character indexer — SIMD-powered bitmask generation (stage 1). Structural character indexer for HTML input.
Scans input in 64-byte blocks using SIMD dispatch, producing per-delimiter u64 bitmasks. Quote-aware masking (prefix XOR) ensures that delimiters inside quoted attribute values are not treated as structural.
This is stage 1 of the two-stage tokenizer pipeline (see crate docs).
Structs§
- Block
Bitmaps - Per-block bitmasks for each HTML delimiter type.
- Delimiter
Entry - A structural delimiter found by the indexer.
- Delimiter
Iter - Iterator over structural delimiter positions, yielded in ascending order.
- Structural
Index - Result of structural indexing: a sequence of
BlockBitmapscovering the entire input. - Structural
Indexer - SIMD-powered structural character indexer.