Skip to main content

Module structural

Module structural 

Source
Expand description

Structural character indexer — SIMD-powered bitmask generation (stage 1). Structural character indexer for HTML input.

Scans input in 64-byte blocks using SIMD dispatch, producing per-delimiter u64 bitmasks. Quote-aware masking (prefix XOR) ensures that delimiters inside quoted attribute values are not treated as structural.

This is stage 1 of the two-stage tokenizer pipeline (see crate docs).

Structs§

BlockBitmaps
Per-block bitmasks for each HTML delimiter type.
DelimiterEntry
A structural delimiter found by the indexer.
DelimiterIter
Iterator over structural delimiter positions, yielded in ascending order.
StructuralIndex
Result of structural indexing: a sequence of BlockBitmaps covering the entire input.
StructuralIndexer
SIMD-powered structural character indexer.