Struct regex_syntax::hir::literal::Seq


pub struct Seq { /* private fields */ }


A sequence of literals.

A Seq is very much like a set in that it represents a union of its members. That is, it corresponds to a set of literals where at least one must match in order for a particular Hir expression to match. (Whether this corresponds to the entire Hir expression, a prefix of it or a suffix of it depends on how the Seq was extracted from the Hir.)

It is also unlike a set in that multiple identical literals may appear, and that the order of the literals in the Seq matters. For example, if the sequence is [sam, samwise] and leftmost-first matching is used, then samwise can never match and the sequence is equivalent to [sam].
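The effect of ordering can be seen in a minimal sketch that uses only the standard library. It is a toy model of leftmost-first preference at a single position, not the crate's matching engine, and the helper `first_match_at_start` is a hypothetical name introduced for illustration.

```rust
/// Returns the first literal in `seq` (in preference order) that matches at
/// the start of `haystack`. Toy model only: real regex engines also search
/// across positions, which this helper does not attempt.
fn first_match_at_start<'a>(seq: &[&'a str], haystack: &str) -> Option<&'a str> {
    seq.iter().copied().find(|lit| haystack.starts_with(*lit))
}

fn main() {
    let seq = ["sam", "samwise"];
    // "sam" is listed first and is a prefix of "samwise", so it always wins.
    assert_eq!(first_match_at_start(&seq, "samwise gamgee"), Some("sam"));
    assert_eq!(first_match_at_start(&seq, "sam"), Some("sam"));

    // With the order reversed, the longer literal gets a chance to match.
    let reversed = ["samwise", "sam"];
    assert_eq!(first_match_at_start(&reversed, "samwise gamgee"), Some("samwise"));
}
```

Because every haystack that starts with samwise also starts with sam, the second entry of [sam, samwise] is unreachable under this preference rule.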

States of a sequence

A Seq has a few different logical states to consider:

The sequence can represent “any” literal. When this happens, the set does not have a finite size. The purpose of this state is to inhibit callers from making assumptions about what literals are required in order to match a particular Hir expression. Generally speaking, when a set is in this state, literal optimizations are inhibited. A good example of a regex that will cause this sort of set to appear is [A-Za-z]. The character class is just too big (and also too narrow) to be usefully expanded into 52 different literals. (Note that the decision for when a seq should become infinite is determined by the caller. A seq itself has no hard-coded limits.)
The sequence can be empty, in which case, it is an affirmative statement that there are no literals that can match the corresponding Hir. Consequently, the Hir never matches any input. For example, [a&&b].
The sequence can be non-empty, in which case, at least one of the literals must match in order for the corresponding Hir to match.
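The three states can be told apart with a small standard-library sketch. It assumes an infinite sequence is modelled as None and a finite one as Some(slice), mirroring the Option in the signature of literals on this page. The SeqState enum and the classify function are hypothetical names, not part of regex-syntax.

```rust
/// Hypothetical labels for the three logical states described above.
#[derive(Debug, PartialEq)]
enum SeqState {
    /// "Any" literal: no finite set of literals is known.
    Infinite,
    /// Finite with zero literals: the expression can never match.
    Empty,
    /// Finite with this many literals: at least one must match.
    NonEmpty(usize),
}

/// Classify a toy sequence, where `None` stands for the infinite state.
fn classify(literals: Option<&[&str]>) -> SeqState {
    match literals {
        None => SeqState::Infinite,
        Some(lits) if lits.is_empty() => SeqState::Empty,
        Some(lits) => SeqState::NonEmpty(lits.len()),
    }
}

fn main() {
    let no_lits: &[&str] = &[];
    assert_eq!(classify(None), SeqState::Infinite);
    assert_eq!(classify(Some(no_lits)), SeqState::Empty);
    assert_eq!(classify(Some(&["foo", "bar"][..])), SeqState::NonEmpty(2));
}
```

The point of the sketch is that "empty" and "infinite" are opposite claims: the first says nothing can match, the second says nothing is known about what must match.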

Example

This example shows how literal sequences can be simplified by stripping suffixes and minimizing while maintaining preference order.

use regex_syntax::hir::literal::{Literal, Seq};

let mut seq = Seq::new(&[
    "farm",
    "appliance",
    "faraway",
    "apple",
    "fare",
    "gap",
    "applicant",
    "applaud",
]);
seq.keep_first_bytes(3);
seq.minimize_by_preference();
// Notice that 'far' comes before 'app', which matches the order in the
// original sequence. This guarantees that leftmost-first semantics are
// not altered by simplifying the set.
let expected = Seq::from_iter([
    Literal::inexact("far"),
    Literal::inexact("app"),
    Literal::exact("gap"),
]);
assert_eq!(expected, seq);
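To make the two steps in the example easier to follow, here is a rough standard-library model of what they do to the list. It is an illustration under stated assumptions, not the crate's implementation: ToyLit is a hypothetical type, the two functions only borrow the names of the Seq methods, the input is assumed to be ASCII, and the model does not attempt to reproduce how the crate adjusts exactness while minimizing.

```rust
/// A toy literal: its text plus whether it is exact (never truncated).
#[derive(Clone, Debug, PartialEq)]
struct ToyLit {
    text: String,
    exact: bool,
}

/// Truncate each literal to at most `len` bytes. A truncated literal becomes
/// inexact. Assumes ASCII input so that truncation lands on a char boundary.
fn keep_first_bytes(lits: &mut Vec<ToyLit>, len: usize) {
    for lit in lits.iter_mut() {
        if lit.text.len() > len {
            lit.text.truncate(len);
            lit.exact = false;
        }
    }
}

/// Drop every literal that has an earlier kept literal as a prefix, since
/// the earlier one would always be preferred. Order is preserved.
fn minimize_by_preference(lits: &mut Vec<ToyLit>) {
    let mut kept: Vec<ToyLit> = Vec::new();
    for lit in lits.drain(..) {
        if !kept.iter().any(|k| lit.text.starts_with(k.text.as_str())) {
            kept.push(lit);
        }
    }
    *lits = kept;
}

fn main() {
    let words = [
        "farm", "appliance", "faraway", "apple",
        "fare", "gap", "applicant", "applaud",
    ];
    let mut lits: Vec<ToyLit> = words
        .iter()
        .map(|w| ToyLit { text: w.to_string(), exact: true })
        .collect();
    keep_first_bytes(&mut lits, 3);
    minimize_by_preference(&mut lits);
    let summary: Vec<(&str, bool)> =
        lits.iter().map(|l| (l.text.as_str(), l.exact)).collect();
    // Same shape as the expected Seq above: far and app inexact, gap exact.
    assert_eq!(summary, vec![("far", false), ("app", false), ("gap", true)]);
}
```

In this model "gap" stays exact because it is already three bytes long, so truncation never touches it.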


Implementations

impl Seq

pub fn empty() -> Seq

pub fn infinite() -> Seq

pub fn singleton(lit: Literal) -> Seq

pub fn new<I, B>(it: I) -> Seq where I: IntoIterator<Item = B>, B: AsRef<[u8]>

pub fn literals(&self) -> Option<&[Literal]>

pub fn push(&mut self, lit: Literal)

pub fn make_inexact(&mut self)

pub fn make_infinite(&mut self)

pub fn cross_forward(&mut self, other: &mut Seq)

pub fn cross_reverse(&mut self, other: &mut Seq)

pub fn union(&mut self, other: &mut Seq)

pub fn union_into_empty(&mut self, other: &mut Seq)

pub fn dedup(&mut self)

pub fn sort(&mut self)

pub fn reverse_literals(&mut self)

pub fn minimize_by_preference(&mut self)

pub fn keep_first_bytes(&mut self, len: usize)

pub fn keep_last_bytes(&mut self, len: usize)

pub fn is_finite(&self) -> bool

pub fn is_empty(&self) -> bool

pub fn len(&self) -> Option<usize>

pub fn is_exact(&self) -> bool

pub fn is_inexact(&self) -> bool

pub fn min_literal_len(&self) -> Option<usize>

pub fn max_literal_len(&self) -> Option<usize>

pub fn longest_common_prefix(&self) -> Option<&[u8]>

pub fn longest_common_suffix(&self) -> Option<&[u8]>

pub fn optimize_for_prefix_by_preference(&mut self)

pub fn optimize_for_suffix_by_preference(&mut self)

Trait Implementations

impl Clone for Seq

fn clone(&self) -> Seq

fn clone_from(&mut self, source: &Self)

impl Debug for Seq

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl FromIterator<Literal> for Seq

fn from_iter<T: IntoIterator<Item = Literal>>(it: T) -> Seq

impl PartialEq<Seq> for Seq

fn eq(&self, other: &Seq) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Eq for Seq

impl StructuralEq for Seq

impl StructuralPartialEq for Seq

Auto Trait Implementations

impl RefUnwindSafe for Seq

impl Send for Seq

impl Sync for Seq

impl Unpin for Seq

impl UnwindSafe for Seq

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>

fn into(self) -> U

impl<T> ToOwned for T where T: Clone

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>