Skip to main content

Token

Enum Token 

Source
pub enum Token {
Show 16 variants ControlSeq(String), ActiveChar, LBrace, RBrace, MathShift, Alignment, Parameter, Superscript, Subscript, Star, LBracket, RBracket, Prime(usize), Whitespaces, Comment, Char(char),
}
Expand description

Token types for LaTeX lexical analysis.

This lexer recognizes LaTeX tokens based on character categories (catcode). It provides a simplified view where special characters and control sequences are identified, while preserving enough information for parsing.

Variants§

§

ControlSeq(String)

Control sequence: \command

  • catcode 0 (Escape): backslash triggers control sequence scanning
  • Matches: <letters> (control word) or <single-char> (control symbol)
  • Returns the command name without the backslash
§

ActiveChar

Active character: ~

  • catcode 13: Active Character
  • Treated as a command but without escape character
  • In LaTeX, ~ produces a non-breaking space
§

LBrace

Left brace: {

  • catcode 1: Begin Group
  • Used for grouping and delimiting arguments
§

RBrace

Right brace: }

  • catcode 2: End Group
  • Closes groups started by LBrace
§

MathShift

Dollar sign: $

  • catcode 3: Math Shift
  • Toggles inline math mode; $$ indicates display math
§

Alignment

Ampersand: &

  • catcode 4: Alignment Tab
  • Used in tables and alignment environments
§

Parameter

Hash/pound sign: #

  • catcode 6: Parameter
  • Used in macro definitions and arguments
§

Superscript

Caret: ^

  • catcode 7: Superscript
  • Indicates superscript in math mode
§

Subscript

Underscore: _

  • catcode 8: Subscript
  • Indicates subscript in math mode
§

Star

Star/Asterisk: *

  • catcode 12: Other
  • Used for starred command variants (e.g., \section*)
  • Must be checked immediately after command names
§

LBracket

Left bracket: [

  • catcode 12: Other
  • Often used for optional arguments
§

RBracket

Right bracket: ]

  • catcode 12: Other
  • Closes optional arguments
§

Prime(usize)

Prime mark(s): one or more ’ or U+2019

  • In math mode, represents derivative notation (f’ = f^\prime)
  • Multiple primes are common: f’‘, f’‘’
  • We store the count to simplify parser handling
§

Whitespaces

Whitespace: spaces, tabs, newlines, form feeds, non-breaking space

  • catcode 10: Spacer
  • Multiple consecutive whitespace characters are merged
  • Includes U+00A0 (non-breaking space) for copy-paste behavior
§

Comment

Comment: % to end of line

  • catcode 14: Comment
  • Lexer consumes everything from % to line end (inclusive)
  • Comments are discarded and do not produce tokens
§

Char(char)

Regular character: letters, digits, punctuation, Unicode (excluding invalid chars)

  • catcode 11: Letter (a-z, A-Z)
  • catcode 12: Other (digits, punctuation, etc.)
  • Matches any single printable character not covered by above patterns
  • Has lowest priority (1) to act as fallback

Note: Control characters (catcode 9, 15) are NOT matched by any pattern and will cause lexing errors automatically:

  • catcode 9 (Ignore): \x00-\x08, \x0B-\x1F (control chars except \t, \n, \f)
  • catcode 15 (Invalid): \x7F (DEL character)

Trait Implementations§

Source§

impl Clone for Token

Source§

fn clone(&self) -> Token

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for Token

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Display for Token

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<'s> Logos<'s> for Token

Source§

type Error = ()

Error type returned by the lexer. This can be set using #[logos(error = MyError)]. Defaults to () if not set.
Source§

type Extras = ()

Associated type Extras for the particular lexer. This can be set using #[logos(extras = MyExtras)] and accessed inside callbacks.
Source§

type Source = str

Source type this token can be lexed from. This will default to str, unless one of the defined patterns explicitly uses non-unicode byte values or byte slices, in which case that implementation will use [u8].
Source§

fn lex(lex: &mut Lexer<'s, Self>)

The heart of Logos. Called by the Lexer. The implementation for this function is generated by the logos-derive crate.
Source§

fn lexer(source: &'source Self::Source) -> Lexer<'source, Self>
where Self::Extras: Default,

Create a new instance of a Lexer that will produce tokens implementing this Logos.
Source§

fn lexer_with_extras( source: &'source Self::Source, extras: Self::Extras, ) -> Lexer<'source, Self>

Create a new instance of a Lexer with the provided Extras that will produce tokens implementing this Logos.
Source§

impl PartialEq for Token

Source§

fn eq(&self, other: &Token) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl StructuralPartialEq for Token

Auto Trait Implementations§

§

impl Freeze for Token

§

impl RefUnwindSafe for Token

§

impl Send for Token

§

impl Sync for Token

§

impl Unpin for Token

§

impl UnsafeUnpin for Token

§

impl UnwindSafe for Token

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<'src, T> IntoMaybe<'src, T> for T
where T: 'src,

Source§

type Proj<U: 'src> = U

Source§

fn map_maybe<R>( self, _f: impl FnOnce(&'src T) -> &'src R, g: impl FnOnce(T) -> R, ) -> <T as IntoMaybe<'src, T>>::Proj<R>
where R: 'src,

Source§

impl<T> OrderedSeq<'_, T> for T
where T: Clone,

Source§

impl<'p, T> Seq<'p, T> for T
where T: Clone,

Source§

type Item<'a> = &'a T where T: 'a

The item yielded by the iterator.
Source§

type Iter<'a> = Once<&'a T> where T: 'a

An iterator over the items within this container, by reference.
Source§

fn seq_iter(&self) -> <T as Seq<'p, T>>::Iter<'_>

Iterate over the elements of the container.
Source§

fn contains(&self, val: &T) -> bool
where T: PartialEq,

Check whether an item is contained within this sequence.
Source§

fn to_maybe_ref<'b>(item: <T as Seq<'p, T>>::Item<'b>) -> Maybe<T, &'p T>
where 'p: 'b,

Convert an item of the sequence into a MaybeRef.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.