pub struct TextSpan {
pub byte_start: usize,
pub byte_end: usize,
pub char_start: usize,
pub char_end: usize,
}Expand description
A text span with both byte and character offsets.
This is the canonical representation for entity positions. Store both to avoid repeated conversion.
Fields§
§byte_start: usizeByte offset (start, inclusive)
byte_end: usizeByte offset (end, exclusive)
char_start: usizeCharacter offset (start, inclusive)
char_end: usizeCharacter offset (end, exclusive)
Implementations§
Source§impl TextSpan
impl TextSpan
Sourcepub fn from_bytes(text: &str, byte_start: usize, byte_end: usize) -> Self
pub fn from_bytes(text: &str, byte_start: usize, byte_end: usize) -> Self
Create a span from byte offsets, computing char offsets from text.
§Arguments
text- The full text (needed to compute char offsets)byte_start- Byte offset start (inclusive)byte_end- Byte offset end (exclusive)
§Example
use anno::offset::TextSpan;
let text = "Price €50";
// "Price " = 6 bytes, € = 3 bytes, "50" = 2 bytes = 11 total
let span = TextSpan::from_bytes(text, 6, 11); // "€50"
assert_eq!(span.char_start, 6);
assert_eq!(span.char_end, 9); // € is 1 char but 3 bytesSourcepub fn from_chars(text: &str, char_start: usize, char_end: usize) -> Self
pub fn from_chars(text: &str, char_start: usize, char_end: usize) -> Self
Create a span from character offsets, computing byte offsets from text.
§Arguments
text- The full text (needed to compute byte offsets)char_start- Character offset start (inclusive)char_end- Character offset end (exclusive)
Sourcepub const fn ascii(start: usize, end: usize) -> Self
pub const fn ascii(start: usize, end: usize) -> Self
Create a span for ASCII text where byte == char offsets.
This is a fast path for ASCII-only text.
Sourcepub const fn byte_range(&self) -> Range<usize>
pub const fn byte_range(&self) -> Range<usize>
Get byte range.
Sourcepub const fn char_range(&self) -> Range<usize>
pub const fn char_range(&self) -> Range<usize>
Get character range.
Trait Implementations§
Source§impl<'de> Deserialize<'de> for TextSpan
impl<'de> Deserialize<'de> for TextSpan
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
impl Copy for TextSpan
impl Eq for TextSpan
impl StructuralPartialEq for TextSpan
Auto Trait Implementations§
impl Freeze for TextSpan
impl RefUnwindSafe for TextSpan
impl Send for TextSpan
impl Sync for TextSpan
impl Unpin for TextSpan
impl UnsafeUnpin for TextSpan
impl UnwindSafe for TextSpan
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more