pub struct TokenizedInput {
pub token_ids: Vec<i64>,
pub segment_ids: Vec<i8>,
pub special_tokens_mask: Vec<i8>,
pub overflowing_tokens: Vec<i64>,
pub num_truncated_tokens: usize,
pub token_offsets: Vec<Option<Offset>>,
pub reference_offsets: Vec<Vec<OffsetSize>>,
pub mask: Vec<Mask>,
}Expand description
§Tokenized Input, ready for processing in language models
This represents the final output of the encoding process (tokenized sentence with encoded values)
Fields§
§token_ids: Vec<i64>Vector of token IDs
segment_ids: Vec<i8>Vector segments ids (for example for BERT segments are separated with a [SEP] marker, each incrementing the segment ID). This vector has the same length as token_ids.
special_tokens_mask: Vec<i8>Flags tokens as special tokens (1) or not (0). This vector has the same length as token_ids.
overflowing_tokens: Vec<i64>Vector containing overflowing tokens, populated following a truncation step
num_truncated_tokens: usizeNumber of overflowing tokens following a truncation step. this equals the length overflowing_tokens
token_offsets: Vec<Option<Offset>>Offset information (as start and end positions) in relation to the original text. Tokens that can not be related to the original source are registered as None.
reference_offsets: Vec<Vec<OffsetSize>>Offset information (as a sequence of positions) in relation to the original text. Tokens that can not be related to the original source are registered as None.
mask: Vec<Mask>Masks tokens providing information on the type of tokens. This vector has the same length as token_ids.
Trait Implementations§
Source§impl Clone for TokenizedInput
impl Clone for TokenizedInput
Source§fn clone(&self) -> TokenizedInput
fn clone(&self) -> TokenizedInput
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
source. Read moreSource§impl Debug for TokenizedInput
impl Debug for TokenizedInput
Source§impl PartialEq for TokenizedInput
impl PartialEq for TokenizedInput
Source§impl PartialOrd for TokenizedInput
impl PartialOrd for TokenizedInput
impl Eq for TokenizedInput
impl StructuralPartialEq for TokenizedInput
Auto Trait Implementations§
impl Freeze for TokenizedInput
impl RefUnwindSafe for TokenizedInput
impl Send for TokenizedInput
impl Sync for TokenizedInput
impl Unpin for TokenizedInput
impl UnwindSafe for TokenizedInput
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more