Skip to main content

RaggedBatch

anno_core::core::entity

Struct RaggedBatch

pub struct RaggedBatch {
    pub token_ids: Vec<u32>,
    pub cumulative_offsets: Vec<u32>,
    pub max_seq_len: usize,
}

Expand description

A ragged (unpadded) batch for efficient ModernBERT inference.

ModernBERT achieves its speed advantage by avoiding padding tokens entirely. Instead of [batch, max_seq_len], it uses a single contiguous 1D sequence with offset indices to track document boundaries.

§Memory Layout

Traditional (padded):
[doc1_tok1, doc1_tok2, PAD, PAD, PAD]  <- wasted compute
[doc2_tok1, doc2_tok2, doc2_tok3, PAD, PAD]

Ragged (unpadded):
[doc1_tok1, doc1_tok2, doc2_tok1, doc2_tok2, doc2_tok3]
cumulative_offsets: [0, 2, 5]  <- doc1 is [0..2], doc2 is [2..5]

Fields§

§token_ids: Vec<u32>

Token IDs flattened into a single contiguous array. Shape: [total_tokens] (1D, no padding)

§cumulative_offsets: Vec<u32>

Cumulative sequence lengths. Length: batch_size + 1 Document i spans tokens [offsets[i]..offsets[i+1])

§max_seq_len: usize

Maximum sequence length in this batch (for kernel bounds).

Implementations§

impl RaggedBatch

pub fn from_sequences(sequences: &[Vec<u32>]) -> Self

Create a new ragged batch from sequences.

pub fn batch_size(&self) -> usize

Get the number of documents in this batch.

pub fn total_tokens(&self) -> usize

Get the total number of tokens (no padding).

pub fn doc_range(&self, doc_idx: usize) -> Option<Range<usize>>

Get token range for a specific document.

pub fn doc_tokens(&self, doc_idx: usize) -> Option<&[u32]>

Get tokens for a specific document.

pub fn padding_savings(&self) -> f64

Calculate memory saved vs padded batch.

Trait Implementations§

impl Clone for RaggedBatch

fn clone(&self) -> RaggedBatch

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for RaggedBatch

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

impl Freeze for RaggedBatch

impl RefUnwindSafe for RaggedBatch

impl Send for RaggedBatch

impl Sync for RaggedBatch

impl Unpin for RaggedBatch

impl UnsafeUnpin for RaggedBatch

impl UnwindSafe for RaggedBatch

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V