Skip to main content

REMEMBER_MAX_CONTROLLED_BATCH_PADDED_TOKENS

Constant REMEMBER_MAX_CONTROLLED_BATCH_PADDED_TOKENS 

Source
pub const REMEMBER_MAX_CONTROLLED_BATCH_PADDED_TOKENS: usize = 512;
Expand description

Maximum padded-token budget per controlled micro-batch in remember.

The budget uses max_tokens_no_batch * batch_size, approximating the real cost of BatchLongest padding. Values exceeding this fall back to smaller batches or serialisation.