Skip to main content

ModelLimit

bamboo_compression::limits

Struct ModelLimit

pub struct ModelLimit {
    pub model_pattern: String,
    pub max_context_tokens: u32,
    pub max_output_tokens: Option<u32>,
    pub safety_margin: Option<u32>,
}

Expand description

Model limit configuration (user-overridable).

Fields§

§model_pattern: String

Model identifier (partial match supported, e.g., “gpt-4” matches “gpt-4o”)

§max_context_tokens: u32

Maximum total context window size (input + output) in tokens

§max_output_tokens: Option<u32>

Maximum output tokens (defaults to min(max_context / 4, DEFAULT_MAX_OUTPUT_TOKENS) when unset — see Self::get_max_output_tokens)

§safety_margin: Option<u32>

Safety margin for token counting (defaults to 1000)

Implementations§

impl ModelLimit

pub fn new(model_pattern: impl Into<String>, max_context_tokens: u32) -> Self

Create a new model limit with defaults.

pub fn get_max_output_tokens(&self) -> u32

Get max output tokens with default calculation.

When unset, derive from the context window (max_context_tokens / 4) capped at the global DEFAULT_MAX_OUTPUT_TOKENS. The cap tracks the global default rather than a hard-coded 4096, so a user override like ModelLimit::new("gpt-4o", 128_000) (no explicit max_output_tokens) resolves to min(32_000, 128_000) = 32_000 instead of collapsing to 4096 — see issue #20, bug 4.

pub fn get_safety_margin(&self) -> u32

Get safety margin, scaling proportionally with context window.

Trait Implementations§

impl Clone for ModelLimit

fn clone(&self) -> ModelLimit

Returns a duplicate of the value. Read more

1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for ModelLimit

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl<'de> Deserialize<'de> for ModelLimit

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl Serialize for ModelLimit

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

Auto Trait Implementations§

impl Freeze for ModelLimit

impl RefUnwindSafe for ModelLimit

impl Send for ModelLimit

impl Sync for ModelLimit

impl Unpin for ModelLimit

impl UnsafeUnpin for ModelLimit

impl UnwindSafe for ModelLimit

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> Same for T

type Output = T

Should always be Self

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more