pub struct RetentionRatioTruncation {
pub retention_ratio: f32,
pub type: String,
pub token_limits: Option<TokenLimits>,
}Available on crate feature
realtime only.Fields§
§retention_ratio: f32Fraction of post-instruction conversation tokens to retain (0.0 - 1.0) when the conversation exceeds the input token limit. Setting this to 0.8 means that messages will be dropped until 80% of the maximum allowed tokens are used. This helps reduce the frequency of truncations and improve cache rates.
type: StringUse retention ratio truncation.
token_limits: Option<TokenLimits>Optional custom token limits for this truncation strategy. If not provided, the model’s default token limits will be used.
Trait Implementations§
Source§impl Clone for RetentionRatioTruncation
impl Clone for RetentionRatioTruncation
Source§fn clone(&self) -> RetentionRatioTruncation
fn clone(&self) -> RetentionRatioTruncation
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for RetentionRatioTruncation
impl Debug for RetentionRatioTruncation
Source§impl<'de> Deserialize<'de> for RetentionRatioTruncation
impl<'de> Deserialize<'de> for RetentionRatioTruncation
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations§
impl Freeze for RetentionRatioTruncation
impl RefUnwindSafe for RetentionRatioTruncation
impl Send for RetentionRatioTruncation
impl Sync for RetentionRatioTruncation
impl Unpin for RetentionRatioTruncation
impl UnwindSafe for RetentionRatioTruncation
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more