openai-client-base 0.12.0

# RetentionRatioTruncation

## Properties

Name | Type | Description | Notes
------------ | ------------- | ------------- | -------------
**r#type** | **String** | Use retention ratio truncation. | 
**retention_ratio** | **f64** | Fraction of post-instruction conversation tokens to retain (`0.0` - `1.0`) when the conversation exceeds the input token limit. Setting this to `0.8` means that messages will be dropped until 80% of the maximum allowed tokens are used. This helps reduce the frequency of truncations and improve cache rates.  | 
**token_limits** | Option<[**models::RetentionRatioTruncationTokenLimits**](Retention_ratio_truncation_token_limits.md)> |  | [optional]

[[Back to Model list]](../README.md#documentation-for-models) [[Back to API list]](../README.md#documentation-for-api-endpoints) [[Back to README]](../README.md)