Skip to main content

ResponseCache

Struct ResponseCache 

Source
pub struct ResponseCache { /* private fields */ }
Expand description

LRU cache for LLM completion responses.

Thread-safe via parking_lot::Mutex (never held across .await). Entries are keyed by FNV-1a hash of (system_prompt, messages, sorted tool names).

Uses a Vec with move-to-front on hit and eviction from back, giving O(n) operations per access. This is efficient for typical capacities (10–100). For very large caches (1000+), consider an alternative implementation.

parking_lot::Mutex is adopted on this hot path (every cached LLM call) for ~2× faster acquisition vs. std::sync::Mutex; see T2 in tasks/performance-audit-heartbit-core-2026-05-06.md.

Implementations§

Source§

impl ResponseCache

Source

pub fn new(capacity: usize) -> Self

Create a new cache with the given maximum number of entries.

Source

pub fn get(&self, key: u64) -> Option<CompletionResponse>

Look up a cached response by key. On hit, moves the entry to the front (LRU).

Source

pub fn put(&self, key: u64, response: CompletionResponse)

Insert a response into the cache. Evicts the least-recently-used entry if at capacity.

Source

pub fn compute_key( system_prompt: &str, messages: &[Message], tool_names: &[&str], ) -> u64

Compute a cache key from the request components.

Uses FNV-1a hash of system prompt, serialized messages, and sorted tool names.

Backward-compatible with single-tenant code: prefer ResponseCache::compute_key_scoped when the runner is shared across tenants/users (F-AGENT-3).

Source

pub fn compute_key_scoped( system_prompt: &str, messages: &[Message], tool_names: &[&str], namespace: Option<&str>, ) -> u64

Compute a cache key including a tenant/user namespace.

SECURITY (F-AGENT-3): when a single AgentRunner is shared across tenants (typical daemon deployment), the cache key MUST disambiguate otherwise-identical requests — otherwise tenant A’s cached response could be served to tenant B if their system_prompt + messages happened to coincide. Pass Some("{tenant_id}:{user_id}") (or any unique namespace string) to scope the cache.

Source

pub fn clear(&self)

Remove all entries from the cache.

Source

pub fn len(&self) -> usize

Number of entries currently in the cache.

Source

pub fn is_empty(&self) -> bool

Returns true if the cache contains no entries.

Trait Implementations§

Source§

impl Debug for ResponseCache

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> PolicyExt for T
where T: ?Sized,

Source§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow. Read more
Source§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more