Skip to main content

Module optimized_client

Module optimized_client 

Source
Expand description

Optimized LLM client with connection pooling and request batching

Structsยง

BatchedRequest
A request that can be batched with similar requests
CachedResponse
Cached response with TTL
ClientMetrics
Client performance metrics
ConnectionPool
Connection pool for HTTP clients
OptimizedLLMClient
Optimized LLM client with advanced performance features
OptimizedRequest
Simplified request structure for optimization
OptimizedResponse
Simplified response structure
RateLimiter
Rate limiter for API requests
RequestBatcher
Request batching manager for similar requests
TokenBucket
Token bucket for rate limiting