Skip to main content

Module performance_optimization

Module performance_optimization 

Source
Expand description

Performance Optimization Utilities

This module provides performance optimization utilities for model inference, including batch processing, memory optimization, and caching strategies.

Structs§

AdvancedPerformanceOptimizer
Advanced performance optimizer with workload analysis
BatchProcessor
Batch processor for efficient inference
BatchStatistics
Batch processing statistics
CacheStatistics
Cache statistics
CachedTensor
DynamicBatchManager
Dynamic batch manager
GpuCacheStatistics
Comprehensive GPU cache statistics
GpuMemoryChunk
GpuMemoryOptimizer
GPU memory optimizer with intelligent recommendations
GpuMemoryPool
Advanced GPU Memory Management
GpuMemoryStats
GpuOptimizationRecommendations
GPU memory optimization recommendations
GpuTensorCache
Advanced GPU tensor caching with memory-aware eviction
LruCache
LRU Cache implementation for tensors
MemoryOptimizer
Memory optimization utilities
PerformanceConfig
Configuration for performance optimization
PerformanceMonitor
Performance monitoring utilities
PerformanceStatistics
Performance statistics
WorkloadAnalysis
Workload analysis summary
WorkloadMetrics
Workload metrics for optimization analysis

Enums§

BatchingStrategy
Dynamic batching strategy