Expand description
Memory optimization for AI operations
This module provides memory-efficient operations for embeddings, model weights, and large-scale AI processing to minimize memory footprint in production.
Re-exports§
pub use compression::CompressionAlgorithm;pub use compression::Compressor;pub use pooling::MemoryPool;pub use pooling::PooledBuffer;pub use streaming::ChunkProcessor;pub use streaming::StreamProcessor;pub use tensor_ops::MemoryEfficientTensor;pub use tensor_ops::TensorOptimizer;
Modules§
- compression
- Compression for cached embeddings and model data
- pooling
- Memory pooling for efficient buffer reuse
- streaming
- Streaming processing for large datasets
- tensor_
ops - Memory-efficient tensor operations
Structs§
- Memory
Metrics - Memory usage metrics
- Memory
Optimization Config - Configuration for memory optimization
- Memory
Optimizer - Memory optimization manager