Module memory

ruvector_sparse_inference

Module memory

Expand description

Memory management for sparse inference.

This module provides weight quantization and neuron caching for efficient memory usage during inference.

Structs§

CacheStats: Cache statistics.
NeuronCache: Neuron activation cache for hot/cold management.
QuantizedWeights: Quantized weight storage for reduced memory usage.