pub fn generate_data(config: GeneratorConfig) -> DataBufferExpand description
Generate data with full configuration (ZERO-COPY - returns DataBuffer)
§Algorithm
- Fill blocks with Xoshiro256++ keystream (high entropy baseline)
- Add local back-references for compression
- Use round-robin deduplication across unique blocks
- Parallel generation via rayon (NUMA-aware if enabled)
§Performance
- 5-15 GB/s per core with incompressible data
- 1-4 GB/s with compression enabled (depends on compress factor)
- Near-linear scaling with CPU cores
§Returns
DataBuffer that holds the generated data without copying:
- UMA: Vec
wrapper - NUMA: hwlocality Bytes wrapper (when numa_node is specified)
Python accesses this memory directly via buffer protocol - ZERO COPY!