Module iterator


Streaming iterator API for datasets exceeding RAM.

Provides a lazy, chunk-based iteration interface over multiple data sources (in-memory vectors, CSV files, directories of files). Each iteration step yields a StreamingDataChunk holding at most chunk_size rows, enabling processing of arbitrarily large datasets with bounded memory usage.
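The chunking idea described above can be illustrated with a minimal sketch. It does not use this module's API: `Chunk`, `ChunkedIter`, and `sum_in_chunks` are hypothetical names introduced here for illustration, and the real `NewStreamingIterator`, `StreamingDataChunk`, and `DataSource` types may be shaped quite differently. The sketch only shows how pulling at most `chunk_size` rows per step keeps memory bounded regardless of how long the underlying row stream is.

```rust
/// Hypothetical stand-in for a chunk; NOT the crate's `StreamingDataChunk`.
#[derive(Debug, PartialEq)]
pub struct Chunk<T> {
    /// Zero-based position of this chunk in the stream.
    pub index: usize,
    /// At most `chunk_size` rows.
    pub rows: Vec<T>,
}

/// Hypothetical adapter that pulls rows lazily from any row iterator.
/// NOT the crate's `NewStreamingIterator`.
pub struct ChunkedIter<I: Iterator> {
    rows: I,
    chunk_size: usize,
    next_index: usize,
}

impl<I: Iterator> ChunkedIter<I> {
    pub fn new(rows: I, chunk_size: usize) -> Self {
        assert!(chunk_size > 0, "chunk_size must be non-zero");
        Self { rows, chunk_size, next_index: 0 }
    }
}

impl<I: Iterator> Iterator for ChunkedIter<I> {
    type Item = Chunk<I::Item>;

    fn next(&mut self) -> Option<Self::Item> {
        // Pull at most `chunk_size` rows from the source; nothing beyond
        // the current chunk is buffered.
        let rows: Vec<I::Item> = self.rows.by_ref().take(self.chunk_size).collect();
        if rows.is_empty() {
            return None; // source exhausted
        }
        let index = self.next_index;
        self.next_index += 1;
        Some(Chunk { index, rows })
    }
}

/// Sums a stream of numbers chunk by chunk and reports how many chunks
/// were produced. Only one chunk is alive at a time.
pub fn sum_in_chunks(rows: impl Iterator<Item = u64>, chunk_size: usize) -> (u64, usize) {
    let mut total = 0;
    let mut chunks = 0;
    for chunk in ChunkedIter::new(rows, chunk_size) {
        total += chunk.rows.iter().sum::<u64>();
        chunks += 1;
    }
    (total, chunks)
}
```

With this sketch, `sum_in_chunks(1..=10u64, 4)` walks three chunks of 4, 4, and 2 rows. The same adapter would work over a file-backed row source, for example the iterator returned by `std::io::BufRead::lines`, since it only ever requests the next `chunk_size` items.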

Structs

NewStreamingIterator
Streaming iterator over a DataSource, yielding StreamingDataChunks.
StreamingDataChunk
A single chunk produced by NewStreamingIterator.
StreamingIteratorConfig
Configuration for a NewStreamingIterator.

Enums

DataSource
Origin of data for a NewStreamingIterator.