Skip to main content

ParquetStreamingSink

Struct ParquetStreamingSink 

Source
pub struct ParquetStreamingSink<T: ToParquetBatch + Send> { /* private fields */ }
Expand description

Parquet streaming sink that buffers items and writes in row groups.

Items are buffered until a row group is full, then written to the Parquet file. This approach provides efficient compression while supporting streaming input.

§Type Parameters

  • T - The type of items to write. Must implement ToParquetBatch.

§Note

For custom types, implement the ToParquetBatch trait to define how your data maps to Arrow schema and batches.

Implementations§

Source§

impl<T: ToParquetBatch + Send> ParquetStreamingSink<T>

Source

pub fn new(path: PathBuf, row_group_size: usize) -> SynthResult<Self>

Creates a new Parquet streaming sink.

§Arguments
  • path - Path to the output Parquet file
  • row_group_size - Number of rows per row group (default: 10000)
§Errors

Returns an error if the file cannot be created.

Source

pub fn with_defaults(path: PathBuf) -> SynthResult<Self>

Creates a Parquet streaming sink with default row group size (10000).

Source

pub fn path(&self) -> &PathBuf

Returns the path to the output file.

Trait Implementations§

Source§

impl<T: ToParquetBatch + Send> StreamingSink<T> for ParquetStreamingSink<T>

Source§

fn process(&mut self, event: StreamEvent<T>) -> SynthResult<()>

Processes a stream event.
Source§

fn flush(&mut self) -> SynthResult<()>

Flushes any buffered data.
Source§

fn close(self) -> SynthResult<()>

Closes the sink and releases resources.
Source§

fn items_processed(&self) -> u64

Returns the number of items processed.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more