Skip to main content

ParquetStreamingSink

Struct ParquetStreamingSink 

Source
pub struct ParquetStreamingSink<T: ToParquetBatch + Send> { /* private fields */ }
Expand description

Parquet streaming sink that buffers items and writes in row groups.

Items are buffered until a row group is full, then written to the Parquet file. This approach provides efficient compression while supporting streaming input.

§Type Parameters

  • T - The type of items to write. Must implement ToParquetBatch.

§Note

For custom types, implement the ToParquetBatch trait to define how your data maps to Arrow schema and batches.

Implementations§

Source§

impl<T: ToParquetBatch + Send> ParquetStreamingSink<T>

Source

pub fn new(path: PathBuf, row_group_size: usize) -> SynthResult<Self>

Creates a new Parquet streaming sink.

§Arguments
  • path - Path to the output Parquet file
  • row_group_size - Number of rows per row group (default: 10000)
§Errors

Returns an error if the file cannot be created.

Source

pub fn with_defaults(path: PathBuf) -> SynthResult<Self>

Creates a Parquet streaming sink with default row group size (10000).

Source

pub fn path(&self) -> &PathBuf

Returns the path to the output file.

Trait Implementations§

Source§

impl<T: ToParquetBatch + Send> StreamingSink<T> for ParquetStreamingSink<T>

Source§

fn process(&mut self, event: StreamEvent<T>) -> SynthResult<()>

Processes a stream event.
Source§

fn flush(&mut self) -> SynthResult<()>

Flushes any buffered data.
Source§

fn close(self) -> SynthResult<()>

Closes the sink and releases resources.
Source§

fn items_processed(&self) -> u64

Returns the number of items processed.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

Source§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
Source§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
Source§

fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
Source§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more