Expand description
DataFusion Parquet Reader: ParquetSource
Re-exports§
pub use access_plan::ParquetAccessPlan;pub use access_plan::RowGroupAccess;pub use file_format::*;
Modules§
- access_
plan - file_
format ParquetFormat: ParquetFileFormatabstractions- metadata
DFParquetMetadatafor fetching Parquet file metadata, statistics and schema information.- source
- ParquetSource implementation for reading parquet files
Structs§
- Cached
Parquet File Reader - Implements
AsyncFileReaderfor a Parquet file in object storage. Reads the file metadata from theFileMetadataCache, if available, otherwise reads it directly from the file and then updates the cache. - Cached
Parquet File Reader Factory - Implementation of
ParquetFileReaderFactorysupporting the caching of footer and page metadata. Reads and updates theFileMetadataCachewith theParquetMetaDatadata. This reader always loads the entire metadata (including page index, unless the file is encrypted), even if not required by the current query, to ensure it is always available for those that need it. - Cached
Parquet Meta Data - Wrapper to implement
FileMetadataforParquetMetaData. - Default
Parquet File Reader Factory - Default implementation of
ParquetFileReaderFactory - Int96
Coercer - Builder for coercing INT96-originated Timestamp columns in
file_schemato a specificTimeUnit, optionally attaching a timezone. - Page
Pruning Access Plan Filter - Filters a
ParquetAccessPlanbased on the Parquet PageIndex, if present - Parquet
File Metrics - Stores metrics about the parquet execution for a particular parquet file.
- Parquet
File Reader - Implements
AsyncFileReaderfor a parquet file in object storage. - Parquet
Sink - Implements
DataSinkfor writing to a parquet file. - RowGroup
Access Plan Filter - Reduces the
ParquetAccessPlanbased on row group level metadata.
Traits§
- Parquet
File Reader Factory - Interface for reading Apache Parquet files.
Functions§
- apply_
file_ schema_ type_ coercions - Apply necessary schema type coercions to make file schema match table schema.
- build_
row_ filter - Build a
RowFilterfrom the given predicate expression if possible. - can_
expr_ be_ pushed_ down_ with_ schemas - Checks if a predicate expression can be pushed down to the parquet decoder.
- coerce_
file_ schema_ to_ string_ type Deprecated - If the table schema uses a string type, coerce the file schema to use a string type.
- coerce_
file_ schema_ to_ view_ type Deprecated - Coerces the file schema if the table schema uses a view type.
- coerce_
int96_ to_ resolution Deprecated - Coerces the file schema’s Timestamps to the provided TimeUnit if the Parquet schema contains INT96.
- plan_
to_ parquet - Executes a query and writes the results to a partitioned Parquet file.
- transform_
binary_ to_ string - Transform a schema so that any binary types are strings
- transform_
schema_ to_ view - Transform a schema to use view types for Utf8 and Binary