Expand description
A table that uses the ObjectStore
listing capability
to get the list of files to process.
Re-exports§
pub use self::file::as_file_source;
pub use self::url::ListingTableUrl;
Modules§
- decoder
- Module containing helper methods for the various file formats See write.rs for write related helper methods
- display
- file
- Common behaviors that every file format needs to implement
- file_
compression_ type - File Compression type abstraction
- file_
format - Module containing helper methods for the various file formats See write.rs for write related helper methods
- file_
groups - Logic for managing groups of
PartitionedFile
s in DataFusion - file_
meta - file_
scan_ config FileScanConfig
to configure scanning of possibly partitioned file sources.- file_
sink_ config - file_
stream - A generic stream over file format readers that can be used by any file format that read its files from start to end.
- memory
- schema_
adapter SchemaAdapter
andSchemaAdapterFactory
to adapt file-level record batches to a table schema.- sink
- Execution plan for writing data to
DataSink
s - source
DataSource
andDataSourceExec
- url
- write
- Module containing helper methods/traits related to enabling write support for the various file formats
Structs§
- File
Range - Only scan a subset of Row Groups from the Parquet file whose data “midpoint” lies within the [start, end) byte offsets. This option can be used to scan non-overlapping sections of a Parquet file in parallel.
- Partitioned
File - A single file or part of a file that should be read, along with its schema, statistics and partition column values that need to be appended to each row.
Enums§
- Range
Calculation - Represents the possible outcomes of a range calculation.
Functions§
- add_
row_ stats Deprecated - calculate_
range - Calculates an appropriate byte range for reading from an object based on the provided metadata.
- compute_
all_ files_ statistics - Computes statistics for all files across multiple file groups.
- generate_
test_ files - Generates test files with min-max statistics in different overlap patterns.
- verify_
sort_ integrity - Used by tests and benchmarks
Type Aliases§
- Partitioned
File Stream - Stream of files get listed from object store