Module parquet2::schema

Modules

Structs

Embedded BSON logical type annotation

Description for ColumnIndex. Entry [i] of each of its array fields refers to the page at OffsetIndex.page_locations[i].
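The index-to-page correspondence described above can be illustrated with a minimal sketch. The `PageRanges` struct and `candidate_pages` function are hypothetical names used only for illustration, with `i64` values assumed for simplicity. They are not this crate's API.

```rust
/// Hypothetical per-page min/max summary (illustration only, not the crate's struct).
/// Entry `i` of each vector describes the same page `i`.
pub struct PageRanges {
    pub min_values: Vec<i64>,
    pub max_values: Vec<i64>,
}

/// Returns the indices of pages whose [min, max] range overlaps [lo, hi].
/// Each returned index `i` would also select entry `i` of the page locations.
pub fn candidate_pages(ranges: &PageRanges, lo: i64, hi: i64) -> Vec<usize> {
    ranges
        .min_values
        .iter()
        .zip(ranges.max_values.iter())
        .enumerate()
        .filter(|(_, (min, max))| **min <= hi && **max >= lo)
        .map(|(i, _)| i)
        .collect()
}
```

A reader could use the returned indices to look up only the matching page locations and skip the rest.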

Description for column metadata

Data page header

New page format allowing reading levels without decompressing the data. Repetition and definition levels are uncompressed. The remaining section containing the data is compressed if is_compressed is true.
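As a rough sketch of why this layout lets a reader access levels without decompression: if the byte lengths of the two level sections are known from the header, the page body can be sliced directly. The function name `split_page_v2` and its parameters are assumptions for illustration, not the crate's API.

```rust
/// Splits a page body into (repetition levels, definition levels, data).
/// The two level sections sit uncompressed at the front of the buffer;
/// the trailing data section may still need decompression.
/// Returns `None` if the declared lengths exceed the buffer.
pub fn split_page_v2(
    buf: &[u8],
    rep_len: usize,
    def_len: usize,
) -> Option<(&[u8], &[u8], &[u8])> {
    let levels_len = rep_len.checked_add(def_len)?;
    if levels_len > buf.len() {
        return None;
    }
    let (rep, rest) = buf.split_at(rep_len);
    let (def, data) = rest.split_at(def_len);
    Some((rep, def, data))
}
```

Only the third slice would be passed to a decompressor, and only when the compression flag is set.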

Decimal logical type annotation

TODO: *

Description for file metadata

Integer logical type annotation

Embedded JSON logical type annotation

Wrapper struct to store key values

Time units for logical types

Logical type to annotate a column that is always null.

Statistics of a given page type and encoding

Represents an element inside a schema definition.

Wrapper struct to specify sort order

Statistics per row group and per page. All fields are optional.
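Because every statistics field is optional, code that combines statistics has to treat a missing value as "unknown" rather than as zero. The sketch below shows one way to do that. The `Stats` struct, its `i64` fields, and the `merge` function are hypothetical and simplified; they are not the crate's definitions.

```rust
/// Hypothetical, simplified statistics record (illustration only).
#[derive(Debug, Clone, PartialEq, Default)]
pub struct Stats {
    pub min: Option<i64>,
    pub max: Option<i64>,
    pub null_count: Option<i64>,
}

/// Combines the statistics of two pages.
/// A field is known in the result only if it is known in both inputs.
pub fn merge(a: &Stats, b: &Stats) -> Stats {
    Stats {
        min: a.min.zip(b.min).map(|(x, y)| x.min(y)),
        max: a.max.zip(b.max).map(|(x, y)| x.max(y)),
        null_count: a.null_count.zip(b.null_count).map(|(x, y)| x + y),
    }
}
```

Propagating `None` is the conservative choice here: a merged maximum computed from only one side could wrongly exclude rows during filtering.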

Empty structs to use as logical type annotations

Time logical type annotation

Timestamp logical type annotation

Empty struct to signal the order defined by the physical or logical type

Enums

Enum to annotate whether lists of min/max elements inside ColumnIndex are ordered and if so, in which direction.
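To make the idea of ordered min/max lists concrete, the following sketch classifies a pair of lists by direction. The `Order` enum and `classify` function are hypothetical stand-ins with `i64` values assumed; the crate's own enum and variant names may differ.

```rust
/// Hypothetical stand-in for the ordering annotation (illustration only).
#[derive(Debug, PartialEq, Clone, Copy)]
pub enum Order {
    Unordered,
    Ascending,
    Descending,
}

/// Classifies per-page min and max lists.
/// Both lists must be sorted in the same direction to count as ordered.
/// Empty and single-element lists are reported as `Ascending`.
pub fn classify(mins: &[i64], maxs: &[i64]) -> Order {
    let asc = |v: &[i64]| v.windows(2).all(|w| w[0] <= w[1]);
    let desc = |v: &[i64]| v.windows(2).all(|w| w[0] >= w[1]);
    if asc(mins) && asc(maxs) {
        Order::Ascending
    } else if desc(mins) && desc(maxs) {
        Order::Descending
    } else {
        Order::Unordered
    }
}
```

When the lists are known to be ordered, a reader can binary-search them instead of scanning every page entry.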

Supported compression algorithms.

Common types used by frameworks (e.g. Hive, Pig) using Parquet. This helps map between the types in those frameworks and the base types in Parquet. This is only metadata and not needed to read or write the data.

Encodings supported by Parquet. Not all encodings are valid for all types. These enums are also used to specify the encoding of definition and repetition levels. See the accompanying doc for the details of the more complicated encodings.

Representation of Schemas

Representation of Schemas

Types supported by Parquet. These types are intended to be used in combination with the encodings to control the on-disk storage format. For example, INT16 is not included as a type since a good encoding of INT32 would handle this.
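The INT16 remark can be sketched as a round trip: 16-bit values are widened to 32 bits for storage and narrowed again when read. The helper names `widen` and `narrow` are hypothetical, and the sketch leaves out the encoding step that would keep small values compact on disk.

```rust
use std::convert::TryFrom;

/// Widens 16-bit values to the 32-bit physical type used for storage.
pub fn widen(values: &[i16]) -> Vec<i32> {
    values.iter().map(|&v| i32::from(v)).collect()
}

/// Narrows stored 32-bit values back to 16 bits.
/// Returns `None` if any value does not fit in an `i16`.
pub fn narrow(values: &[i32]) -> Option<Vec<i16>> {
    values.iter().map(|&v| i16::try_from(v).ok()).collect()
}
```

The checked conversion on the read side guards against files whose stored values fall outside the 16-bit range.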