TableParquetOptions

Struct TableParquetOptions 

pub struct TableParquetOptions {
    pub global: ParquetOptions,
    pub column_specific_options: HashMap<String, ParquetColumnOptions>,
    pub key_value_metadata: HashMap<String, Option<String>>,
    pub crypto: ParquetEncryptionOptions,
}

Options that control how Parquet files are read and written, including global options that apply to all columns and optional column-specific overrides.

Closely tied to ParquetWriterOptions. Properties not included in TableParquetOptions may not be configurable through the external API (e.g. sorting_columns).

Fields§

§global: ParquetOptions

Global Parquet options that propagate to all columns.

§column_specific_options: HashMap<String, ParquetColumnOptions>

Column-specific options. The default usage is parquet.XX::column, where XX is the option name and column is the name of the column it applies to.
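As a rough illustration of the XX::column key shape, the sketch below splits a key on the first "::" into an option name and an optional column name. The helper split_column_key and the column name col1 are hypothetical and exist only for this example; this is not DataFusion's own parsing code.

```rust
/// Hypothetical helper (not part of DataFusion): splits an option key of the
/// form `XX::column` into the option name and, if present, the column name.
fn split_column_key(key: &str) -> (&str, Option<&str>) {
    match key.split_once("::") {
        // A `::` separator marks a column-specific key.
        Some((option, column)) => (option, Some(column)),
        // No separator: treat the key as a global option.
        None => (key, None),
    }
}

fn main() {
    // `col1` is an assumed column name used only for illustration.
    for key in ["compression", "compression::col1"] {
        match split_column_key(key) {
            (option, Some(column)) => println!("{option} applies to column {column}"),
            (option, None) => println!("{option} applies to all columns"),
        }
    }
}
```

Splitting on the first separator only keeps any later "::" inside the column part, which is a design choice of this sketch rather than documented behavior.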

§key_value_metadata: HashMap<String, Option<String>>

Additional file-level metadata to include. Inserted into the key_value_metadata for the written FileMetaData.

Multiple entries are permitted. If the same key is given more than once, the last value wins:

OPTIONS (
   'format.metadata::key1' '',
   'format.metadata::key2' 'value',
   'format.metadata::key3' 'value has spaces',
   'format.metadata::key4' 'value has special chars :: :',
   'format.metadata::key_dupe' 'original will be overwritten',
   'format.metadata::key_dupe' 'final'
)
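The overwrite behavior for a repeated key follows from the field being a HashMap. The sketch below models it with a plain HashMap<String, Option<String>>, the same type as key_value_metadata. The helper collect_metadata is hypothetical, and it always stores Some(value); it does not model how DataFusion itself parses the 'format.metadata::' prefix or treats empty values.

```rust
use std::collections::HashMap;

/// Hypothetical helper: collects (key, value) pairs into a map with the same
/// type as `key_value_metadata`. Assumption: every value is stored as `Some`.
fn collect_metadata(pairs: &[(&str, &str)]) -> HashMap<String, Option<String>> {
    let mut metadata = HashMap::new();
    for &(key, value) in pairs {
        // `insert` replaces any value previously stored under the same key.
        metadata.insert(key.to_string(), Some(value.to_string()));
    }
    metadata
}

fn main() {
    let metadata = collect_metadata(&[
        ("key2", "value"),
        ("key_dupe", "original will be overwritten"),
        ("key_dupe", "final"),
    ]);
    // Only two distinct keys remain; `key_dupe` holds the later value.
    println!("{} entries", metadata.len());
    println!("key_dupe = {:?}", metadata["key_dupe"]);
}
```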
§crypto: ParquetEncryptionOptions

Options for configuring Parquet modular encryption.

To use Parquet encryption, you must enable the parquet_encryption feature flag, as it is not activated by default. See ConfigFileEncryptionProperties and ConfigFileDecryptionProperties in datafusion/common/src/config.rs. These can be set via ‘format.crypto’, for example:

OPTIONS (
   'format.crypto.file_encryption.encrypt_footer' 'true',
   'format.crypto.file_encryption.footer_key_as_hex' '30313233343536373839303132333435',  -- b"0123456789012345"
   'format.crypto.file_encryption.column_key_as_hex::double_field' '31323334353637383930313233343530', -- b"1234567890123450"
   'format.crypto.file_encryption.column_key_as_hex::float_field' '31323334353637383930313233343531', -- b"1234567890123451"
   -- Same for decryption
   'format.crypto.file_decryption.footer_key_as_hex' '30313233343536373839303132333435', -- b"0123456789012345"
   'format.crypto.file_decryption.column_key_as_hex::double_field' '31323334353637383930313233343530', -- b"1234567890123450"
   'format.crypto.file_decryption.column_key_as_hex::float_field' '31323334353637383930313233343531' -- b"1234567890123451"
)

See datafusion-cli/tests/sql/encrypted_parquet.sql for a more complete example. Note that keys must be provided in hex format since they are binary strings.
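To produce the hex form of a binary key, each byte is written as two lowercase hex digits. The following is a minimal sketch using only the standard library; the helper to_hex is hypothetical and not part of DataFusion.

```rust
/// Hypothetical helper: encodes raw key bytes as a lowercase hex string,
/// two hex digits per byte.
fn to_hex(bytes: &[u8]) -> String {
    bytes.iter().map(|b| format!("{:02x}", b)).collect()
}

fn main() {
    // The 16-byte footer key from the example above.
    let footer_key = b"0123456789012345";
    // prints "30313233343536373839303132333435"
    println!("{}", to_hex(footer_key));
}
```

The printed string matches the footer_key_as_hex value in the OPTIONS example.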

Implementations§

Source§

impl TableParquetOptions

Source

pub fn new() -> Self

Returns a new default TableParquetOptions.

Source

pub fn with_skip_arrow_metadata(self, skip: bool) -> Self

Sets whether to skip encoding the Arrow metadata when writing Parquet.

By default (skip = false), the Arrow schema is encoded in the file kv_metadata.

Source

pub fn entries(self: &TableParquetOptions) -> Vec<ConfigEntry>

Retrieves all configuration entries from this TableParquetOptions.

§Returns

A vector of ConfigEntry instances, representing all the configuration options within this TableParquetOptions.

Source§

impl TableParquetOptions

Source

pub fn arrow_schema(&mut self, schema: &Arc<Schema>)

Available on crate feature parquet only.

Adds the Arrow schema to the Parquet kv_metadata. If an entry already exists, it is overwritten.

Trait Implementations§

Source§

impl Clone for TableParquetOptions

Source§

fn clone(&self) -> TableParquetOptions

Returns a duplicate of the value.
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.
Source§

impl ConfigField for TableParquetOptions

Source§

fn visit<V: Visit>( &self, v: &mut V, key_prefix: &str, description: &'static str, )

Source§

fn set(&mut self, key: &str, value: &str) -> Result<()>

Source§

impl Debug for TableParquetOptions

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
Source§

impl Default for TableParquetOptions

Source§

fn default() -> TableParquetOptions

Returns the “default value” for a type.
Source§

impl PartialEq for TableParquetOptions

Source§

fn eq(&self, other: &TableParquetOptions) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl TryFrom<&TableParquetOptions> for ParquetWriterOptions

Available on crate feature parquet only.
Source§

type Error = DataFusionError

The type returned in the event of a conversion error.
Source§

fn try_from(parquet_table_options: &TableParquetOptions) -> Result<Self>

Performs the conversion.
Source§

impl TryFrom<&TableParquetOptions> for WriterPropertiesBuilder

Available on crate feature parquet only.
Source§

fn try_from(table_parquet_options: &TableParquetOptions) -> Result<Self>

Convert the session’s TableParquetOptions into a single write action’s WriterPropertiesBuilder.

The returned WriterPropertiesBuilder includes customizations applicable per column. Note that any encryption options are ignored as building the FileEncryptionProperties might require other inputs besides the TableParquetOptions.

Source§

type Error = DataFusionError

The type returned in the event of a conversion error.
Source§

impl StructuralPartialEq for TableParquetOptions

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self.
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value.
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

Source§

impl<T> ErasedDestructor for T
where T: 'static,

Source§

impl<T> Ungil for T
where T: Send,