Struct ExternalDataConfiguration

Source

pub struct ExternalDataConfiguration {Show 27 fields
    pub autodetect: Option<bool>,
    pub avro_options: Option<AvroOptions>,
    pub bigtable_options: Option<BigtableOptions>,
    pub compression: Option<String>,
    pub connection_id: Option<String>,
    pub csv_options: Option<CsvOptions>,
    pub date_format: Option<String>,
    pub datetime_format: Option<String>,
    pub decimal_target_types: Option<Vec<String>>,
    pub file_set_spec_type: Option<String>,
    pub google_sheets_options: Option<GoogleSheetsOptions>,
    pub hive_partitioning_options: Option<HivePartitioningOptions>,
    pub ignore_unknown_values: Option<bool>,
    pub json_extension: Option<String>,
    pub json_options: Option<JsonOptions>,
    pub max_bad_records: Option<i32>,
    pub metadata_cache_mode: Option<String>,
    pub object_metadata: Option<String>,
    pub parquet_options: Option<ParquetOptions>,
    pub reference_file_schema_uri: Option<String>,
    pub schema: Option<TableSchema>,
    pub source_format: Option<String>,
    pub source_uris: Option<Vec<String>>,
    pub time_format: Option<String>,
    pub time_zone: Option<String>,
    pub timestamp_format: Option<String>,
    pub timestamp_target_precision: Option<Vec<i32>>,
}

Expand description

There is no detailed description.

This type is not used in any activity, and only used as part of another schema.

Fields§

§autodetect: Option<bool>

Try to detect schema and format options automatically. Any option specified explicitly will be honored.

§avro_options: Option<AvroOptions>

Optional. Additional properties to set if sourceFormat is set to AVRO.

§bigtable_options: Option<BigtableOptions>

Optional. Additional options if sourceFormat is set to BIGTABLE.

§compression: Option<String>

Optional. The compression type of the data source. Possible values include GZIP and NONE. The default value is NONE. This setting is ignored for Google Cloud Bigtable, Google Cloud Datastore backups, Avro, ORC and Parquet formats. An empty string is an invalid value.

§connection_id: Option<String>

Optional. The connection specifying the credentials to be used to read external storage, such as Azure Blob, Cloud Storage, or S3. The connection_id can have the form {project_id}.{location_id};{connection_id} or projects/{project_id}/locations/{location_id}/connections/{connection_id}.

§csv_options: Option<CsvOptions>

Optional. Additional properties to set if sourceFormat is set to CSV.

§date_format: Option<String>

Optional. Format used to parse DATE values. Supports C-style and SQL-style values.

§datetime_format: Option<String>

Optional. Format used to parse DATETIME values. Supports C-style and SQL-style values.

§decimal_target_types: Option<Vec<String>>

Defines the list of possible SQL data types to which the source decimal values are converted. This list and the precision and the scale parameters of the decimal field determine the target type. In the order of NUMERIC, BIGNUMERIC, and STRING, a type is picked if it is in the specified list and if it supports the precision and the scale. STRING supports all precision and scale values. If none of the listed types supports the precision and the scale, the type supporting the widest range in the specified list is picked, and if a value exceeds the supported range when reading the data, an error will be thrown. Example: Suppose the value of this field is [“NUMERIC”, “BIGNUMERIC”]. If (precision,scale) is: * (38,9) -> NUMERIC; * (39,9) -> BIGNUMERIC (NUMERIC cannot hold 30 integer digits); * (38,10) -> BIGNUMERIC (NUMERIC cannot hold 10 fractional digits); * (76,38) -> BIGNUMERIC; * (77,38) -> BIGNUMERIC (error if value exceeds supported range). This field cannot contain duplicate types. The order of the types in this field is ignored. For example, [“BIGNUMERIC”, “NUMERIC”] is the same as [“NUMERIC”, “BIGNUMERIC”] and NUMERIC always takes precedence over BIGNUMERIC. Defaults to [“NUMERIC”, “STRING”] for ORC and [“NUMERIC”] for the other file formats.

§file_set_spec_type: Option<String>

Optional. Specifies how source URIs are interpreted for constructing the file set to load. By default source URIs are expanded against the underlying storage. Other options include specifying manifest files. Only applicable to object storage systems.

§google_sheets_options: Option<GoogleSheetsOptions>

Optional. Additional options if sourceFormat is set to GOOGLE_SHEETS.

§hive_partitioning_options: Option<HivePartitioningOptions>

Optional. When set, configures hive partitioning support. Not all storage formats support hive partitioning – requesting hive partitioning on an unsupported format will lead to an error, as will providing an invalid specification.

§ignore_unknown_values: Option<bool>

Optional. Indicates if BigQuery should allow extra values that are not represented in the table schema. If true, the extra values are ignored. If false, records with extra columns are treated as bad records, and if there are too many bad records, an invalid error is returned in the job result. The default value is false. The sourceFormat property determines what BigQuery treats as an extra value: CSV: Trailing columns JSON: Named values that don’t match any column names Google Cloud Bigtable: This setting is ignored. Google Cloud Datastore backups: This setting is ignored. Avro: This setting is ignored. ORC: This setting is ignored. Parquet: This setting is ignored.

§json_extension: Option<String>

Optional. Load option to be used together with source_format newline-delimited JSON to indicate that a variant of JSON is being loaded. To load newline-delimited GeoJSON, specify GEOJSON (and source_format must be set to NEWLINE_DELIMITED_JSON).

§json_options: Option<JsonOptions>

Optional. Additional properties to set if sourceFormat is set to JSON.

§max_bad_records: Option<i32>

Optional. The maximum number of bad records that BigQuery can ignore when reading data. If the number of bad records exceeds this value, an invalid error is returned in the job result. The default value is 0, which requires that all records are valid. This setting is ignored for Google Cloud Bigtable, Google Cloud Datastore backups, Avro, ORC and Parquet formats.

§metadata_cache_mode: Option<String>

Optional. Metadata Cache Mode for the table. Set this to enable caching of metadata from external data source.

§object_metadata: Option<String>

Optional. ObjectMetadata is used to create Object Tables. Object Tables contain a listing of objects (with their metadata) found at the source_uris. If ObjectMetadata is set, source_format should be omitted. Currently SIMPLE is the only supported Object Metadata type.

§parquet_options: Option<ParquetOptions>

Optional. Additional properties to set if sourceFormat is set to PARQUET.

§reference_file_schema_uri: Option<String>

Optional. When creating an external table, the user can provide a reference file with the table schema. This is enabled for the following formats: AVRO, PARQUET, ORC.

§schema: Option<TableSchema>

Optional. The schema for the data. Schema is required for CSV and JSON formats if autodetect is not on. Schema is disallowed for Google Cloud Bigtable, Cloud Datastore backups, Avro, ORC and Parquet formats.

§source_format: Option<String>

[Required] The data format. For CSV files, specify “CSV”. For Google sheets, specify “GOOGLE_SHEETS”. For newline-delimited JSON, specify “NEWLINE_DELIMITED_JSON”. For Avro files, specify “AVRO”. For Google Cloud Datastore backups, specify “DATASTORE_BACKUP”. For Apache Iceberg tables, specify “ICEBERG”. For ORC files, specify “ORC”. For Parquet files, specify “PARQUET”. [Beta] For Google Cloud Bigtable, specify “BIGTABLE”.

§source_uris: Option<Vec<String>>

[Required] The fully-qualified URIs that point to your data in Google Cloud. For Google Cloud Storage URIs: Each URI can contain one ‘’ wildcard character and it must come after the ‘bucket’ name. Size limits related to load jobs apply to external data sources. For Google Cloud Bigtable URIs: Exactly one URI can be specified and it has be a fully specified and valid HTTPS URL for a Google Cloud Bigtable table. For Google Cloud Datastore backups, exactly one URI can be specified. Also, the ‘’ wildcard character is not allowed.

§time_format: Option<String>

Optional. Format used to parse TIME values. Supports C-style and SQL-style values.

§time_zone: Option<String>

Optional. Time zone used when parsing timestamp values that do not have specific time zone information (e.g. 2024-04-20 12:34:56). The expected format is a IANA timezone string (e.g. America/Los_Angeles).

§timestamp_format: Option<String>

Optional. Format used to parse TIMESTAMP values. Supports C-style and SQL-style values.

§timestamp_target_precision: Option<Vec<i32>>

Precisions (maximum number of total digits in base 10) for seconds of TIMESTAMP types that are allowed to the destination table for autodetection mode. Available for the formats: CSV. For the CSV Format, Possible values include: Not Specified, [], or [6]: timestamp(6) for all auto detected TIMESTAMP columns [6, 12]: timestamp(6) for all auto detected TIMESTAMP columns that have less than 6 digits of subseconds. timestamp(12) for all auto detected TIMESTAMP columns that have more than 6 digits of subseconds. [12]: timestamp(12) for all auto detected TIMESTAMP columns. The order of the elements in this array is ignored. Inputs that have higher precision than the highest target precision in this array will be truncated.

ExternalDataConfiguration

Struct ExternalDataConfiguration Copy item path

Fields§

Trait Implementations§

impl Clone for ExternalDataConfiguration

fn clone(&self) -> ExternalDataConfiguration

fn clone_from(&mut self, source: &Self)

impl Debug for ExternalDataConfiguration

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for ExternalDataConfiguration

fn default() -> ExternalDataConfiguration

impl<'de> Deserialize<'de> for ExternalDataConfiguration

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl Serialize for ExternalDataConfiguration

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

impl Part for ExternalDataConfiguration

Auto Trait Implementations§

impl Freeze for ExternalDataConfiguration

impl RefUnwindSafe for ExternalDataConfiguration

impl Send for ExternalDataConfiguration

impl Sync for ExternalDataConfiguration

impl Unpin for ExternalDataConfiguration

impl UnwindSafe for ExternalDataConfiguration

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

Struct ExternalDataConfiguration

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,