pub struct PySparkJob {
pub archive_uris: Option<Vec<String>>,
pub args: Option<Vec<String>>,
pub file_uris: Option<Vec<String>>,
pub jar_file_uris: Option<Vec<String>>,
pub logging_config: Option<LoggingConfig>,
pub main_python_file_uri: Option<String>,
pub properties: Option<HashMap<String, String>>,
pub python_file_uris: Option<Vec<String>>,
}
A Dataproc job for running Apache PySpark (https://spark.apache.org/docs/latest/api/python/index.html#pyspark-overview) applications on YARN.
This type is not used in any activity, and only used as part of another schema.
Fields
archive_uris: Option<Vec<String>>
Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip. Note: Spark applications must be deployed in cluster mode (https://spark.apache.org/docs/latest/cluster-overview.html) for correct environment propagation.
args: Option<Vec<String>>
Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
file_uris: Option<Vec<String>>
Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
jar_file_uris: Option<Vec<String>>
Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
logging_config: Option<LoggingConfig>
Optional. The runtime log config for job execution.
main_python_file_uri: Option<String>
Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
properties: Option<HashMap<String, String>>
Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API might be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
python_file_uris: Option<Vec<String>>
Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
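Example
The following is a minimal sketch of how the fields might be populated, not the crate's own example. So that it compiles on its own, it declares a local mirror of PySparkJob and an empty stand-in for LoggingConfig. The Default, Debug, and PartialEq derives on the mirror, the gs://example-bucket URIs, and the helper functions example_job and args_contain_conf are all assumptions made for illustration.

```rust
use std::collections::HashMap;

// Stand-in for the crate's LoggingConfig; its real fields are not shown on this page.
#[derive(Clone, Debug, Default, PartialEq)]
pub struct LoggingConfig {}

// Local mirror of the struct documented above. The extra derives
// (Debug, Default, PartialEq) are assumptions for this sketch.
#[derive(Clone, Debug, Default, PartialEq)]
pub struct PySparkJob {
    pub archive_uris: Option<Vec<String>>,
    pub args: Option<Vec<String>>,
    pub file_uris: Option<Vec<String>>,
    pub jar_file_uris: Option<Vec<String>>,
    pub logging_config: Option<LoggingConfig>,
    pub main_python_file_uri: Option<String>,
    pub properties: Option<HashMap<String, String>>,
    pub python_file_uris: Option<Vec<String>>,
}

// Hypothetical helper: sets the required main file plus a few optional fields.
// All URIs are placeholders.
pub fn example_job() -> PySparkJob {
    let mut properties = HashMap::new();
    // Spark settings go in `properties` rather than in `args`.
    properties.insert("spark.executor.memory".to_string(), "4g".to_string());

    PySparkJob {
        main_python_file_uri: Some("gs://example-bucket/jobs/main.py".to_string()),
        args: Some(vec![
            "--input".to_string(),
            "gs://example-bucket/data/".to_string(),
        ]),
        python_file_uris: Some(vec!["gs://example-bucket/jobs/helpers.zip".to_string()]),
        properties: Some(properties),
        // Every other optional field stays None.
        ..Default::default()
    }
}

// Hypothetical check for the collision described under `args`:
// returns true if a driver argument tries to pass --conf.
pub fn args_contain_conf(job: &PySparkJob) -> bool {
    job.args
        .as_deref()
        .unwrap_or(&[])
        .iter()
        .any(|a| a == "--conf" || a.starts_with("--conf="))
}
```

Calling example_job() yields a value whose unset fields are None, and args_contain_conf(&job) can be run before submission to catch a --conf argument that should have been a property instead.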
Trait Implementations
impl Clone for PySparkJob
fn clone(&self) -> PySparkJob
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.