rust-data-processing 0.3.3

Schema-first ingestion (CSV, JSON, Parquet, Excel) into an in-memory DataSet, plus Polars-backed pipelines, SQL, profiling, validation, and map/reduce-style processing.
SPARK_HANDOFF_URI=s3://rdp-spark-handoff/out.parquet
SPARK_MASTER_URL=spark://127.0.0.1:7077
AWS_ACCESS_KEY_ID=rdp_minio
AWS_SECRET_ACCESS_KEY=rdp_minio_secret
AWS_DEFAULT_REGION=us-east-1
AWS_ENDPOINT=http://127.0.0.1:9000
AWS_ALLOW_HTTP=true
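
The variables above describe a local setup: a Spark master on port 7077 and an S3-compatible endpoint on port 9000 reached over plain HTTP. As a minimal sketch of how such a file could be checked before a job starts, the following parses `KEY=VALUE` lines and collects the non-secret settings. The names `parse_env_lines`, `HandoffSettings`, and `load_settings` are hypothetical and are not part of the rust-data-processing API; the sketch uses only the standard library and assumes that a missing `AWS_ALLOW_HTTP` should be treated as `false`.

```rust
use std::collections::HashMap;

/// Parse dotenv-style `KEY=VALUE` lines into a map.
/// Blank lines and lines starting with `#` are skipped;
/// each line is split at the first `=` only.
fn parse_env_lines(text: &str) -> HashMap<String, String> {
    text.lines()
        .map(str::trim)
        .filter(|line| !line.is_empty() && !line.starts_with('#'))
        .filter_map(|line| line.split_once('='))
        .map(|(key, value)| (key.trim().to_string(), value.trim().to_string()))
        .collect()
}

/// Hypothetical container for the non-secret settings listed above.
#[derive(Debug, PartialEq)]
struct HandoffSettings {
    handoff_uri: String,
    spark_master_url: String,
    endpoint: String,
    region: String,
    allow_http: bool,
}

/// Build `HandoffSettings` from parsed variables, reporting the first missing key.
/// The credential variables are only checked for presence, not stored.
fn load_settings(vars: &HashMap<String, String>) -> Result<HandoffSettings, String> {
    let get = |key: &str| -> Result<String, String> {
        vars.get(key)
            .cloned()
            .ok_or_else(|| format!("missing variable: {}", key))
    };

    // Presence check only (assumption: an S3 client reads these itself).
    get("AWS_ACCESS_KEY_ID")?;
    get("AWS_SECRET_ACCESS_KEY")?;

    Ok(HandoffSettings {
        handoff_uri: get("SPARK_HANDOFF_URI")?,
        spark_master_url: get("SPARK_MASTER_URL")?,
        endpoint: get("AWS_ENDPOINT")?,
        region: get("AWS_DEFAULT_REGION")?,
        // Assumption: anything other than "true" (case-insensitive) means false.
        allow_http: vars
            .get("AWS_ALLOW_HTTP")
            .map(|v| v.eq_ignore_ascii_case("true"))
            .unwrap_or(false),
    })
}
```

To check the live process environment instead of a file, the same `load_settings` function could be called with `std::env::vars().collect::<HashMap<_, _>>()`.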