rust-data-processing 0.3.1

Schema-first ingestion (CSV, JSON, Parquet, Excel) into an in-memory DataSet, plus Polars-backed pipelines, SQL, profiling, validation, and map/reduce-style processing.
1
2
3
4
5
6
7
8
9
10
11
# People fixture schemas

| File | Role |
| --- | --- |
| `../people.csv`, `../people.json`, `../people.xlsx` | Ingest samples |
| `schemas/*.schema.json` | Shared serde schemas for JVM / Python tests |
| `payloads/json_path_dataset.payload.json` | `rdp_ingest_ordered_paths_json` for `people.json` (`JsonParquetExcelSnippets.java`; Rust/Python `*_json_parquet_excel_snippets_*`) |
| `payloads/csv_path_dataset.payload.json` | Same for `people.csv` |
| `payloads/excel_sheet_dataset.payload.json` | Excel + `people_flat` schema (`ExcelSnippets.java`; Rust `excel_snippets_fixtures`, Python `test_excel_snippets_fixtures.py`, JVM `runExcelSnippetsViaPayloadContract`) |
| `payloads/json_path_ingest.options.json`, `csv_path_ingest.options.json` | Format options for path FFI (non-empty only) |
| `pipelines/csv_to_parquet.pipeline.json` | CSV → `parquet_file` (`ParquetSnippets.java`; Rust `tests/parquet_snippets_fixtures.rs`, `jvm-sys` `run_pipeline_people_csv_to_parquet_committed_fixture`, Python `test_parquet_snippets_fixtures.py`) |