data-doctor-cli 1.0.2

A powerful data validation and cleaning tool for JSON and CSV files

DataDoctor CLI 🩺

DataDoctor CLI is a command-line tool for diagnosing and fixing data quality issues in JSON and CSV files. It acts as a doctor for your datasets: it checks files for errors, repairs common syntax and structure problems, and can report what it found.

🚀 Installation

From Crates.io (Recommended)

cargo install data-doctor-cli

Note: The binary name is data-doctor.

📖 Usage

1. Validate Data

Check your data for errors without modifying it.

data-doctor validate data.json

2. Fix Data

Auto-correct common issues and write the repaired data to a new file.

data-doctor fix broken.json --out fixed.json

What it fixes:

  • JSON: Trailing commas, missing quotes, single quotes, unclosed brackets, etc.
  • CSV: Missing columns (padding), extra columns (trimming), type mismatches, boolean normalization.

3. The "Doctor" Command

Run a full diagnosis, fix issues, and generate a report in one go.

data-doctor doctor input.csv --out cleaned.csv --report-json

Options

  • --format <json|csv>: Manually specify file format (auto-detected by default).
  • --report-json: Output the validation report in machine-readable JSON format.
  • --schema <file>: Validate against a custom schema.
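
As a rough illustration of when --format is useful, the sketch below stores CSV content under a .txt extension, where auto-detection may have less to go on, and states the format explicitly. The file name export.txt and its contents are hypothetical, and combining --format with the doctor command and --out is an assumption based on the options listed above, not something this README confirms.

```shell
# Hypothetical sample: CSV content stored under a .txt extension.
cat > export.txt <<'EOF'
id,name
1,Alice
2,Bob
EOF

# Run the tool only if the binary is installed.
if command -v data-doctor >/dev/null 2>&1; then
  # Assumption: --format can be combined with the doctor command and --out.
  data-doctor doctor export.txt --format csv --out cleaned.csv
else
  echo "data-doctor not found; install it with: cargo install data-doctor-cli"
fi
```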

⚡ Examples

Fixing a broken JSON file:

# Input: { name: 'John', age: 30, }

data-doctor fix user.json --out clean.json

# Output: { "name": "John", "age": 30 }

Fixing CSV column alignment:

data-doctor fix data.csv --out fixed.csv
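
To try the command above, a small file with ragged rows is enough. The sketch below writes a hypothetical data.csv (the column names and values are made up for illustration) in which one row is missing a column and another has one too many, then runs the documented fix command if data-doctor is on the PATH. The exact contents of fixed.csv depend on the tool and are not shown here.

```shell
# Hypothetical sample data: the header defines 3 columns.
cat > data.csv <<'EOF'
id,name,active
1,Alice,yes
2,Bob
3,Carol,no,extra
EOF
# Row "2,Bob" is missing a column; row "3,Carol,no,extra" has an extra one.

# Run the documented fix command only if the binary is installed.
if command -v data-doctor >/dev/null 2>&1; then
  data-doctor fix data.csv --out fixed.csv
else
  echo "data-doctor not found; install it with: cargo install data-doctor-cli"
fi
```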

📄 License

This project is licensed under the MIT License.