DataDoctor CLI 🩺
DataDoctor CLI is a command-line tool for diagnosing and fixing data quality issues in your files. It acts as a doctor for your datasets, repairing broken JSON and CSV files.
🚀 Installation
From Crates.io (Recommended)
cargo install data-doctor-cli
Note: The crate is published as data-doctor-cli, but the installed binary is named data-doctor.
📖 Usage
1. Validate Data
Check your data for errors without modifying it.
2. Fix Data
Automatically correct common issues.
What it fixes:
- JSON: Trailing commas, missing quotes, single quotes, unclosed brackets, etc.
- CSV: Missing columns (padding), extra columns (trimming), type mismatches, boolean normalization.
3. The "Doctor" Command
Run a full diagnosis, fix issues, and generate a report in one go.
Options
- --format <json|csv>: Manually specify the file format (auto-detected by default).
- --report-json: Output the validation report in machine-readable JSON format.
- --schema <file>: Validate against a custom schema.
⚡ Examples
Fixing a broken JSON file:
# Input: { name: 'John', age: 30, }
# Output: { "name": "John", "age": 30 }
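To make the transformation above concrete, here is a minimal Python sketch of the same kind of repair. It is an illustration only and not DataDoctor's own implementation: the function name repair_json and the regex-based approach are assumptions, and it only handles the simple cases shown in the example (bare keys, single-quoted strings, trailing commas).

```python
import json
import re


def repair_json(text):
    """Naively repair a few common JSON mistakes, then parse the result.

    Hypothetical helper for illustration; not part of DataDoctor.
    """
    # Convert single-quoted strings to double-quoted ones.
    # Naive: does not handle escaped or embedded quotes.
    text = re.sub(r"'([^'\\]*)'", r'"\1"', text)
    # Quote bare identifiers used as object keys, e.g. { name: ... }.
    text = re.sub(r'([{,]\s*)([A-Za-z_][A-Za-z0-9_]*)(\s*:)', r'\1"\2"\3', text)
    # Drop trailing commas before a closing brace or bracket.
    text = re.sub(r',\s*([}\]])', r'\1', text)
    return json.loads(text)


fixed = repair_json("{ name: 'John', age: 30, }")
print(json.dumps(fixed))
```

Because it works with regular expressions rather than a tolerant parser, this sketch can mangle string values that happen to contain text resembling a key or a trailing comma. A real repair tool needs to track whether it is inside a string.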
Fixing CSV column alignment:
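As a rough illustration of what column alignment involves, the Python sketch below pads short rows and trims long ones so every row matches the header width. It is not DataDoctor's code: the names align_row and fix_csv are hypothetical, and padding with empty strings is an assumption about how missing columns are filled.

```python
import csv
import io


def align_row(row, width, fill=""):
    """Pad a short row with `fill` and trim a long row to `width` columns."""
    return (row + [fill] * (width - len(row)))[:width]


def fix_csv(text):
    """Align every data row to the number of columns in the header row."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return text  # Nothing to align in an empty file.
    width = len(rows[0])
    fixed = [rows[0]] + [align_row(row, width) for row in rows[1:]]
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(fixed)
    return out.getvalue()


broken = "id,name,active\n1,Ann\n2,Bob,yes,extra\n"
print(fix_csv(broken))
```

In this sketch the header row defines the expected width. The row for Ann gains an empty third field, and the row for Bob loses its fourth field.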
📄 License
This project is licensed under the MIT License.