# Data Quality Pipeline
Demonstrates the full data quality pipeline for fine-tuning (GH-453):
1. **PII filtering** -- detection and redaction of emails, IPs, etc.
2. **Quality scoring and filtering** -- length, diversity, repetition, structure
3. **EvolKit-style instruction evolution** -- complexity enhancement
## Run
```bash
cargo run --example data_quality_pipeline
```
## Source
```rust,ignore
// Run this example:
// cargo run --example data_quality_pipeline
//
// See the CLI reference and source code in crates/ for implementation details.
```