scyros 0.2.2 - Docs.rs

Removes duplicate rows from a CSV file.

Two rows are considered duplicates if they share the same value in a user-specified column. By default, the command uses the 'id' column, which typically contains repository IDs.

Prints statistics about the number of duplicates found and writes the deduplicated rows to a new CSV file.

By default, the output file name is the input file name with '.unique.csv' appended. 

Output CSV file format:
  * Same columns as the input file