Crate csv_diff[−][src]
Expand description
Find the difference between two CSVs - with ludicrous speed!π
csv-diff
is a crate for finding the difference between two CSVs.
It is the fastest CSV-diffing library in the world!
Comparing two 1,000,000 rows x 9 columns CSVs takes under 500ms (when using raw bytes).
It is thread-pool-agnostic, meaning you can provide your own existing thread-pool
or you can use the default rayon thread-pool
that is used in this crate.
Use Case
This crate should be used on CSV data that has some sort of primary key for uniquely identifying a record. It is not a general line-by-line diffing crate. You can imagine dumping a database table in CSV format from your test and production system and comparing it with each other to find differences.
Overview
The most important types you will use are:
CsvByteDiff
for comparing two CSVs byte-wise.DiffByteRecords
for getting the actual differences between the compared CSVs as raw bytes.