diff_core 0.1.0

Semantic document matching and diffing.
Documentation
1
2
3
4
5
6
7
8
9
10
# diff_core

Semantic block matching, text hunks, move detection, confidence, and severity defaults.

Current matching uses deterministic exact normalized-text anchors, emits
inserted/deleted/modified changes, relabels matching insert+delete pairs as
moves, and reports layout-only changes when exact text moves beyond the
configured page/bounding-box tolerance. Ordered fuzzy matching uses token-LCS
similarity inside unmatched exact-anchor windows and honors `min_match_score`.
Richer text hunk extraction remains later MVP work.