disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
# Turkish (tr) language-specific overrides
# Convention: fold Turkish-specific letters to ASCII base letters, including the dotted/dotless I pair (İ→I, ı→i, Ğ→G, ğ→g, Ş→S, ş→s)

0130	I
0131	i
011E	G
011F	g
015E	S
015F	s
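
The table above can be read as "hexadecimal code point, whitespace, replacement string". As a rough illustration of how such a file might be consumed, here is a minimal Python sketch. It is not disarm's own loader or API: the names `parse_overrides` and `apply_overrides` are hypothetical, and the sketch assumes that `#` starts a comment line and that each data line holds exactly one code point followed by its replacement.

```python
# Hypothetical sketch: NOT disarm's loader. Assumes "#" comment lines and
# "<hex code point><whitespace><replacement>" data lines, as in the table above.
TURKISH_OVERRIDES = """\
# Turkish (tr) language-specific overrides
0130\tI
0131\ti
011E\tG
011F\tg
015E\tS
015F\ts
"""


def parse_overrides(text: str) -> dict[int, str]:
    """Parse an override table into a {code point: replacement} mapping."""
    table: dict[int, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        code_point, replacement = line.split(None, 1)
        table[int(code_point, 16)] = replacement
    return table


def apply_overrides(s: str, table: dict[int, str]) -> str:
    """Replace every character that has an entry in the table."""
    # str.translate accepts a mapping from code points to replacement strings.
    return s.translate(table)


table = parse_overrides(TURKISH_OVERRIDES)
print(apply_overrides("İstanbul, Iğdır, Şişli", table))
# prints: Istanbul, Igdir, Sisli
```

Keying the mapping by integer code point lets `str.translate` do the substitution in one pass. Characters with no entry, such as plain ASCII `I` and `i`, pass through unchanged.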