disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
# German (de) language-specific overrides
# Convention: standard German umlaut expansion (ä→ae, ö→oe, ü→ue, ß→ss)

00C4	Ae
00D6	Oe
00DC	Ue
00E4	ae
00F6	oe
00FC	ue
1E9E	SS
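
A minimal sketch of how a table in this format could be consumed, assuming each data line is a hexadecimal code point, a tab, and the replacement string, and that blank lines and `#` lines are ignored. The helper names `parse_overrides` and `apply_overrides` are hypothetical and are not part of the `disarm` API; the crate's actual loader may differ.

```rust
use std::collections::HashMap;

/// Hypothetical helper: parse lines of the form `<hex code point>\t<replacement>`.
/// Blank lines, `#` comment lines, and malformed lines are skipped.
fn parse_overrides(table: &str) -> HashMap<char, String> {
    let mut map = HashMap::new();
    for line in table.lines() {
        let line = line.trim_end();
        if line.is_empty() || line.starts_with('#') {
            continue;
        }
        if let Some((hex, replacement)) = line.split_once('\t') {
            // Convert the hex field to a Unicode scalar value, if it is valid.
            let ch = u32::from_str_radix(hex.trim(), 16)
                .ok()
                .and_then(char::from_u32);
            if let Some(ch) = ch {
                map.insert(ch, replacement.to_string());
            }
        }
    }
    map
}

/// Hypothetical helper: replace every character that has an override,
/// leaving all other characters unchanged.
fn apply_overrides(input: &str, overrides: &HashMap<char, String>) -> String {
    let mut out = String::with_capacity(input.len());
    for ch in input.chars() {
        match overrides.get(&ch) {
            Some(replacement) => out.push_str(replacement),
            None => out.push(ch),
        }
    }
    out
}
```

With the seven entries above loaded, `apply_overrides("Müller", &overrides)` would yield `Mueller`. Characters without an entry in this table pass through untouched in this sketch; whatever default handling the library applies to them is outside its scope.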