disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
Documentation
1
2
3
4
5
6
7
8
9
# Estonian (et) language-specific overrides
# Convention: Estonian orthographic conventions (ä→ae, ö→oe, ü→ue, š→sh, ž→zh)

00C4	Ae
00E4	ae
00D6	Oe
00F6	oe
00DC	Ue
00FC	ue