disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
# Icelandic (is) language-specific overrides
# Convention: Icelandic orthographic conventions (Æ→Ae, ð→d, þ→th)

00C6	Ae
00E6	ae