disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
Documentation
1
2
3
4
5
6
7
# Bulgarian (bg) language-specific overrides
# Standard: BGN/PCGN Bulgarian romanization

042A	A
044A	a
0429	Sht
0449	sht