disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
# Russian (ru) language-specific overrides
# Standard: BGN/PCGN Russian romanization (1947, revised 1994)
# Overrides default for: Ё→Yo, Й→Y, Ъ→", Ь→'

0401	Yo
0451	yo
0419	Y
0439	y
042A	\"
044A	\"
042C	'
044C	'
042D	E
044D	e
042E	Yu
044E	yu
042F	Ya
044F	ya