disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
Documentation
# Greek reverse transliteration: romanized ASCII → Greek.
#
# Format: romanized_key\tnative_value

# Digraphs
Th	Θ
th	θ
Ph	Φ
ph	φ
Ch	Χ
ch	χ
Ps	Ψ
ps	ψ

# Single characters — uppercase
A	Α
B	Β
G	Γ
D	Δ
E	Ε
Z	Ζ
I	Η
K	Κ
L	Λ
M	Μ
N	Ν
X	Ξ
O	Ο
P	Π
R	Ρ
S	Σ
T	Τ
# Upsilon: the forward direction romanizes Υ/ύ as "Y" (incl. in the ου/αυ/ευ
# diphthongs → OY/AY/EY), so "Y" must reverse to Υ or a literal Latin Y leaks
# into the Greek output (#82). "U" is also accepted as a secondary convention.
U	Υ
Y	Υ
F	Φ

# Single characters — lowercase
a	α
b	β
g	γ
d	δ
e	ε
z	ζ
i	η
k	κ
l	λ
m	μ
n	ν
x	ξ
o	ο
p	π
r	ρ
s	σ
t	τ
u	υ
y	υ
f	φ