disarm 0.10.0

Unicode canonicalization and TR39 confusable analysis: building blocks for text-security pipelines (homoglyph/bidi/zalgo handling) plus standards-based transliteration
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
# Serbian (sr) language-specific overrides
# Standard: BGN/PCGN Serbian romanization

0402	Dj
0452	dj
040B	C
045B	c
040F	Dz
045F	dz
0409	Lj
0459	lj
040A	Nj
045A	nj
0408	J
0458	j
0419	Y
0439	y