alphabet_detector 0.2.0

Natural language alphabet detection library
# Alphabet Detector

[![Crate](https://img.shields.io/crates/v/alphabet_detector.svg)](https://crates.io/crates/alphabet_detector)
[![API](https://docs.rs/alphabet_detector/badge.svg)](https://docs.rs/alphabet_detector)

### Detects 387 alphabets in 170 scripts
> One spoken language can be written in multiple scripts, so each script is detected as a different alphabet/language.

> See [alphabet.rs](https://github.com/RoDmitry/alphabet_detector/blob/main/src/lang/alphabet.rs#L72) for the languages that already have defined alphabets. Some of them still need validation.

Separates text into words (from a `CharIndices` iterator) and detects the language of each word by the alphabets (chars) it uses.

Warning: can return words containing chars from the Unicode Private Use Area (for the `Yoruba` or `Nuer` languages).
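
To illustrate the general idea of splitting a `CharIndices` stream into words and classifying each word by the chars it contains, here is a minimal standalone sketch. It does not use this crate's API: `Script`, `script_of`, and `words_with_scripts` are hypothetical names, and the three hard-coded Unicode ranges are a rough stand-in for the crate's alphabet tables.

```rust
// Illustrative sketch only: this is NOT the alphabet_detector API.
// `Script`, `script_of`, and `words_with_scripts` are hypothetical names.

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Script {
    Latin,
    Cyrillic,
    Greek,
    Other,
}

/// Maps an alphabetic char to a coarse script using a few Unicode ranges.
/// (Assumption: real alphabet tables are far more detailed than this.)
fn script_of(ch: char) -> Script {
    match ch as u32 {
        0x0041..=0x005A | 0x0061..=0x007A | 0x00C0..=0x024F => Script::Latin,
        0x0370..=0x03FF => Script::Greek,
        0x0400..=0x04FF => Script::Cyrillic,
        _ => Script::Other,
    }
}

/// Splits a `(byte_index, char)` stream into words and tags each word.
/// Returns `(start_byte_index, word, script)`; a word that mixes scripts
/// is tagged `Script::Other`.
fn words_with_scripts(
    chars: impl Iterator<Item = (usize, char)>,
) -> Vec<(usize, String, Script)> {
    let mut out = Vec::new();
    let mut current: Option<(usize, String, Script)> = None;

    for (idx, ch) in chars {
        if ch.is_alphabetic() {
            let script = script_of(ch);
            if let Some((_, word, word_script)) = current.as_mut() {
                // Continue the current word.
                word.push(ch);
                if *word_script != script {
                    *word_script = Script::Other;
                }
            } else {
                // Start a new word at this byte index.
                current = Some((idx, ch.to_string(), script));
            }
        } else if let Some(done) = current.take() {
            // Any non-alphabetic char ends the current word.
            out.push(done);
        }
    }
    // Flush the last word if the text did not end with a separator.
    if let Some(done) = current.take() {
        out.push(done);
    }
    out
}

fn main() {
    let text = "Hello мир γεια";
    for (start, word, script) in words_with_scripts(text.char_indices()) {
        println!("{start}: {word} -> {script:?}");
    }
}
```

The sketch takes any iterator of `(usize, char)` pairs, so `str::char_indices()` can be passed in directly and each word keeps the byte offset where it starts.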