Structs
- HALFWIDTH_KATAKANA
- HALFWIDTH_KATAKANA_TO_ROMAJI
- HIRAGANA
- HIRAGANA_TO_ROMAJI
- JAPANESE
- JAPANESE_PUNCTUATION
- JOUYOU_ATEJI_INDEX
- JOUYOU_KANJI
- JOUYOU_PRONUNCIATION_INDEX
- JOUYOU_TABLE
- JouyouRecord
- KATAKANA
- KATAKANA_TO_ROMAJI
- SIMPLIFIED_CODEPOINTS
- TRADITIONAL_CODEPOINTS
- UNIHAN_ANY_VARIANT
- UNIHAN_CHARACTERS
- UNIHAN_RADICALS
- UNIHAN_SIMPLIFIED_CHINESE
- UnihanCharacter
- UnihanRadical
- UnihanRadicalStrokeCount
Statics
Functions
- get_parts - Returns all radicals or their variants present in the given character.
- get_radical - Returns the Kangxi numerical index of the given character.
- get_radicals - Returns the Kangxi numerical indexes of all radicals or their variants present in the given character.
- get_stroke_count - Returns the number of strokes typically used to write the given character.
- get_variants - Returns all equivalent variants of the given character.
- is_cjk_codepoint - Returns true if the character falls within the CJK Unicode block. That block does not contain all Chinese, Japanese, Korean, or Vietnamese characters.
- is_cjk_punctuation_codepoint - Returns true if the character falls within the CJK punctuation Unicode block. That block does not contain all Chinese, Japanese, Korean, or Vietnamese characters.
- is_cjkish_codepoint - Returns true if the character falls within the CJK or related Unicode blocks. Together these blocks contain most Chinese, Japanese, Korean, and Vietnamese characters, but at the cost of specificity: they also include many potentially illegible codepoints.
- is_japanese - Returns true if the string can almost certainly be read as Japanese.
- is_japanese_codepoint - Returns true if the character falls within the CJK or Japanese-related Unicode blocks.
- is_korean - Returns true if the string can almost certainly be read as Korean.
- is_korean_codepoint - Returns true if the character falls within the CJK or Korean-related Unicode blocks.
- is_simplified_chinese - Returns true if the string can almost certainly be read as Simplified Chinese.
- is_traditional_chinese - Returns true if the string can almost certainly be read as Traditional Chinese.
- is_vietnamese - Returns true if the string can almost certainly be read as Vietnamese.
- to_hangul - Attempts to convert mixed Korean text to Hangul.
- to_pinyin - Attempts to convert Chinese text to pinyin.
- to_romaji - Attempts to convert Japanese text to romaji.
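
The codepoint checks listed above distinguish between a narrow "CJK block" test and a broader "CJK-ish" test. The following is a minimal, self-contained sketch of that distinction using well-known Unicode block ranges. It does not call this crate: the function names (`in_cjk_unified_block`, `in_cjk_punctuation_block`, `in_cjkish_blocks`) are hypothetical, and the exact set of blocks the crate's own functions cover is not stated here, so the ranges below are assumptions chosen for illustration.

```rust
/// Hypothetical narrow check: true if `c` lies in the
/// CJK Unified Ideographs block (U+4E00..=U+9FFF).
fn in_cjk_unified_block(c: char) -> bool {
    ('\u{4E00}'..='\u{9FFF}').contains(&c)
}

/// Hypothetical punctuation check: true if `c` lies in the
/// CJK Symbols and Punctuation block (U+3000..=U+303F).
fn in_cjk_punctuation_block(c: char) -> bool {
    ('\u{3000}'..='\u{303F}').contains(&c)
}

/// Hypothetical broad check: the narrow blocks plus a few related ones.
/// The choice of blocks here is an assumption, not the crate's definition.
fn in_cjkish_blocks(c: char) -> bool {
    in_cjk_unified_block(c)
        || in_cjk_punctuation_block(c)
        || ('\u{3400}'..='\u{4DBF}').contains(&c) // CJK Unified Ideographs Extension A
        || ('\u{3040}'..='\u{309F}').contains(&c) // Hiragana
        || ('\u{30A0}'..='\u{30FF}').contains(&c) // Katakana
        || ('\u{AC00}'..='\u{D7AF}').contains(&c) // Hangul Syllables
}

fn main() {
    // A kanji/hanzi is inside the narrow block.
    assert!(in_cjk_unified_block('漢'));
    // Hiragana is Japanese text but sits outside the narrow block,
    // which is why a block test alone misses some characters.
    assert!(!in_cjk_unified_block('あ'));
    // The broader check accepts kana and Hangul syllables as well.
    assert!(in_cjkish_blocks('あ'));
    assert!(in_cjkish_blocks('한'));
    // The ideographic full stop is in the punctuation block.
    assert!(in_cjk_punctuation_block('。'));
}
```

The trade-off mirrors the one described for is_cjkish_codepoint: widening the set of accepted blocks catches more real text, but a match then says less about which script or language the character belongs to.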