Crate cjk

Structs

HALFWIDTH_KATAKANA
HALFWIDTH_KATAKANA_TO_ROMAJI
HIRAGANA
HIRAGANA_TO_ROMAJI
JAPANESE
JAPANESE_PUNCTUATION
JOUYOU_ATEJI_INDEX
JOUYOU_KANJI
JOUYOU_PRONUNCIATION_INDEX
JOUYOU_TABLE
JouyouRecord
KATAKANA
KATAKANA_TO_ROMAJI
SIMPLIFIED_CODEPOINTS
TRADITIONAL_CODEPOINTS
UNIHAN_ANY_VARIANT
UNIHAN_CHARACTERS
UNIHAN_RADICALS
UNIHAN_SIMPLIFIED_CHINESE
UnihanCharacter
UnihanRadical
UnihanRadicalStrokeCount

Statics

HALFWIDTH_KATAKANA_TO_ROMAJI_DATA
HIRAGANA_TO_ROMAJI_DATA
KATAKANA_TO_ROMAJI_DATA

Functions

get_parts
get_parts returns all radicals or their variants present in the given character.
get_radical
get_radical returns the Kangxi numerical index of the given character.
get_radicals
get_radicals returns the Kangxi numerical indexes of all radicals or their variants present in the given character.
get_stroke_count
get_stroke_count returns the number of strokes typically used to write the given character.
get_variants
get_variants returns all equivalent variants of the given character.
is_cjk_codepoint
is_cjk_codepoint returns true if the character falls within the CJK Unicode block. The CJK Unicode block does not contain all Chinese, Japanese, Korean, or Vietnamese characters.
is_cjk_punctuation_codepoint
is_cjk_punctuation_codepoint returns true if the character falls within the CJK punctuation Unicode block. The CJK punctuation Unicode block does not contain all Chinese, Japanese, Korean, or Vietnamese punctuation characters.
is_cjkish_codepoint
is_cjkish_codepoint returns true if the character falls within the CJK or related Unicode blocks. The CJK-ish Unicode blocks contain most Chinese, Japanese, Korean, and Vietnamese characters. However, this breadth comes at the cost of specificity: the blocks also include many potentially illegible codepoints.
is_japanese
is_japanese returns true if the string can almost certainly be read as Japanese.
is_japanese_codepoint
is_japanese_codepoint returns true if the character falls within the CJK or Japanese-related Unicode blocks.
is_korean
is_korean returns true if the string can almost certainly be read as Korean.
is_korean_codepoint
is_korean_codepoint returns true if the character falls within the CJK or Korean-related Unicode blocks.
is_simplified_chinese
is_simplified_chinese returns true if the string can almost certainly be read as Simplified Chinese.
is_traditional_chinese
is_traditional_chinese returns true if the string can almost certainly be read as Traditional Chinese.
is_vietnamese
is_vietnamese returns true if the string can almost certainly be read as Vietnamese.
to_hangul
to_hangul attempts to convert mixed Korean text to Hangul.
to_pinyin
to_pinyin attempts to convert Chinese text to pinyin.
to_romaji
to_romaji attempts to convert Japanese text to romaji.
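
The codepoint predicates above are described in terms of Unicode block membership. The following is a minimal standalone sketch of that idea, not this crate's implementation: every function name in it is hypothetical, the block ranges are taken from the Unicode standard, and the string-level checks are deliberately crude stand-ins for whatever heuristics is_japanese and is_korean actually apply.

```rust
// Hypothetical helpers for illustration only; these are NOT the cjk crate's
// functions, and the crate's real signatures and logic may differ.

/// True if `c` lies in the CJK Unified Ideographs block (U+4E00..=U+9FFF).
fn in_cjk_unified_block(c: char) -> bool {
    ('\u{4E00}'..='\u{9FFF}').contains(&c)
}

/// True if `c` lies in the CJK Symbols and Punctuation block (U+3000..=U+303F).
fn in_cjk_punctuation_block(c: char) -> bool {
    ('\u{3000}'..='\u{303F}').contains(&c)
}

/// True if `c` lies in the Hiragana (U+3040..=U+309F) or
/// Katakana (U+30A0..=U+30FF) block; the two blocks are adjacent.
fn is_kana(c: char) -> bool {
    ('\u{3040}'..='\u{30FF}').contains(&c)
}

/// True if `c` is a precomposed Hangul syllable (U+AC00..=U+D7A3).
fn is_hangul_syllable(c: char) -> bool {
    ('\u{AC00}'..='\u{D7A3}').contains(&c)
}

/// Assumed heuristic: a string containing any kana is probably Japanese.
fn looks_japanese(s: &str) -> bool {
    s.chars().any(is_kana)
}

/// Assumed heuristic: a string containing any Hangul syllable is probably Korean.
fn looks_korean(s: &str) -> bool {
    s.chars().any(is_hangul_syllable)
}
```

Under these assumptions, `in_cjk_unified_block('漢')` holds, while a character from CJK Unified Ideographs Extension B such as U+20000 falls outside the range. This illustrates why a single-block check does not cover all Chinese, Japanese, Korean, or Vietnamese characters, and why a broader "CJK-ish" check trades specificity for coverage.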