get_parts returns all radicals or their variants
present in the given character.
get_radical returns the Kangxi numerical index of
the given character.
get_radicals returns the Kangxi numerical indexes of
all radicals or their variants present in the given character.
get_stroke_count returns the number of strokes
typically used to write the given character.
get_variants returns all equivalent variants
of the given character.
is_cjk_codepoint returns true if the character falls
within the CJK Unicode block. The CJK Unicode block does not
contain all Chinese, Japanese, Korean, or Vietnamese characters.
is_cjk_punctuation_codepoint returns true if the character falls
within the CJK punctuation Unicode block. The CJK punctuation Unicode block does not
contain all punctuation used in Chinese, Japanese, Korean, or Vietnamese text.
is_cjkish_codepoint returns true if the character falls
within the CJK or related Unicode blocks. The CJK-ish Unicode blocks
contain most Chinese, Japanese, Korean, and Vietnamese characters.
However, this breadth comes at the cost of specificity: the blocks also
contain many codepoints that may be illegible.
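
The three block checks above can be sketched as simple range tests. This is
only an illustration in Python, not the library's implementation: the block
boundaries come from the Unicode block list, but the choice of which blocks
count as "related" (CJK_RELATED below) is an assumption and may differ from
what is_cjkish_codepoint actually covers.

```python
# Inclusive codepoint ranges from the Unicode block list.
CJK_UNIFIED = (0x4E00, 0x9FFF)      # CJK Unified Ideographs
CJK_PUNCTUATION = (0x3000, 0x303F)  # CJK Symbols and Punctuation

# Assumed set of "related" blocks, chosen for illustration only.
CJK_RELATED = [
    (0x2E80, 0x2EFF),    # CJK Radicals Supplement
    (0x2F00, 0x2FDF),    # Kangxi Radicals
    (0x3400, 0x4DBF),    # CJK Unified Ideographs Extension A
    (0xF900, 0xFAFF),    # CJK Compatibility Ideographs
    (0x20000, 0x2A6DF),  # CJK Unified Ideographs Extension B
]


def _in_block(ch, block):
    """Return True if the single character ch lies in the inclusive range."""
    lo, hi = block
    return lo <= ord(ch) <= hi


def is_cjk_codepoint(ch):
    return _in_block(ch, CJK_UNIFIED)


def is_cjk_punctuation_codepoint(ch):
    return _in_block(ch, CJK_PUNCTUATION)


def is_cjkish_codepoint(ch):
    # Broader check: the main block plus every assumed related block.
    return is_cjk_codepoint(ch) or any(_in_block(ch, b) for b in CJK_RELATED)
```

In this sketch a character such as U+3400 (Extension A) fails the narrow
check but passes the broad one, which is the trade-off described above.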
is_japanese returns true if the string can
almost certainly be read as Japanese.
is_japanese_codepoint returns true if the character falls
within the CJK or Japanese-related Unicode blocks.
is_korean returns true if the string can
almost certainly be read as Korean.
is_korean_codepoint returns true if the character falls
within the CJK or Korean-related Unicode blocks.
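
As a rough illustration of the language-specific codepoint checks, the sketch
below tests a character against the main CJK block plus a few script-specific
blocks. It is written in Python under stated assumptions: the block ranges are
standard Unicode ranges, but the lists JAPANESE_BLOCKS and KOREAN_BLOCKS are
hypothetical and may not match the blocks the library really uses.

```python
CJK_UNIFIED = (0x4E00, 0x9FFF)  # CJK Unified Ideographs

# Assumed block lists, for illustration only.
JAPANESE_BLOCKS = [
    CJK_UNIFIED,
    (0x3040, 0x309F),  # Hiragana
    (0x30A0, 0x30FF),  # Katakana
]
KOREAN_BLOCKS = [
    CJK_UNIFIED,
    (0x1100, 0x11FF),  # Hangul Jamo
    (0xAC00, 0xD7AF),  # Hangul Syllables
]


def _in_any_block(ch, blocks):
    """Return True if the single character ch lies in any inclusive range."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in blocks)


def is_japanese_codepoint(ch):
    return _in_any_block(ch, JAPANESE_BLOCKS)


def is_korean_codepoint(ch):
    return _in_any_block(ch, KOREAN_BLOCKS)
```

Because both lists include the shared CJK block, a Han character passes both
checks here, so a codepoint check alone cannot decide the language of a
string. That is presumably why the string-level functions exist separately.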
is_simplified_chinese returns true if the string can
almost certainly be read as Simplified Chinese.
is_traditional_chinese returns true if the string can
almost certainly be read as Traditional Chinese.
is_vietnamese returns true if the string can
almost certainly be read as Vietnamese.
to_hangul attempts to convert mixed Korean text to Hangul.
to_pinyin attempts to convert Chinese text to pinyin.
to_romaji attempts to convert Japanese text to romaji.