Expand description
§Unicode Blocks
This crate contains a list of all unicode blocks and provides some functions to search across them.
The latest version of Unicode: 15.1.0
§Examples
§Given a character, determine what unicode block contains it.
assert_eq!(unicode_blocks::BASIC_LATIN, unicode_blocks::find_unicode_block('A').unwrap());
§Given a unicode block, determine whether it is used in CJK.
assert!(unicode_blocks::is_cjk_block(unicode_blocks::CJK_UNIFIED_IDEOGRAPHS));
§Given a character, determine whether it is in CJK.
assert!(unicode_blocks::is_cjk('。'));
Structs§
Constants§
- ADLAM
- AEGEAN_
NUMBERS - AHOM
- ALCHEMICAL_
SYMBOLS - ALPHABETIC_
PRESENTATION_ FORMS - ANATOLIAN_
HIEROGLYPHS - ANCIENT_
GREEK_ MUSICAL_ NOTATION - ANCIENT_
GREEK_ NUMBERS - ANCIENT_
SYMBOLS - ARABIC
- ARABIC_
EXTENDED_ A - ARABIC_
EXTENDED_ B - ARABIC_
EXTENDED_ C - ARABIC_
MATHEMATICAL_ ALPHABETIC_ SYMBOLS - ARABIC_
PRESENTATION_ FORMS_ A - ARABIC_
PRESENTATION_ FORMS_ B - ARABIC_
SUPPLEMENT - ARMENIAN
- ARROWS
- AVESTAN
- BALINESE
- BAMUM
- BAMUM_
SUPPLEMENT - BASIC_
LATIN - BASSA_
VAH - BATAK
- BENGALI
- BHAIKSUKI
- BLOCK_
ELEMENTS - BOPOMOFO
- BOPOMOFO_
EXTENDED - BOX_
DRAWING - BRAHMI
- BRAILLE_
PATTERNS - BUGINESE
- BUHID
- BYZANTINE_
MUSICAL_ SYMBOLS - CARIAN
- CAUCASIAN_
ALBANIAN - CHAKMA
- CHAM
- CHEROKEE
- CHEROKEE_
SUPPLEMENT - CHESS_
SYMBOLS - CHORASMIAN
- CJK_
COMPATIBILITY - CJK_
COMPATIBILITY_ FORMS - CJK_
COMPATIBILITY_ IDEOGRAPHS - CJK_
COMPATIBILITY_ IDEOGRAPHS_ SUPPLEMENT - CJK_
RADICALS_ SUPPLEMENT - CJK_
STROKES - CJK_
SYMBOLS_ AND_ PUNCTUATION - CJK_
UNIFIED_ IDEOGRAPHS - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ A - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ B - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ C - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ D - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ E - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ F - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ G - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ H - CJK_
UNIFIED_ IDEOGRAPHS_ EXTENSION_ I - COMBINING_
DIACRITICAL_ MARKS - COMBINING_
DIACRITICAL_ MARKS_ EXTENDED - COMBINING_
DIACRITICAL_ MARKS_ FOR_ SYMBOLS - COMBINING_
DIACRITICAL_ MARKS_ SUPPLEMENT - COMBINING_
HALF_ MARKS - COMMON_
INDIC_ NUMBER_ FORMS - CONTROL_
PICTURES - COPTIC
- COPTIC_
EPACT_ NUMBERS - COUNTING_
ROD_ NUMERALS - CUNEIFORM
- CUNEIFORM_
NUMBERS_ AND_ PUNCTUATION - CURRENCY_
SYMBOLS - CYPRIOT_
SYLLABARY - CYPRO_
MINOAN - CYRILLIC
- CYRILLIC_
EXTENDED_ A - CYRILLIC_
EXTENDED_ B - CYRILLIC_
EXTENDED_ C - CYRILLIC_
EXTENDED_ D - CYRILLIC_
SUPPLEMENT - DESERET
- DEVANAGARI
- DEVANAGARI_
EXTENDED - DEVANAGARI_
EXTENDED_ A - DINGBATS
- DIVES_
AKURU - DOGRA
- DOMINO_
TILES - DUPLOYAN
- EARLY_
DYNASTIC_ CUNEIFORM - EGYPTIAN_
HIEROGLYPHS - EGYPTIAN_
HIEROGLYPH_ FORMAT_ CONTROLS - ELBASAN
- ELYMAIC
- EMOTICONS
- ENCLOSED_
ALPHANUMERICS - ENCLOSED_
ALPHANUMERIC_ SUPPLEMENT - ENCLOSED_
CJK_ LETTERS_ AND_ MONTHS - ENCLOSED_
IDEOGRAPHIC_ SUPPLEMENT - ETHIOPIC
- ETHIOPIC_
EXTENDED - ETHIOPIC_
EXTENDED_ A - ETHIOPIC_
EXTENDED_ B - ETHIOPIC_
SUPPLEMENT - GENERAL_
PUNCTUATION - GEOMETRIC_
SHAPES - GEOMETRIC_
SHAPES_ EXTENDED - GEORGIAN
- GEORGIAN_
EXTENDED - GEORGIAN_
SUPPLEMENT - GLAGOLITIC
- GLAGOLITIC_
SUPPLEMENT - GOTHIC
- GRANTHA
- GREEK_
AND_ COPTIC - GREEK_
EXTENDED - GUJARATI
- GUNJALA_
GONDI - GURMUKHI
- HALFWIDTH_
AND_ FULLWIDTH_ FORMS - HANGUL_
COMPATIBILITY_ JAMO - HANGUL_
JAMO - HANGUL_
JAMO_ EXTENDED_ A - HANGUL_
JAMO_ EXTENDED_ B - HANGUL_
SYLLABLES - HANIFI_
ROHINGYA - HANUNOO
- HATRAN
- HEBREW
- HIGH_
PRIVATE_ USE_ SURROGATES - HIGH_
SURROGATES - HIRAGANA
- IDEOGRAPHIC_
DESCRIPTION_ CHARACTERS - IDEOGRAPHIC_
SYMBOLS_ AND_ PUNCTUATION - IMPERIAL_
ARAMAIC - INDIC_
SIYAQ_ NUMBERS - INSCRIPTIONAL_
PAHLAVI - INSCRIPTIONAL_
PARTHIAN - IPA_
EXTENSIONS - JAVANESE
- KAITHI
- KAKTOVIK_
NUMERALS - KANA_
EXTENDED_ A - KANA_
EXTENDED_ B - KANA_
SUPPLEMENT - KANBUN
- KANGXI_
RADICALS - KANNADA
- KATAKANA
- KATAKANA_
PHONETIC_ EXTENSIONS - KAWI
- KAYAH_
LI - KHAROSHTHI
- KHITAN_
SMALL_ SCRIPT - KHMER
- KHMER_
SYMBOLS - KHOJKI
- KHUDAWADI
- LAO
- LATIN_
1_ SUPPLEMENT - LATIN_
EXTENDED_ A - LATIN_
EXTENDED_ ADDITIONAL - LATIN_
EXTENDED_ B - LATIN_
EXTENDED_ C - LATIN_
EXTENDED_ D - LATIN_
EXTENDED_ E - LATIN_
EXTENDED_ F - LATIN_
EXTENDED_ G - LEPCHA
- LETTERLIKE_
SYMBOLS - LIMBU
- LINEAR_
A - LINEAR_
B_ IDEOGRAMS - LINEAR_
B_ SYLLABARY - LISU
- LISU_
SUPPLEMENT - LOW_
SURROGATES - LYCIAN
- LYDIAN
- MAHAJANI
- MAHJONG_
TILES - MAKASAR
- MALAYALAM
- MANDAIC
- MANICHAEAN
- MARCHEN
- MASARAM_
GONDI - MATHEMATICAL_
ALPHANUMERIC_ SYMBOLS - MATHEMATICAL_
OPERATORS - MAYAN_
NUMERALS - MEDEFAIDRIN
- MEETEI_
MAYEK - MEETEI_
MAYEK_ EXTENSIONS - MENDE_
KIKAKUI - MEROITIC_
CURSIVE - MEROITIC_
HIEROGLYPHS - MIAO
- MISCELLANEOUS_
MATHEMATICAL_ SYMBOLS_ A - MISCELLANEOUS_
MATHEMATICAL_ SYMBOLS_ B - MISCELLANEOUS_
SYMBOLS - MISCELLANEOUS_
SYMBOLS_ AND_ ARROWS - MISCELLANEOUS_
SYMBOLS_ AND_ PICTOGRAPHS - MISCELLANEOUS_
TECHNICAL - MODI
- MODIFIER_
TONE_ LETTERS - MONGOLIAN
- MONGOLIAN_
SUPPLEMENT - MRO
- MULTANI
- MUSICAL_
SYMBOLS - MYANMAR
- MYANMAR_
EXTENDED_ A - MYANMAR_
EXTENDED_ B - NABATAEAN
- NAG_
MUNDARI - NANDINAGARI
- NEWA
- NEW_
TAI_ LUE - NKO
- NUMBER_
FORMS - NUSHU
- NYIAKENG_
PUACHUE_ HMONG - OGHAM
- OLD_
HUNGARIAN - OLD_
ITALIC - OLD_
NORTH_ ARABIAN - OLD_
PERMIC - OLD_
PERSIAN - OLD_
SOGDIAN - OLD_
SOUTH_ ARABIAN - OLD_
TURKIC - OLD_
UYGHUR - OL_
CHIKI - OPTICAL_
CHARACTER_ RECOGNITION - ORIYA
- ORNAMENTAL_
DINGBATS - OSAGE
- OSMANYA
- OTTOMAN_
SIYAQ_ NUMBERS - PAHAWH_
HMONG - PALMYRENE
- PAU_
CIN_ HAU - PHAGS_
PA - PHAISTOS_
DISC - PHOENICIAN
- PHONETIC_
EXTENSIONS - PHONETIC_
EXTENSIONS_ SUPPLEMENT - PLAYING_
CARDS - PRIVATE_
USE_ AREA - PSALTER_
PAHLAVI - REJANG
- RUMI_
NUMERAL_ SYMBOLS - RUNIC
- SAMARITAN
- SAURASHTRA
- SHARADA
- SHAVIAN
- SHORTHAND_
FORMAT_ CONTROLS - SIDDHAM
- SINHALA
- SINHALA_
ARCHAIC_ NUMBERS - SMALL_
FORM_ VARIANTS - SMALL_
KANA_ EXTENSION - SOGDIAN
- SORA_
SOMPENG - SOYOMBO
- SPACING_
MODIFIER_ LETTERS - SPECIALS
- SUNDANESE
- SUNDANESE_
SUPPLEMENT - SUPERSCRIPTS_
AND_ SUBSCRIPTS - SUPPLEMENTAL_
ARROWS_ A - SUPPLEMENTAL_
ARROWS_ B - SUPPLEMENTAL_
ARROWS_ C - SUPPLEMENTAL_
MATHEMATICAL_ OPERATORS - SUPPLEMENTAL_
PUNCTUATION - SUPPLEMENTAL_
SYMBOLS_ AND_ PICTOGRAPHS - SUPPLEMENTARY_
PRIVATE_ USE_ AREA_ A - SUPPLEMENTARY_
PRIVATE_ USE_ AREA_ B - SUTTON_
SIGNWRITING - SYLOTI_
NAGRI - SYMBOLS_
AND_ PICTOGRAPHS_ EXTENDED_ A - SYMBOLS_
FOR_ LEGACY_ COMPUTING - SYRIAC
- SYRIAC_
SUPPLEMENT - TAGALOG
- TAGBANWA
- TAGS
- TAI_LE
- TAI_
THAM - TAI_
VIET - TAI_
XUAN_ JING_ SYMBOLS - TAKRI
- TAMIL
- TAMIL_
SUPPLEMENT - TANGSA
- TANGUT
- TANGUT_
COMPONENTS - TANGUT_
SUPPLEMENT - TELUGU
- THAANA
- THAI
- TIBETAN
- TIFINAGH
- TIRHUTA
- TOTO
- TRANSPORT_
AND_ MAP_ SYMBOLS - UGARITIC
- UNIFIED_
CANADIAN_ ABORIGINAL_ SYLLABICS - UNIFIED_
CANADIAN_ ABORIGINAL_ SYLLABICS_ EXTENDED - UNIFIED_
CANADIAN_ ABORIGINAL_ SYLLABICS_ EXTENDED_ A - VAI
- VARIATION_
SELECTORS - VARIATION_
SELECTORS_ SUPPLEMENT - VEDIC_
EXTENSIONS - VERSION
- VERTICAL_
FORMS - VITHKUQI
- WANCHO
- WARANG_
CITI - YEZIDI
- YIJING_
HEXAGRAM_ SYMBOLS - YI_
RADICALS - YI_
SYLLABLES - ZANABAZAR_
SQUARE - ZNAMENNY_
MUSICAL_ NOTATION
Functions§
- find_
unicode_ block - Given a character, determine what unicode block contains it.
- is_cjk
- Given a character, determine whether it is used in CJK.
- is_
cjk_ block - Given a
UnicodeBlock
, determine whether it is used in CJK.