Unicode Properties (from Unicode Version: 8.0.0)
- 1: Any
- 2: Assigned
- 3: C
- 4: Cc
- 5: Cf
- 6: Cn
- 7: Co
- 8: Cs
- 9: L
- 10: LC
- 11: Ll
- 12: Lm
- 13: Lo
- 14: Lt
- 15: Lu
- 16: M
- 17: Mc
- 18: Me
- 19: Mn
- 20: N
- 21: Nd
- 22: Nl
- 23: No
- 24: P
- 25: Pc
- 26: Pd
- 27: Pe
- 28: Pf
- 29: Pi
- 30: Po
- 31: Ps
- 32: S
- 33: Sc
- 34: Sk
- 35: Sm
- 36: So
- 37: Z
- 38: Zl
- 39: Zp
- 40: Zs
- 41: Math
- 42: Alphabetic
- 43: Lowercase
- 44: Uppercase
- 45: Cased
- 46: Case_Ignorable
+ 15: ASCII_Hex_Digit
+ 16: Ahom
+ 17: Alphabetic
+ 18: Anatolian_Hieroglyphs
+ 19: Any
+ 20: Arabic
+ 21: Armenian
+ 22: Assigned
+ 23: Avestan
+ 24: Balinese
+ 25: Bamum
+ 26: Bassa_Vah
+ 27: Batak
+ 28: Bengali
+ 29: Bidi_Control
+ 30: Bopomofo
+ 31: Brahmi
+ 32: Braille
+ 33: Buginese
+ 34: Buhid
+ 35: C
+ 36: Canadian_Aboriginal
+ 37: Carian
+ 38: Case_Ignorable
+ 39: Cased
+ 40: Caucasian_Albanian
+ 41: Cc
+ 42: Cf
+ 43: Chakma
+ 44: Cham
+ 45: Changes_When_Casefolded
+ 46: Changes_When_Casemapped
  47: Changes_When_Lowercased
- 48: Changes_When_Uppercased
- 49: Changes_When_Titlecased
- 50: Changes_When_Casefolded
- 51: Changes_When_Casemapped
- 52: ID_Start
- 53: ID_Continue
- 54: XID_Start
- 55: XID_Continue
- 56: Default_Ignorable_Code_Point
- 57: Grapheme_Extend
- 58: Grapheme_Base
- 59: Grapheme_Link
- 60: Common
- 61: Latin
- 62: Greek
- 63: Cyrillic
- 64: Armenian
- 65: Hebrew
- 66: Arabic
- 67: Syriac
- 68: Thaana
- 69: Devanagari
- 70: Bengali
- 71: Gurmukhi
- 72: Gujarati
- 73: Oriya
- 74: Tamil
- 75: Telugu
- 76: Kannada
- 77: Malayalam
- 78: Sinhala
- 79: Thai
- 80: Lao
- 81: Tibetan
- 82: Myanmar
- 83: Georgian
- 84: Hangul
- 85: Ethiopic
- 86: Cherokee
- 87: Canadian_Aboriginal
- 88: Ogham
- 89: Runic
- 90: Khmer
- 91: Mongolian
- 92: Hiragana
- 93: Katakana
- 94: Bopomofo
- 95: Han
- 96: Yi
- 97: Old_Italic
- 98: Gothic
- 99: Deseret
-100: Inherited
-101: Tagalog
-102: Hanunoo
-103: Buhid
-104: Tagbanwa
-105: Limbu
-106: Tai_Le
-107: Linear_B
-108: Ugaritic
-109: Shavian
-110: Osmanya
-111: Cypriot
-112: Braille
-113: Buginese
-114: Coptic
-115: New_Tai_Lue
-116: Glagolitic
-117: Tifinagh
-118: Syloti_Nagri
-119: Old_Persian
-120: Kharoshthi
-121: Balinese
-122: Cuneiform
-123: Phoenician
-124: Phags_Pa
-125: Nko
-126: Sundanese
-127: Lepcha
-128: Ol_Chiki
-129: Vai
-130: Saurashtra
-131: Kayah_Li
-132: Rejang
-133: Lycian
-134: Carian
+ 48: Changes_When_Titlecased
+ 49: Changes_When_Uppercased
+ 50: Cherokee
+ 51: Cn
+ 52: Co
+ 53: Common
+ 54: Coptic
+ 55: Cs
+ 56: Cuneiform
+ 57: Cypriot
+ 58: Cyrillic
+ 59: Dash
+ 60: Default_Ignorable_Code_Point
+ 61: Deprecated
+ 62: Deseret
+ 63: Devanagari
+ 64: Diacritic
+ 65: Duployan
+ 66: Egyptian_Hieroglyphs
+ 67: Elbasan
+ 68: Ethiopic
+ 69: Extender
+ 70: Georgian
+ 71: Glagolitic
+ 72: Gothic
+ 73: Grantha
+ 74: Grapheme_Base
+ 75: Grapheme_Cluster_Break_CR
+ 76: Grapheme_Cluster_Break_Control
+ 77: Grapheme_Cluster_Break_Extend
+ 78: Grapheme_Cluster_Break_L
+ 79: Grapheme_Cluster_Break_LF
+ 80: Grapheme_Cluster_Break_LV
+ 81: Grapheme_Cluster_Break_LVT
+ 82: Grapheme_Cluster_Break_Regional_Indicator
+ 83: Grapheme_Cluster_Break_SpacingMark
+ 84: Grapheme_Cluster_Break_T
+ 85: Grapheme_Cluster_Break_V
+ 86: Grapheme_Extend
+ 87: Grapheme_Link
+ 88: Greek
+ 89: Gujarati
+ 90: Gurmukhi
+ 91: Han
+ 92: Hangul
+ 93: Hanunoo
+ 94: Hatran
+ 95: Hebrew
+ 96: Hex_Digit
+ 97: Hiragana
+ 98: Hyphen
+ 99: IDS_Binary_Operator
+100: IDS_Trinary_Operator
+101: ID_Continue
+102: ID_Start
+103: Ideographic
+104: Imperial_Aramaic
+105: Inherited
+106: Inscriptional_Pahlavi
+107: Inscriptional_Parthian
+108: Javanese
+109: Join_Control
+110: Kaithi
+111: Kannada
+112: Katakana
+113: Kayah_Li
+114: Kharoshthi
+115: Khmer
+116: Khojki
+117: Khudawadi
+118: L
+119: LC
+120: Lao
+121: Latin
+122: Lepcha
+123: Limbu
+124: Linear_A
+125: Linear_B
+126: Lisu
+127: Ll
+128: Lm
+129: Lo
+130: Logical_Order_Exception
+131: Lowercase
+132: Lt
+133: Lu
+134: Lycian
 135: Lydian
-136: Cham
-137: Tai_Tham
-138: Tai_Viet
-139: Avestan
-140: Egyptian_Hieroglyphs
-141: Samaritan
-142: Lisu
-143: Bamum
-144: Javanese
-145: Meetei_Mayek
-146: Imperial_Aramaic
-147: Old_South_Arabian
-148: Inscriptional_Parthian
-149: Inscriptional_Pahlavi
-150: Old_Turkic
-151: Kaithi
-152: Batak
-153: Brahmi
-154: Mandaic
-155: Chakma
-156: Meroitic_Cursive
-157: Meroitic_Hieroglyphs
-158: Miao
-159: Sharada
-160: Sora_Sompeng
-161: Takri
-162: Caucasian_Albanian
-163: Bassa_Vah
-164: Duployan
-165: Elbasan
-166: Grantha
-167: Pahawh_Hmong
-168: Khojki
-169: Linear_A
-170: Mahajani
-171: Manichaean
-172: Mende_Kikakui
-173: Modi
-174: Mro
-175: Old_North_Arabian
-176: Nabataean
-177: Palmyrene
-178: Pau_Cin_Hau
-179: Old_Permic
-180: Psalter_Pahlavi
-181: Siddham
-182: Khudawadi
-183: Tirhuta
-184: Warang_Citi
-185: Ahom
-186: Anatolian_Hieroglyphs
-187: Hatran
-188: Multani
-189: Old_Hungarian
-190: SignWriting
-191: White_Space
-192: Bidi_Control
-193: Join_Control
-194: Dash
-195: Hyphen
-196: Quotation_Mark
-197: Terminal_Punctuation
-198: Other_Math
-199: Hex_Digit
-200: ASCII_Hex_Digit
-201: Other_Alphabetic
-202: Ideographic
-203: Diacritic
-204: Extender
-205: Other_Lowercase
-206: Other_Uppercase
-207: Noncharacter_Code_Point
-208: Other_Grapheme_Extend
-209: IDS_Binary_Operator
-210: IDS_Trinary_Operator
-211: Radical
-212: Unified_Ideograph
-213: Other_Default_Ignorable_Code_Point
-214: Deprecated
+136: M
+137: Mahajani
+138: Malayalam
+139: Mandaic
+140: Manichaean
+141: Math
+142: Mc
+143: Me
+144: Meetei_Mayek
+145: Mende_Kikakui
+146: Meroitic_Cursive
+147: Meroitic_Hieroglyphs
+148: Miao
+149: Mn
+150: Modi
+151: Mongolian
+152: Mro
+153: Multani
+154: Myanmar
+155: N
+156: Nabataean
+157: Nd
+158: New_Tai_Lue
+159: Nko
+160: Nl
+161: No
+162: Noncharacter_Code_Point
+163: Ogham
+164: Ol_Chiki
+165: Old_Hungarian
+166: Old_Italic
+167: Old_North_Arabian
+168: Old_Permic
+169: Old_Persian
+170: Old_South_Arabian
+171: Old_Turkic
+172: Oriya
+173: Osmanya
+174: Other_Alphabetic
+175: Other_Default_Ignorable_Code_Point
+176: Other_Grapheme_Extend
+177: Other_ID_Continue
+178: Other_ID_Start
+179: Other_Lowercase
+180: Other_Math
+181: Other_Uppercase
+182: P
+183: Pahawh_Hmong
+184: Palmyrene
+185: Pattern_Syntax
+186: Pattern_White_Space
+187: Pau_Cin_Hau
+188: Pc
+189: Pd
+190: Pe
+191: Pf
+192: Phags_Pa
+193: Phoenician
+194: Pi
+195: Po
+196: Ps
+197: Psalter_Pahlavi
+198: Quotation_Mark
+199: Radical
+200: Rejang
+201: Runic
+202: S
+203: STerm
+204: Samaritan
+205: Saurashtra
+206: Sc
+207: Sharada
+208: Shavian
+209: Siddham
+210: SignWriting
+211: Sinhala
+212: Sk
+213: Sm
+214: So
 215: Soft_Dotted
-216: Logical_Order_Exception
-217: Other_ID_Start
-218: Other_ID_Continue
-219: STerm
-220: Variation_Selector
-221: Pattern_White_Space
-222: Pattern_Syntax
-223: Unknown
-224: Aghb
-225: AHex
-226: Arab
-227: Armi
-228: Armn
-229: Avst
-230: Bali
-231: Bamu
-232: Bass
-233: Batk
-234: Beng
-235: Bidi_C
-236: Bopo
-237: Brah
-238: Brai
-239: Bugi
-240: Buhd
-241: Cakm
-242: Cans
-243: Cari
-244: Cased_Letter
-245: Cher
-246: CI
-247: Close_Punctuation
-248: Combining_Mark
-249: Connector_Punctuation
-250: Control
-251: Copt
-252: Cprt
-253: Currency_Symbol
-254: CWCF
-255: CWCM
-256: CWL
-257: CWT
-258: CWU
-259: Cyrl
-260: Dash_Punctuation
-261: Decimal_Number
-262: Dep
-263: Deva
-264: DI
-265: Dia
-266: Dsrt
-267: Dupl
-268: Egyp
-269: Elba
-270: Enclosing_Mark
-271: Ethi
-272: Ext
-273: Final_Punctuation
-274: Format
-275: Geor
-276: Glag
-277: Goth
-278: Gran
-279: Gr_Base
-280: Grek
-281: Gr_Ext
-282: Gr_Link
-283: Gujr
-284: Guru
-285: Hang
-286: Hani
-287: Hano
-288: Hatr
-289: Hebr
-290: Hex
-291: Hira
-292: Hluw
-293: Hmng
-294: Hung
-295: IDC
-296: Ideo
-297: IDS
-298: IDSB
-299: IDST
-300: Initial_Punctuation
-301: Ital
-302: Java
-303: Join_C
-304: Kali
-305: Kana
-306: Khar
-307: Khmr
-308: Khoj
-309: Knda
-310: Kthi
-311: Lana
-312: Laoo
-313: Latn
-314: Lepc
-315: Letter
-316: Letter_Number
-317: Limb
-318: Lina
-319: Linb
-320: Line_Separator
-321: LOE
-322: Lowercase_Letter
-323: Lyci
-324: Lydi
-325: Mahj
-326: Mand
-327: Mani
-328: Mark
-329: Math_Symbol
-330: Mend
-331: Merc
-332: Mero
-333: Mlym
-334: Modifier_Letter
-335: Modifier_Symbol
-336: Mong
-337: Mroo
-338: Mtei
-339: Mult
-340: Mymr
-341: Narb
-342: Nbat
-343: NChar
-344: Nkoo
-345: Nonspacing_Mark
-346: Number
-347: OAlpha
-348: ODI
-349: Ogam
-350: OGr_Ext
-351: OIDC
-352: OIDS
-353: Olck
-354: OLower
-355: OMath
-356: Open_Punctuation
-357: Orkh
-358: Orya
-359: Osma
-360: Other
-361: Other_Letter
-362: Other_Number
-363: Other_Punctuation
-364: Other_Symbol
-365: OUpper
-366: Palm
-367: Paragraph_Separator
-368: Pat_Syn
-369: Pat_WS
-370: Pauc
-371: Perm
-372: Phag
-373: Phli
-374: Phlp
-375: Phnx
-376: Plrd
-377: Private_Use
-378: Prti
-379: Punctuation
-380: Qaac
-381: Qaai
-382: QMark
-383: Rjng
-384: Runr
-385: Samr
-386: Sarb
-387: Saur
-388: SD
-389: Separator
-390: Sgnw
-391: Shaw
-392: Shrd
-393: Sidd
-394: Sind
-395: Sinh
-396: Sora
-397: Space_Separator
-398: Spacing_Mark
-399: Sund
-400: Surrogate
-401: Sylo
-402: Symbol
-403: Syrc
-404: Tagb
-405: Takr
-406: Tale
-407: Talu
-408: Taml
-409: Tavt
-410: Telu
-411: Term
-412: Tfng
-413: Tglg
-414: Thaa
-415: Tibt
-416: Tirh
-417: Titlecase_Letter
-418: Ugar
-419: UIdeo
-420: Unassigned
-421: Uppercase_Letter
-422: Vaii
-423: VS
-424: Wara
-425: WSpace
-426: XIDC
-427: XIDS
-428: Xpeo
-429: Xsux
-430: Yiii
-431: Zinh
-432: Zyyy
-433: Zzzz
-434: In_Basic_Latin
-435: In_Latin_1_Supplement
-436: In_Latin_Extended_A
-437: In_Latin_Extended_B
-438: In_IPA_Extensions
-439: In_Spacing_Modifier_Letters
-440: In_Combining_Diacritical_Marks
-441: In_Greek_and_Coptic
-442: In_Cyrillic
-443: In_Cyrillic_Supplement
-444: In_Armenian
-445: In_Hebrew
-446: In_Arabic
-447: In_Syriac
-448: In_Arabic_Supplement
-449: In_Thaana
-450: In_NKo
-451: In_Samaritan
-452: In_Mandaic
-453: In_Arabic_Extended_A
-454: In_Devanagari
-455: In_Bengali
-456: In_Gurmukhi
-457: In_Gujarati
-458: In_Oriya
-459: In_Tamil
-460: In_Telugu
-461: In_Kannada
-462: In_Malayalam
-463: In_Sinhala
-464: In_Thai
-465: In_Lao
-466: In_Tibetan
-467: In_Myanmar
-468: In_Georgian
-469: In_Hangul_Jamo
-470: In_Ethiopic
-471: In_Ethiopic_Supplement
-472: In_Cherokee
-473: In_Unified_Canadian_Aboriginal_Syllabics
-474: In_Ogham
-475: In_Runic
-476: In_Tagalog
-477: In_Hanunoo
-478: In_Buhid
-479: In_Tagbanwa
-480: In_Khmer
-481: In_Mongolian
-482: In_Unified_Canadian_Aboriginal_Syllabics_Extended
-483: In_Limbu
-484: In_Tai_Le
-485: In_New_Tai_Lue
-486: In_Khmer_Symbols
-487: In_Buginese
-488: In_Tai_Tham
-489: In_Combining_Diacritical_Marks_Extended
-490: In_Balinese
-491: In_Sundanese
-492: In_Batak
-493: In_Lepcha
-494: In_Ol_Chiki
-495: In_Sundanese_Supplement
-496: In_Vedic_Extensions
-497: In_Phonetic_Extensions
-498: In_Phonetic_Extensions_Supplement
-499: In_Combining_Diacritical_Marks_Supplement
-500: In_Latin_Extended_Additional
-501: In_Greek_Extended
-502: In_General_Punctuation
-503: In_Superscripts_and_Subscripts
-504: In_Currency_Symbols
-505: In_Combining_Diacritical_Marks_for_Symbols
-506: In_Letterlike_Symbols
-507: In_Number_Forms
-508: In_Arrows
-509: In_Mathematical_Operators
-510: In_Miscellaneous_Technical
-511: In_Control_Pictures
-512: In_Optical_Character_Recognition
-513: In_Enclosed_Alphanumerics
-514: In_Box_Drawing
-515: In_Block_Elements
-516: In_Geometric_Shapes
-517: In_Miscellaneous_Symbols
-518: In_Dingbats
-519: In_Miscellaneous_Mathematical_Symbols_A
-520: In_Supplemental_Arrows_A
-521: In_Braille_Patterns
-522: In_Supplemental_Arrows_B
-523: In_Miscellaneous_Mathematical_Symbols_B
-524: In_Supplemental_Mathematical_Operators
-525: In_Miscellaneous_Symbols_and_Arrows
-526: In_Glagolitic
-527: In_Latin_Extended_C
-528: In_Coptic
-529: In_Georgian_Supplement
-530: In_Tifinagh
-531: In_Ethiopic_Extended
-532: In_Cyrillic_Extended_A
-533: In_Supplemental_Punctuation
-534: In_CJK_Radicals_Supplement
-535: In_Kangxi_Radicals
-536: In_Ideographic_Description_Characters
-537: In_CJK_Symbols_and_Punctuation
-538: In_Hiragana
-539: In_Katakana
-540: In_Bopomofo
-541: In_Hangul_Compatibility_Jamo
-542: In_Kanbun
-543: In_Bopomofo_Extended
-544: In_CJK_Strokes
-545: In_Katakana_Phonetic_Extensions
-546: In_Enclosed_CJK_Letters_and_Months
-547: In_CJK_Compatibility
-548: In_CJK_Unified_Ideographs_Extension_A
-549: In_Yijing_Hexagram_Symbols
-550: In_CJK_Unified_Ideographs
-551: In_Yi_Syllables
-552: In_Yi_Radicals
-553: In_Lisu
-554: In_Vai
-555: In_Cyrillic_Extended_B
-556: In_Bamum
-557: In_Modifier_Tone_Letters
-558: In_Latin_Extended_D
-559: In_Syloti_Nagri
-560: In_Common_Indic_Number_Forms
-561: In_Phags_pa
-562: In_Saurashtra
-563: In_Devanagari_Extended
-564: In_Kayah_Li
-565: In_Rejang
-566: In_Hangul_Jamo_Extended_A
-567: In_Javanese
-568: In_Myanmar_Extended_B
-569: In_Cham
-570: In_Myanmar_Extended_A
-571: In_Tai_Viet
-572: In_Meetei_Mayek_Extensions
-573: In_Ethiopic_Extended_A
-574: In_Latin_Extended_E
-575: In_Cherokee_Supplement
-576: In_Meetei_Mayek
-577: In_Hangul_Syllables
-578: In_Hangul_Jamo_Extended_B
-579: In_High_Surrogates
-580: In_High_Private_Use_Surrogates
-581: In_Low_Surrogates
-582: In_Private_Use_Area
-583: In_CJK_Compatibility_Ideographs
-584: In_Alphabetic_Presentation_Forms
-585: In_Arabic_Presentation_Forms_A
-586: In_Variation_Selectors
-587: In_Vertical_Forms
-588: In_Combining_Half_Marks
-589: In_CJK_Compatibility_Forms
-590: In_Small_Form_Variants
-591: In_Arabic_Presentation_Forms_B
-592: In_Halfwidth_and_Fullwidth_Forms
-593: In_Specials
-594: In_Linear_B_Syllabary
-595: In_Linear_B_Ideograms
-596: In_Aegean_Numbers
-597: In_Ancient_Greek_Numbers
-598: In_Ancient_Symbols
-599: In_Phaistos_Disc
-600: In_Lycian
-601: In_Carian
-602: In_Coptic_Epact_Numbers
-603: In_Old_Italic
-604: In_Gothic
-605: In_Old_Permic
-606: In_Ugaritic
-607: In_Old_Persian
-608: In_Deseret
-609: In_Shavian
-610: In_Osmanya
-611: In_Elbasan
-612: In_Caucasian_Albanian
-613: In_Linear_A
-614: In_Cypriot_Syllabary
-615: In_Imperial_Aramaic
-616: In_Palmyrene
-617: In_Nabataean
-618: In_Hatran
-619: In_Phoenician
-620: In_Lydian
-621: In_Meroitic_Hieroglyphs
-622: In_Meroitic_Cursive
-623: In_Kharoshthi
-624: In_Old_South_Arabian
-625: In_Old_North_Arabian
-626: In_Manichaean
-627: In_Avestan
-628: In_Inscriptional_Parthian
-629: In_Inscriptional_Pahlavi
-630: In_Psalter_Pahlavi
-631: In_Old_Turkic
-632: In_Old_Hungarian
-633: In_Rumi_Numeral_Symbols
-634: In_Brahmi
-635: In_Kaithi
-636: In_Sora_Sompeng
-637: In_Chakma
-638: In_Mahajani
-639: In_Sharada
-640: In_Sinhala_Archaic_Numbers
-641: In_Khojki
-642: In_Multani
-643: In_Khudawadi
-644: In_Grantha
-645: In_Tirhuta
-646: In_Siddham
-647: In_Modi
-648: In_Takri
-649: In_Ahom
-650: In_Warang_Citi
-651: In_Pau_Cin_Hau
-652: In_Cuneiform
-653: In_Cuneiform_Numbers_and_Punctuation
-654: In_Early_Dynastic_Cuneiform
-655: In_Egyptian_Hieroglyphs
-656: In_Anatolian_Hieroglyphs
-657: In_Bamum_Supplement
-658: In_Mro
-659: In_Bassa_Vah
-660: In_Pahawh_Hmong
-661: In_Miao
-662: In_Kana_Supplement
-663: In_Duployan
-664: In_Shorthand_Format_Controls
-665: In_Byzantine_Musical_Symbols
-666: In_Musical_Symbols
-667: In_Ancient_Greek_Musical_Notation
-668: In_Tai_Xuan_Jing_Symbols
-669: In_Counting_Rod_Numerals
-670: In_Mathematical_Alphanumeric_Symbols
-671: In_Sutton_SignWriting
-672: In_Mende_Kikakui
-673: In_Arabic_Mathematical_Alphabetic_Symbols
-674: In_Mahjong_Tiles
-675: In_Domino_Tiles
-676: In_Playing_Cards
-677: In_Enclosed_Alphanumeric_Supplement
-678: In_Enclosed_Ideographic_Supplement
-679: In_Miscellaneous_Symbols_and_Pictographs
-680: In_Emoticons
-681: In_Ornamental_Dingbats
-682: In_Transport_and_Map_Symbols
-683: In_Alchemical_Symbols
-684: In_Geometric_Shapes_Extended
-685: In_Supplemental_Arrows_C
-686: In_Supplemental_Symbols_and_Pictographs
-687: In_CJK_Unified_Ideographs_Extension_B
-688: In_CJK_Unified_Ideographs_Extension_C
-689: In_CJK_Unified_Ideographs_Extension_D
-690: In_CJK_Unified_Ideographs_Extension_E
-691: In_CJK_Compatibility_Ideographs_Supplement
-692: In_Tags
-693: In_Variation_Selectors_Supplement
-694: In_Supplementary_Private_Use_Area_A
-695: In_Supplementary_Private_Use_Area_B
-696: In_No_Block
+216: Sora_Sompeng
+217: Sundanese
+218: Syloti_Nagri
+219: Syriac
+220: Tagalog
+221: Tagbanwa
+222: Tai_Le
+223: Tai_Tham
+224: Tai_Viet
+225: Takri
+226: Tamil
+227: Telugu
+228: Terminal_Punctuation
+229: Thaana
+230: Thai
+231: Tibetan
+232: Tifinagh
+233: Tirhuta
+234: Ugaritic
+235: Unified_Ideograph
+236: Unknown
+237: Uppercase
+238: Vai
+239: Variation_Selector
+240: Warang_Citi
+241: White_Space
+242: XID_Continue
+243: XID_Start
+244: Yi
+245: Z
+246: Zl
+247: Zp
+248: Zs
+ 40: Aghb
+ 15: AHex
+ 20: Arab
+104: Armi
+ 21: Armn
+ 23: Avst
+ 24: Bali
+ 25: Bamu
+ 26: Bass
+ 27: Batk
+ 28: Beng
+ 29: Bidi_C
+ 30: Bopo
+ 31: Brah
+ 32: Brai
+ 33: Bugi
+ 34: Buhd
+ 43: Cakm
+ 36: Cans
+ 37: Cari
+119: Cased_Letter
+ 50: Cher
+ 38: CI
+190: Close_Punctuation
+136: Combining_Mark
+188: Connector_Punctuation
+ 41: Control
+ 54: Copt
+ 57: Cprt
+206: Currency_Symbol
+ 45: CWCF
+ 46: CWCM
+ 47: CWL
+ 48: CWT
+ 49: CWU
+ 58: Cyrl
+189: Dash_Punctuation
+157: Decimal_Number
+ 61: Dep
+ 63: Deva
+ 60: DI
+ 64: Dia
+ 62: Dsrt
+ 65: Dupl
+ 66: Egyp
+ 67: Elba
+143: Enclosing_Mark
+ 68: Ethi
+ 69: Ext
+191: Final_Punctuation
+ 42: Format
+ 70: Geor
+ 71: Glag
+ 72: Goth
+ 73: Gran
+ 74: Gr_Base
+ 88: Grek
+ 86: Gr_Ext
+ 87: Gr_Link
+ 89: Gujr
+ 90: Guru
+ 92: Hang
+ 91: Hani
+ 93: Hano
+ 94: Hatr
+ 95: Hebr
+ 96: Hex
+ 97: Hira
+ 18: Hluw
+183: Hmng
+165: Hung
+101: IDC
+103: Ideo
+102: IDS
+ 99: IDSB
+100: IDST
+194: Initial_Punctuation
+166: Ital
+108: Java
+109: Join_C
+113: Kali
+112: Kana
+114: Khar
+115: Khmr
+116: Khoj
+111: Knda
+110: Kthi
+223: Lana
+120: Laoo
+121: Latn
+122: Lepc
+118: Letter
+160: Letter_Number
+123: Limb
+124: Lina
+125: Linb
+246: Line_Separator
+130: LOE
+127: Lowercase_Letter
+134: Lyci
+135: Lydi
+137: Mahj
+139: Mand
+140: Mani
+136: Mark
+213: Math_Symbol
+145: Mend
+146: Merc
+147: Mero
+138: Mlym
+128: Modifier_Letter
+212: Modifier_Symbol
+151: Mong
+152: Mroo
+144: Mtei
+153: Mult
+154: Mymr
+167: Narb
+156: Nbat
+162: NChar
+159: Nkoo
+149: Nonspacing_Mark
+155: Number
+174: OAlpha
+175: ODI
+163: Ogam
+176: OGr_Ext
+177: OIDC
+178: OIDS
+164: Olck
+179: OLower
+180: OMath
+196: Open_Punctuation
+171: Orkh
+172: Orya
+173: Osma
+ 35: Other
+129: Other_Letter
+161: Other_Number
+195: Other_Punctuation
+214: Other_Symbol
+181: OUpper
+184: Palm
+247: Paragraph_Separator
+185: Pat_Syn
+186: Pat_WS
+187: Pauc
+168: Perm
+192: Phag
+106: Phli
+197: Phlp
+193: Phnx
+148: Plrd
+ 52: Private_Use
+107: Prti
+182: Punctuation
+ 54: Qaac
+105: Qaai
+198: QMark
+200: Rjng
+201: Runr
+204: Samr
+170: Sarb
+205: Saur
+215: SD
+245: Separator
+210: Sgnw
+208: Shaw
+207: Shrd
+209: Sidd
+117: Sind
+211: Sinh
+216: Sora
+248: Space_Separator
+142: Spacing_Mark
+217: Sund
+ 55: Surrogate
+218: Sylo
+202: Symbol
+219: Syrc
+221: Tagb
+225: Takr
+222: Tale
+158: Talu
+226: Taml
+224: Tavt
+227: Telu
+228: Term
+232: Tfng
+220: Tglg
+229: Thaa
+231: Tibt
+233: Tirh
+132: Titlecase_Letter
+234: Ugar
+235: UIdeo
+ 51: Unassigned
+133: Uppercase_Letter
+238: Vaii
+239: VS
+240: Wara
+241: WSpace
+242: XIDC
+243: XIDS
+169: Xpeo
+ 56: Xsux
+244: Yiii
+105: Zinh
+ 53: Zyyy
+236: Zzzz
+249: In_Basic_Latin
+250: In_Latin_1_Supplement
+251: In_Latin_Extended_A
+252: In_Latin_Extended_B
+253: In_IPA_Extensions
+254: In_Spacing_Modifier_Letters
+255: In_Combining_Diacritical_Marks
+256: In_Greek_and_Coptic
+257: In_Cyrillic
+258: In_Cyrillic_Supplement
+259: In_Armenian
+260: In_Hebrew
+261: In_Arabic
+262: In_Syriac
+263: In_Arabic_Supplement
+264: In_Thaana
+265: In_NKo
+266: In_Samaritan
+267: In_Mandaic
+268: In_Arabic_Extended_A
+269: In_Devanagari
+270: In_Bengali
+271: In_Gurmukhi
+272: In_Gujarati
+273: In_Oriya
+274: In_Tamil
+275: In_Telugu
+276: In_Kannada
+277: In_Malayalam
+278: In_Sinhala
+279: In_Thai
+280: In_Lao
+281: In_Tibetan
+282: In_Myanmar
+283: In_Georgian
+284: In_Hangul_Jamo
+285: In_Ethiopic
+286: In_Ethiopic_Supplement
+287: In_Cherokee
+288: In_Unified_Canadian_Aboriginal_Syllabics
+289: In_Ogham
+290: In_Runic
+291: In_Tagalog
+292: In_Hanunoo
+293: In_Buhid
+294: In_Tagbanwa
+295: In_Khmer
+296: In_Mongolian
+297: In_Unified_Canadian_Aboriginal_Syllabics_Extended
+298: In_Limbu
+299: In_Tai_Le
+300: In_New_Tai_Lue
+301: In_Khmer_Symbols
+302: In_Buginese
+303: In_Tai_Tham
+304: In_Combining_Diacritical_Marks_Extended
+305: In_Balinese
+306: In_Sundanese
+307: In_Batak
+308: In_Lepcha
+309: In_Ol_Chiki
+310: In_Sundanese_Supplement
+311: In_Vedic_Extensions
+312: In_Phonetic_Extensions
+313: In_Phonetic_Extensions_Supplement
+314: In_Combining_Diacritical_Marks_Supplement
+315: In_Latin_Extended_Additional
+316: In_Greek_Extended
+317: In_General_Punctuation
+318: In_Superscripts_and_Subscripts
+319: In_Currency_Symbols
+320: In_Combining_Diacritical_Marks_for_Symbols
+321: In_Letterlike_Symbols
+322: In_Number_Forms
+323: In_Arrows
+324: In_Mathematical_Operators
+325: In_Miscellaneous_Technical
+326: In_Control_Pictures
+327: In_Optical_Character_Recognition
+328: In_Enclosed_Alphanumerics
+329: In_Box_Drawing
+330: In_Block_Elements
+331: In_Geometric_Shapes
+332: In_Miscellaneous_Symbols
+333: In_Dingbats
+334: In_Miscellaneous_Mathematical_Symbols_A
+335: In_Supplemental_Arrows_A
+336: In_Braille_Patterns
+337: In_Supplemental_Arrows_B
+338: In_Miscellaneous_Mathematical_Symbols_B
+339: In_Supplemental_Mathematical_Operators
+340: In_Miscellaneous_Symbols_and_Arrows
+341: In_Glagolitic
+342: In_Latin_Extended_C
+343: In_Coptic
+344: In_Georgian_Supplement
+345: In_Tifinagh
+346: In_Ethiopic_Extended
+347: In_Cyrillic_Extended_A
+348: In_Supplemental_Punctuation
+349: In_CJK_Radicals_Supplement
+350: In_Kangxi_Radicals
+351: In_Ideographic_Description_Characters
+352: In_CJK_Symbols_and_Punctuation
+353: In_Hiragana
+354: In_Katakana
+355: In_Bopomofo
+356: In_Hangul_Compatibility_Jamo
+357: In_Kanbun
+358: In_Bopomofo_Extended
+359: In_CJK_Strokes
+360: In_Katakana_Phonetic_Extensions
+361: In_Enclosed_CJK_Letters_and_Months
+362: In_CJK_Compatibility
+363: In_CJK_Unified_Ideographs_Extension_A
+364: In_Yijing_Hexagram_Symbols
+365: In_CJK_Unified_Ideographs
+366: In_Yi_Syllables
+367: In_Yi_Radicals
+368: In_Lisu
+369: In_Vai
+370: In_Cyrillic_Extended_B
+371: In_Bamum
+372: In_Modifier_Tone_Letters
+373: In_Latin_Extended_D
+374: In_Syloti_Nagri
+375: In_Common_Indic_Number_Forms
+376: In_Phags_pa
+377: In_Saurashtra
+378: In_Devanagari_Extended
+379: In_Kayah_Li
+380: In_Rejang
+381: In_Hangul_Jamo_Extended_A
+382: In_Javanese
+383: In_Myanmar_Extended_B
+384: In_Cham
+385: In_Myanmar_Extended_A
+386: In_Tai_Viet
+387: In_Meetei_Mayek_Extensions
+388: In_Ethiopic_Extended_A
+389: In_Latin_Extended_E
+390: In_Cherokee_Supplement
+391: In_Meetei_Mayek
+392: In_Hangul_Syllables
+393: In_Hangul_Jamo_Extended_B
+394: In_High_Surrogates
+395: In_High_Private_Use_Surrogates
+396: In_Low_Surrogates
+397: In_Private_Use_Area
+398: In_CJK_Compatibility_Ideographs
+399: In_Alphabetic_Presentation_Forms
+400: In_Arabic_Presentation_Forms_A
+401: In_Variation_Selectors
+402: In_Vertical_Forms
+403: In_Combining_Half_Marks
+404: In_CJK_Compatibility_Forms
+405: In_Small_Form_Variants
+406: In_Arabic_Presentation_Forms_B
+407: In_Halfwidth_and_Fullwidth_Forms
+408: In_Specials
+409: In_Linear_B_Syllabary
+410: In_Linear_B_Ideograms
+411: In_Aegean_Numbers
+412: In_Ancient_Greek_Numbers
+413: In_Ancient_Symbols
+414: In_Phaistos_Disc
+415: In_Lycian
+416: In_Carian
+417: In_Coptic_Epact_Numbers
+418: In_Old_Italic
+419: In_Gothic
+420: In_Old_Permic
+421: In_Ugaritic
+422: In_Old_Persian
+423: In_Deseret
+424: In_Shavian
+425: In_Osmanya
+426: In_Elbasan
+427: In_Caucasian_Albanian
+428: In_Linear_A
+429: In_Cypriot_Syllabary
+430: In_Imperial_Aramaic
+431: In_Palmyrene
+432: In_Nabataean
+433: In_Hatran
+434: In_Phoenician
+435: In_Lydian
+436: In_Meroitic_Hieroglyphs
+437: In_Meroitic_Cursive
+438: In_Kharoshthi
+439: In_Old_South_Arabian
+440: In_Old_North_Arabian
+441: In_Manichaean
+442: In_Avestan
+443: In_Inscriptional_Parthian
+444: In_Inscriptional_Pahlavi
+445: In_Psalter_Pahlavi
+446: In_Old_Turkic
+447: In_Old_Hungarian
+448: In_Rumi_Numeral_Symbols
+449: In_Brahmi
+450: In_Kaithi
+451: In_Sora_Sompeng
+452: In_Chakma
+453: In_Mahajani
+454: In_Sharada
+455: In_Sinhala_Archaic_Numbers
+456: In_Khojki
+457: In_Multani
+458: In_Khudawadi
+459: In_Grantha
+460: In_Tirhuta
+461: In_Siddham
+462: In_Modi
+463: In_Takri
+464: In_Ahom
+465: In_Warang_Citi
+466: In_Pau_Cin_Hau
+467: In_Cuneiform
+468: In_Cuneiform_Numbers_and_Punctuation
+469: In_Early_Dynastic_Cuneiform
+470: In_Egyptian_Hieroglyphs
+471: In_Anatolian_Hieroglyphs
+472: In_Bamum_Supplement
+473: In_Mro
+474: In_Bassa_Vah
+475: In_Pahawh_Hmong
+476: In_Miao
+477: In_Kana_Supplement
+478: In_Duployan
+479: In_Shorthand_Format_Controls
+480: In_Byzantine_Musical_Symbols
+481: In_Musical_Symbols
+482: In_Ancient_Greek_Musical_Notation
+483: In_Tai_Xuan_Jing_Symbols
+484: In_Counting_Rod_Numerals
+485: In_Mathematical_Alphanumeric_Symbols
+486: In_Sutton_SignWriting
+487: In_Mende_Kikakui
+488: In_Arabic_Mathematical_Alphabetic_Symbols
+489: In_Mahjong_Tiles
+490: In_Domino_Tiles
+491: In_Playing_Cards
+492: In_Enclosed_Alphanumeric_Supplement
+493: In_Enclosed_Ideographic_Supplement
+494: In_Miscellaneous_Symbols_and_Pictographs
+495: In_Emoticons
+496: In_Ornamental_Dingbats
+497: In_Transport_and_Map_Symbols
+498: In_Alchemical_Symbols
+499: In_Geometric_Shapes_Extended
+500: In_Supplemental_Arrows_C
+501: In_Supplemental_Symbols_and_Pictographs
+502: In_CJK_Unified_Ideographs_Extension_B
+503: In_CJK_Unified_Ideographs_Extension_C
+504: In_CJK_Unified_Ideographs_Extension_D
+505: In_CJK_Unified_Ideographs_Extension_E
+506: In_CJK_Compatibility_Ideographs_Supplement
+507: In_Tags
+508: In_Variation_Selectors_Supplement
+509: In_Supplementary_Private_Use_Area_A
+510: In_Supplementary_Private_Use_Area_B
+511: In_No_Block