LEFT S-SHAPED BAG DELIMITER (U+27C5, Ps): ⟅ MEDIUM RIGHT CURLY BRACKET ORNAMENT (U+2775, Pe): ❵ MEDIUM LEFT CURLY BRACKET ORNAMENT (U+2774, Ps): ❴ LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT (U+2773, Pe): ❳ LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT (U+2772, Ps): ❲ HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT (U+2771, Pe): ❱ HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT (U+2770, Ps): ❰ HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT (U+276F, Pe): ❯ HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT (U+276E, Ps): ❮ MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT (U+276D, Pe): ❭ MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT (U+276C, Ps): ❬ MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT (U+276B, Pe): ❫ MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT (U+276A, Ps): ❪ MEDIUM RIGHT PARENTHESIS ORNAMENT (U+2769, Pe): ❩ MEDIUM LEFT PARENTHESIS ORNAMENT (U+2768, Ps): ❨ RIGHT-POINTING ANGLE BRACKET (U+232A, Pe): 〉 LEFT-POINTING ANGLE BRACKET (U+2329, Ps): 〈 SUBSCRIPT RIGHT PARENTHESIS (U+208E, Pe): ₎ SUBSCRIPT LEFT PARENTHESIS (U+208D, Ps): ₍ SUPERSCRIPT RIGHT PARENTHESIS (U+207E, Pe): ⁾ SUPERSCRIPT LEFT PARENTHESIS (U+207D, Ps): ⁽ RIGHT SQUARE BRACKET WITH QUILL (U+2046, Pe): ⁆ LEFT SQUARE BRACKET WITH QUILL (U+2045, Ps): ⁅ SINGLE RIGHT-POINTING ANGLE QUOTATION MARK (U+203A, Pf): › SINGLE LEFT-POINTING ANGLE QUOTATION MARK (U+2039, Pi): ‹ RIGHT DOUBLE QUOTATION MARK (U+201D, Pf): ”ĭOUBLE LOW-9 QUOTATION MARK (U+201E, Ps): „ĭOUBLE HIGH-REVERSED-9 QUOTATION MARK (U+201F, Pi): ‟ LEFT DOUBLE QUOTATION MARK (U+201C, Pi): “ SINGLE HIGH-REVERSED-9 QUOTATION MARK (U+201B, Pi): ‛ SINGLE LOW-9 QUOTATION MARK (U+201A, Ps): ‚ RIGHT SINGLE QUOTATION MARK (U+2019, Pf): ’ LEFT SINGLE QUOTATION MARK (U+2018, Pi): ‘ OGHAM REVERSED FEATHER MARK (U+169C, Pe): ᚜ TIBETAN MARK ANG KHANG GYAS (U+0F3D, Pe): ༽ TIBETAN MARK ANG KHANG GYON (U+0F3C, Ps): ༼ TIBETAN MARK GUG RTAGS GYAS (U+0F3B, Pe): ༻ TIBETAN MARK GUG RTAGS GYON (U+0F3A, Ps): ༺ RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00BB, Pf): » LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00AB, Pi): « (note: on some Bash implementations UTF-8 printing has a bug that causes it to print U+00AB "«" and U+00BB "»" as "?", and some terminals don't have the ability to render all characters correctly.) while IFS=' ' read number name category rest Here's a quick Bash script to get this information, and its output. Those you are going to have to find by hand there is no pre-determined listing of those. And some character, like, are used as brackets in some contexts (such as HTML/XML), while they are considered math symbols ( Sm) in UnicodeData.txt. are indicated with Pi and Pf (initial and final punctuation), so you might want to include those as well. Note that not all characters that you consider brackets may be listed for instance, quotation marks (including "«»"). Look for those character, and you'll find what you're looking for. Open and close punctuation characters are denoted with Ps (punctuation start) and Pe (punctuation end) in the General_Category field (the third field, delimited by ). The primary information is contained in UnicodeData.txt. There is a plain-text database of information about every Unicode character available from the Unicode Consortium the format is described in Unicode Annex #44.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |