| Expression | Syntax | Description |
| --- | --- | --- |
| Uppercase letter | :Lu | Matches any one capital letter. For example, :Luhe matches "The" but not "the". |
| Lowercase letter | :Ll | Matches any one lowercase letter. For example, :Llhe matches "the" but not "The". |
| Title case letter | :Lt | Matches characters that combine an uppercase letter with a lowercase letter, such as Nj and Dz. |
| Modifier letter | :Lm | Matches letters or punctuation, such as commas, cross accents, and double prime, used to indicate modifications to the preceding letter. |
| Other letter | :Lo | Matches other letters, such as gothic letter ahsa. |
| Decimal digit | :Nd | Matches decimal digits such as 0-9 and their full-width equivalents. |
| Letter digit | :Nl | Matches letter digits such as roman numerals and ideographic number zero. |
| Other digit | :No | Matches other digits such as old italic number one. |
| Open punctuation | :Ps | Matches opening punctuation such as open brackets and braces. |
| Close punctuation | :Pe | Matches closing punctuation such as closing brackets and braces. |
| Initial quote punctuation | :Pi | Matches initial double quotation marks. |
| Final quote punctuation | :Pf | Matches single quotation marks and ending double quotation marks. |
| Dash punctuation | :Pd | Matches the dash mark. |
| Connector punctuation | :Pc | Matches the underscore or underline mark. |
| Other punctuation | :Po | Matches commas (,), ?, ", !, @, #, %, &, *, \, colons (:), semicolons (;), ', and /. |
| Space separator | :Zs | Matches blanks. |
| Line separator | :Zl | Matches the Unicode character U+2028. |
| Paragraph separator | :Zp | Matches the Unicode character U+2029. |
| Non-spacing mark | :Mn | Matches non-spacing marks. |
| Combining mark | :Mc | Matches combining marks. |
| Enclosing mark | :Me | Matches enclosing marks. |
| Math symbol | :Sm | Matches +, =, ~, \|, <, and >. |
| Currency symbol | :Sc | Matches $ and other currency symbols. |
| Modifier symbol | :Sk | Matches modifier symbols such as circumflex accent, grave accent, and macron. |
| Other symbol | :So | Matches other symbols, such as the copyright sign, pilcrow sign, and the degree sign. |
| Other control | :Cc | Matches Unicode control characters such as TAB and NEWLINE. |
| Other format | :Cf | Matches formatting control characters such as the bidirectional control characters. |
| Surrogate | :Cs | Matches one half of a surrogate pair. |
| Other private-use | :Co | Matches any character from the private-use area. |
| Other not assigned | :Cn | Matches code points that are not assigned to any Unicode character. |
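
The two-letter codes in the Syntax column appear to follow the Unicode general category abbreviations (Lu, Ll, Nd, and so on). As a rough way to check which category a given character falls into, the sketch below uses Python's standard `unicodedata.category` function. The helper names `in_category` and `matches_prefix` are hypothetical and only illustrate the idea; this is not how the search tool itself evaluates these expressions.

```python
import unicodedata


def in_category(ch: str, category: str) -> bool:
    """Return True if the single character ch has the given two-letter
    Unicode general category (for example "Lu", "Nd", or "Zs")."""
    return unicodedata.category(ch) == category


def matches_prefix(text: str, category: str, literal: str) -> bool:
    """Hypothetical emulation of an expression such as ':Luhe': one
    character of the given category followed by a literal string,
    tested at the start of text."""
    return (
        len(text) >= 1 + len(literal)
        and in_category(text[0], category)
        and text[1:1 + len(literal)] == literal
    )


# Mirrors the :Lu example from the table.
print(matches_prefix("The", "Lu", "he"))  # True
print(matches_prefix("the", "Lu", "he"))  # False

# Look up the category code of a few characters mentioned in the table.
for ch in ["_", "+", "$", "\u2028", "\u2029"]:
    print(repr(ch), unicodedata.category(ch))
```

Looking up a character this way can help when it is unclear which row of the table applies, for example when deciding between the punctuation and symbol categories.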