CharUnicodeInfo Class
Retrieves information about a Unicode character. This class cannot be inherited.
Assembly: mscorlib (in mscorlib.dll)
The CharUnicodeInfo type exposes the following members.
| Name | Description | |
|---|---|---|
![]() ![]() | GetDecimalDigitValue(Char) | Gets the decimal digit value of the specified numeric character. |
![]() ![]() | GetDecimalDigitValue(String, Int32) | Gets the decimal digit value of the numeric character at the specified index of the specified string. |
![]() ![]() | GetDigitValue(Char) | Gets the digit value of the specified numeric character. |
![]() ![]() | GetDigitValue(String, Int32) | Gets the digit value of the numeric character at the specified index of the specified string. |
![]() ![]() ![]() ![]() | GetNumericValue(Char) | Gets the numeric value associated with the specified character. |
![]() ![]() ![]() ![]() | GetNumericValue(String, Int32) | Gets the numeric value associated with the character at the specified index of the specified string. |
![]() ![]() ![]() ![]() | GetUnicodeCategory(Char) | Gets the Unicode category of the specified character. |
![]() ![]() ![]() ![]() | GetUnicodeCategory(String, Int32) | Gets the Unicode category of the character at the specified index of the specified string. |
The Unicode Standard defines a number of Unicode character categories. For example, a character might be categorized as an uppercase letter, a lowercase letter, a decimal digit number, a letter number, a paragraph separator, a math symbol, or a currency symbol. Your application can use the character category to govern string-based operations, such as parsing. The UnicodeCategory enumeration defines the possible character categories.
Your application uses the CharUnicodeInfo class to obtain the UnicodeCategory value for a specific character. The CharUnicodeInfo class defines methods that return the following Unicode character values:
Numeric value. Applies only to numeric characters, including fractions, subscripts, superscripts, Roman numerals, currency numerators, encircled numbers, and script-specific digits.
Digit value. Applies to numeric characters that can be combined with other numeric characters to represent a whole number in a numbering system.
Decimal digit value. Applies only to decimal digits in the decimal (base-10) system. A decimal digit can be one of ten digits, from 0 through 9.
When using this class in your applications, keep in mind the following programming considerations for using the Char type. The type can be difficult to use, and strings are generally preferable for representing linguistic content.
A Char object does not always correspond to a single character. Although the Char type represents a single 16-bit value, some Unicode characters consist of two or more UTF-16 code points. For more information, see "Char Objects and Unicode Characters" in the String class.
The notion of a "character" is also flexible. A character is often thought of as a glyph, but many glyphs require multiple code points. For example, ä can be represented either by two code points ("a" plus U+0308, which is the combining diaeresis), or by a single code point ("ä" or U+00A4). Some languages have many letters, characters, and glyphs that require multiple code points, which can cause confusion in linguistic content representation. For example, there is a ΰ (U+03B0, Greek small letter upsilon with dialytika and tonos), but there is no equivalent capital letter. Uppercasing such a value simply retrieves the original value.
Windows 7, Windows Vista SP1 or later, Windows XP SP3, Windows XP SP2 x64 Edition, Windows Server 2008 (Server Core not supported), Windows Server 2008 R2 (Server Core supported with SP1 or later), Windows Server 2003 SP2
The .NET Framework does not support all versions of every platform. For a list of the supported versions, see .NET Framework System Requirements.

