Export (0) Print
Expand All

CharUnicodeInfo Class

Note: This class is new in the .NET Framework version 2.0.

Retrieves information about a Unicode character. This class cannot be inherited.

Namespace: System.Globalization
Assembly: mscorlib (in mscorlib.dll)

'Declaration
Public NotInheritable Class CharUnicodeInfo
'Usage
Dim instance As CharUnicodeInfo

public final class CharUnicodeInfo
public final class CharUnicodeInfo

The Unicode standard defines several properties for Unicode characters, and one such property is the character's category. For example, a character might be categorized as an uppercase letter, a lowercase letter, a decimal digit number, a letter number, a connector punctuation, a math symbol, or a currency symbol. Your application can use the character's category to govern string-based operations such as parsing.

The UnicodeCategory enumeration specifies the Unicode categories for characters. Use the CharUnicodeInfo class to obtain the UnicodeCategory value for a specific character.

The CharUnicodeInfo class returns the following Unicode character properties:

  • Numeric value is a Unicode character property that applies only to numeric characters, which include fractions, subscripts, superscripts, Roman numerals, currency numerators, encircled numbers, and script-specific digits.

  • Digit value is a property that applies only to numeric characters that can be used as digits. A digit is a numeric character that can be combined with other digits to represent a whole number in a numbering system.

  • Decimal digit value is a property that applies only to decimal digits. A decimal digit is a digit in the decimal (base-10) system that can be one of ten digits, from zero through nine.

For more information on Unicode characters, see the Unicode Standard.

The following code example shows the values returned by each method for different types of characters.

Imports System
Imports System.Globalization
Imports Microsoft.VisualBasic

Public Class SamplesCharUnicodeInfo   

   Public Shared Sub Main()

      Console.WriteLine("                                        c  Num   Dig   Dec   UnicodeCategory")

      Console.Write("U+0061 LATIN SMALL LETTER A            ")
      PrintProperties("a"c)

      Console.Write("U+0393 GREEK CAPITAL LETTER GAMMA      ")
      PrintProperties(ChrW(&H0393))

      Console.Write("U+0039 DIGIT NINE                      ")
      PrintProperties("9"c)

      Console.Write("U+00B2 SUPERSCRIPT TWO                 ")
      PrintProperties(ChrW(&H00B2))

      Console.Write("U+00BC VULGAR FRACTION ONE QUARTER     ")
      PrintProperties(ChrW(&H00BC))

      Console.Write("U+0BEF TAMIL DIGIT NINE                ")
      PrintProperties(ChrW(&H0BEF))

      Console.Write("U+0BF0 TAMIL NUMBER TEN                ")
      PrintProperties(ChrW(&H0BF0))

      Console.Write("U+0F33 TIBETAN DIGIT HALF ZERO         ")
      PrintProperties(ChrW(&H0F33))

      Console.Write("U+2788 CIRCLED SANS-SERIF DIGIT NINE   ")
      PrintProperties(ChrW(&H2788))

   End Sub 'Main

   Public Shared Sub PrintProperties(c As Char)
      Console.Write(" {0,-3}", c)
      Console.Write(" {0,-5}", CharUnicodeInfo.GetNumericValue(c))
      Console.Write(" {0,-5}", CharUnicodeInfo.GetDigitValue(c))
      Console.Write(" {0,-5}", CharUnicodeInfo.GetDecimalDigitValue(c))
      Console.WriteLine("{0}", CharUnicodeInfo.GetUnicodeCategory(c))
   End Sub 'PrintProperties

End Class 'SamplesCharUnicodeInfo 


'This code produces the following output.  Some characters might not display at the console.
'
'                                        c  Num   Dig   Dec   UnicodeCategory
'U+0061 LATIN SMALL LETTER A             a   -1    -1    -1   LowercaseLetter
'U+0393 GREEK CAPITAL LETTER GAMMA       \u0393   -1    -1    -1   UppercaseLetter
'U+0039 DIGIT NINE                       9   9     9     9    DecimalDigitNumber
'U+00B2 SUPERSCRIPT TWO                  \u00B2   2     2     2    OtherNumber
'U+00BC VULGAR FRACTION ONE QUARTER      \u00BC   0.25  -1    -1   OtherNumber
'U+0BEF TAMIL DIGIT NINE                 \u0BEF   9     9     9    DecimalDigitNumber
'U+0BF0 TAMIL NUMBER TEN                 \u0BF0   10    -1    -1   OtherNumber
'U+0F33 TIBETAN DIGIT HALF ZERO          \u0F33   -0.5  -1    -1   OtherNumber
'U+2788 CIRCLED SANS-SERIF DIGIT NINE    \u2788   9     9     -1   OtherNumber


import System.* ;
import System.Globalization.* ;

public class SamplesCharUnicodeInfo
{
    public static void main(String[] args)
    {
        Console.WriteLine("                                        c  Num " 
            + " Dig   Dec   UnicodeCategory");

        Console.Write("U+0061 LATIN SMALL LETTER A            ");
        PrintProperties('a');

        Console.Write("U+0393 GREEK CAPITAL LETTER GAMMA      ");
        PrintProperties('\u0393');

        Console.Write("U+0039 DIGIT NINE                      ");
        PrintProperties('9');

        Console.Write("U+00B2 SUPERSCRIPT TWO                 ");
        PrintProperties('\u00B2');

        Console.Write("U+00BC VULGAR FRACTION ONE QUARTER     ");
        PrintProperties('\u00BC');

        Console.Write("U+0BEF TAMIL DIGIT NINE                ");
        PrintProperties('\u0BEF');

        Console.Write("U+0BF0 TAMIL NUMBER TEN                ");
        PrintProperties('\u0BF0');

        Console.Write("U+0F33 TIBETAN DIGIT HALF ZERO         ");
        PrintProperties('\u0F33');

        Console.Write("U+2788 CIRCLED SANS-SERIF DIGIT NINE   ");
        PrintProperties('\u2788');
    } //main
   
    public static void PrintProperties(char c)
    {
        Console.Write(" {0,-3}", System.Convert.ToString( c));
        Console.Write(" {0,-5}", 
            System.Convert.ToString(CharUnicodeInfo.GetNumericValue(c)));
        Console.Write(" {0,-5}", 
            System.Convert.ToString(CharUnicodeInfo.GetDigitValue(c)));
        Console.Write(" {0,-5}",
            System.Convert.ToString( CharUnicodeInfo.GetDecimalDigitValue(c)));
        Console.WriteLine("{0}", 
            System.Convert.ToString(CharUnicodeInfo.GetUnicodeCategory(c)));
    } //PrintProperties
} //SamplesCharUnicodeInfo

/*
This code produces the following output.  
Some characters might not display at the console.

                                        c  Num   Dig   Dec   UnicodeCategory
U+0061 LATIN SMALL LETTER A             a   -1    -1    -1   LowercaseLetter
U+0393 GREEK CAPITAL LETTER GAMMA       \u0393   -1    -1    -1   
UppercaseLetter
U+0039 DIGIT NINE                       9   9     9     9    DecimalDigitNumber
U+00B2 SUPERSCRIPT TWO                  \u00B2   2     2     2    OtherNumber
U+00BC VULGAR FRACTION ONE QUARTER      \u00BC   0.25  -1    -1   OtherNumber
U+0BEF TAMIL DIGIT NINE                 \u0BEF   9     9     9    
DecimalDigitNumber
U+0BF0 TAMIL NUMBER TEN                 \u0BF0   10    -1    -1   OtherNumber
U+0F33 TIBETAN DIGIT HALF ZERO          \u0F33   -0.5  -1    -1   OtherNumber
U+2788 CIRCLED SANS-SERIF DIGIT NINE    \u2788   9     9     -1   OtherNumber
*/

System.Object
  System.Globalization.CharUnicodeInfo

Any public static (Shared in Visual Basic) members of this type are thread safe. Any instance members are not guaranteed to be thread safe.

Windows 98, Windows 2000 SP4, Windows CE, Windows Millennium Edition, Windows Mobile for Pocket PC, Windows Mobile for Smartphone, Windows Server 2003, Windows XP Media Center Edition, Windows XP Professional x64 Edition, Windows XP SP2, Windows XP Starter Edition

The .NET Framework does not support all versions of every platform. For a list of the supported versions, see System Requirements.

.NET Framework

Supported in: 2.0

.NET Compact Framework

Supported in: 2.0

Community Additions

ADD
Show:
© 2014 Microsoft