UTF32Encoding Class
Represents a UTF-32 encoding of Unicode characters.
Assembly: mscorlib (in mscorlib.dll)
The UTF32Encoding type exposes the following members.

Constructors

| Name | Description |
|---|---|
| UTF32Encoding() | Initializes a new instance of the UTF32Encoding class. |
| UTF32Encoding(Boolean, Boolean) | Initializes a new instance of the UTF32Encoding class. Parameters specify whether to use the big endian byte order and whether to provide a Unicode byte order mark. |
| UTF32Encoding(Boolean, Boolean, Boolean) | Initializes a new instance of the UTF32Encoding class. Parameters specify whether to use the big endian byte order, whether to provide a Unicode byte order mark, and whether to throw an exception when an invalid encoding is detected. |


Properties

| Name | Description |
|---|---|
| BodyName | When overridden in a derived class, gets a name for the current encoding that can be used with mail agent body tags. (Inherited from Encoding.) |
| CodePage | When overridden in a derived class, gets the code page identifier of the current Encoding. (Inherited from Encoding.) |
| DecoderFallback | Gets or sets the DecoderFallback object for the current Encoding object. (Inherited from Encoding.) |
| EncoderFallback | Gets or sets the EncoderFallback object for the current Encoding object. (Inherited from Encoding.) |
| EncodingName | When overridden in a derived class, gets the human-readable description of the current encoding. (Inherited from Encoding.) |
| HeaderName | When overridden in a derived class, gets a name for the current encoding that can be used with mail agent header tags. (Inherited from Encoding.) |
| IsBrowserDisplay | When overridden in a derived class, gets a value indicating whether the current encoding can be used by browser clients for displaying content. (Inherited from Encoding.) |
| IsBrowserSave | When overridden in a derived class, gets a value indicating whether the current encoding can be used by browser clients for saving content. (Inherited from Encoding.) |
| IsMailNewsDisplay | When overridden in a derived class, gets a value indicating whether the current encoding can be used by mail and news clients for displaying content. (Inherited from Encoding.) |
| IsMailNewsSave | When overridden in a derived class, gets a value indicating whether the current encoding can be used by mail and news clients for saving content. (Inherited from Encoding.) |
| IsReadOnly | When overridden in a derived class, gets a value indicating whether the current encoding is read-only. (Inherited from Encoding.) |
| IsSingleByte | When overridden in a derived class, gets a value indicating whether the current encoding uses single-byte code points. (Inherited from Encoding.) |
| WebName | When overridden in a derived class, gets the name registered with the Internet Assigned Numbers Authority (IANA) for the current encoding. (Inherited from Encoding.) |
| WindowsCodePage | When overridden in a derived class, gets the Windows operating system code page that most closely corresponds to the current encoding. (Inherited from Encoding.) |


Methods

| Name | Description |
|---|---|
| Clone | When overridden in a derived class, creates a shallow copy of the current Encoding object. (Inherited from Encoding.) |
| Equals | Determines whether the specified Object is equal to the current UTF32Encoding object. (Overrides Encoding::Equals(Object).) |
| Finalize | Allows an object to try to free resources and perform other cleanup operations before it is reclaimed by garbage collection. (Inherited from Object.) |
| GetByteCount(array<Char>) | When overridden in a derived class, calculates the number of bytes produced by encoding all the characters in the specified character array. (Inherited from Encoding.) |
| GetByteCount(String) | Calculates the number of bytes produced by encoding the characters in the specified String. (Overrides Encoding::GetByteCount(String).) |
| GetByteCount(Char*, Int32) | Calculates the number of bytes produced by encoding a set of characters starting at the specified character pointer. (Overrides Encoding::GetByteCount(Char*, Int32).) |
| GetByteCount(array<Char>, Int32, Int32) | Calculates the number of bytes produced by encoding a set of characters from the specified character array. (Overrides Encoding::GetByteCount(array<Char>, Int32, Int32).) |
| GetBytes(array<Char>) | When overridden in a derived class, encodes all the characters in the specified character array into a sequence of bytes. (Inherited from Encoding.) |
| GetBytes(String) | When overridden in a derived class, encodes all the characters in the specified string into a sequence of bytes. (Inherited from Encoding.) |
| GetBytes(array<Char>, Int32, Int32) | When overridden in a derived class, encodes a set of characters from the specified character array into a sequence of bytes. (Inherited from Encoding.) |
| GetBytes(Char*, Int32, Byte*, Int32) | Encodes a set of characters starting at the specified character pointer into a sequence of bytes that are stored starting at the specified byte pointer. (Overrides Encoding::GetBytes(Char*, Int32, Byte*, Int32).) |
| GetBytes(array<Char>, Int32, Int32, array<Byte>, Int32) | Encodes a set of characters from the specified character array into the specified byte array. (Overrides Encoding::GetBytes(array<Char>, Int32, Int32, array<Byte>, Int32).) |
| GetBytes(String, Int32, Int32, array<Byte>, Int32) | Encodes a set of characters from the specified String into the specified byte array. (Overrides Encoding::GetBytes(String, Int32, Int32, array<Byte>, Int32).) |
| GetCharCount(array<Byte>) | When overridden in a derived class, calculates the number of characters produced by decoding all the bytes in the specified byte array. (Inherited from Encoding.) |
| GetCharCount(Byte*, Int32) | Calculates the number of characters produced by decoding a sequence of bytes starting at the specified byte pointer. (Overrides Encoding::GetCharCount(Byte*, Int32).) |
| GetCharCount(array<Byte>, Int32, Int32) | Calculates the number of characters produced by decoding a sequence of bytes from the specified byte array. (Overrides Encoding::GetCharCount(array<Byte>, Int32, Int32).) |
| GetChars(array<Byte>) | When overridden in a derived class, decodes all the bytes in the specified byte array into a set of characters. (Inherited from Encoding.) |
| GetChars(array<Byte>, Int32, Int32) | When overridden in a derived class, decodes a sequence of bytes from the specified byte array into a set of characters. (Inherited from Encoding.) |
| GetChars(Byte*, Int32, Char*, Int32) | Decodes a sequence of bytes starting at the specified byte pointer into a set of characters that are stored starting at the specified character pointer. (Overrides Encoding::GetChars(Byte*, Int32, Char*, Int32).) |
| GetChars(array<Byte>, Int32, Int32, array<Char>, Int32) | Decodes a sequence of bytes from the specified byte array into the specified character array. (Overrides Encoding::GetChars(array<Byte>, Int32, Int32, array<Char>, Int32).) |
| GetDecoder | Obtains a decoder that converts a UTF-32 encoded sequence of bytes into a sequence of Unicode characters. (Overrides Encoding::GetDecoder().) |
| GetEncoder | Obtains an encoder that converts a sequence of Unicode characters into a UTF-32 encoded sequence of bytes. (Overrides Encoding::GetEncoder().) |
| GetHashCode | Returns the hash code for the current instance. (Overrides Encoding::GetHashCode().) |
| GetMaxByteCount | Calculates the maximum number of bytes produced by encoding the specified number of characters. (Overrides Encoding::GetMaxByteCount(Int32).) |
| GetMaxCharCount | Calculates the maximum number of characters produced by decoding the specified number of bytes. (Overrides Encoding::GetMaxCharCount(Int32).) |
| GetPreamble | Returns a Unicode byte order mark encoded in UTF-32 format, if the constructor for this instance requests a byte order mark. (Overrides Encoding::GetPreamble().) |
| GetString(array<Byte>) | When overridden in a derived class, decodes all the bytes in the specified byte array into a string. (Inherited from Encoding.) |
| GetString(array<Byte>, Int32, Int32) | Decodes a range of bytes from a byte array into a string. (Overrides Encoding::GetString(array<Byte>, Int32, Int32).) |
| GetType | Gets the Type of the current instance. (Inherited from Object.) |
| IsAlwaysNormalized() | Gets a value indicating whether the current encoding is always normalized, using the default normalization form. (Inherited from Encoding.) |
| IsAlwaysNormalized(NormalizationForm) | When overridden in a derived class, gets a value indicating whether the current encoding is always normalized, using the specified normalization form. (Inherited from Encoding.) |
| MemberwiseClone | Creates a shallow copy of the current Object. (Inherited from Object.) |
| ToString | Returns a string that represents the current object. (Inherited from Object.) |

Encoding is the process of transforming a set of Unicode characters into a sequence of bytes. Decoding is the process of transforming a sequence of encoded bytes into a set of Unicode characters.
The UTF-32 encoding represents each code point as a 32-bit integer. For more information about the UTFs and other encodings supported by System.Text, see Character Encoding in the .NET Framework and Using Unicode Encoding.
The GetByteCount method determines how many bytes result from encoding a set of Unicode characters, and the GetBytes method performs the actual encoding.
Likewise, the GetCharCount method determines how many characters result from decoding a sequence of bytes, and the GetChars and GetString methods perform the actual decoding.
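The count-then-encode pattern described above can be illustrated outside .NET. The following is a minimal sketch in portable C++, not the UTF32Encoding implementation: the helper names (`is_pair_at`, `utf32_byte_count`, `utf32_encode_le`) are hypothetical, the input is assumed to be UTF-16 code units, and unpaired surrogates are passed through unchanged rather than handled by a fallback.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical helpers for illustration; none of these are .NET APIs.

// True if s[i] and s[i + 1] form a UTF-16 surrogate pair.
bool is_pair_at(const std::u16string& s, std::size_t i) {
    return i + 1 < s.size() &&
           s[i] >= 0xD800 && s[i] <= 0xDBFF &&
           s[i + 1] >= 0xDC00 && s[i + 1] <= 0xDFFF;
}

// Pass 1: every code point takes 4 bytes; a surrogate pair is one code point.
std::size_t utf32_byte_count(const std::u16string& s) {
    std::size_t count = 0;
    for (std::size_t i = 0; i < s.size(); ++i) {
        if (is_pair_at(s, i)) ++i;  // skip the trail unit of the pair
        count += 4;
    }
    return count;
}

// Pass 2: write little-endian UTF-32 into a buffer sized by pass 1.
// Assumption of this sketch: unpaired surrogates are copied as-is.
std::vector<std::uint8_t> utf32_encode_le(const std::u16string& s) {
    std::vector<std::uint8_t> out;
    out.reserve(utf32_byte_count(s));
    for (std::size_t i = 0; i < s.size(); ++i) {
        std::uint32_t cp = s[i];
        if (is_pair_at(s, i)) {
            cp = 0x10000u +
                 ((static_cast<std::uint32_t>(s[i]) - 0xD800u) << 10) +
                 (static_cast<std::uint32_t>(s[i + 1]) - 0xDC00u);
            ++i;
        }
        for (int shift = 0; shift < 32; shift += 8)  // least significant byte first
            out.push_back(static_cast<std::uint8_t>((cp >> shift) & 0xFFu));
    }
    assert(out.size() == utf32_byte_count(s));
    return out;
}
```

Sizing the buffer in a first pass avoids reallocation during the second. Under these rules, a string of seven UTF-16 code units that ends in one surrogate pair holds six code points and needs 24 bytes.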
UTF32Encoding corresponds to the Windows code pages 12000 (little endian byte order) and 12001 (big endian byte order).
The encoder can use the big endian byte order (most significant byte first) or the little endian byte order (least significant byte first). For example, the Latin Capital Letter A (code point U+0041) is serialized as follows (in hexadecimal):
Big endian byte order: 00 00 00 41
Little endian byte order: 41 00 00 00
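The two serializations listed above can be reproduced with a few shifts. This is a sketch in portable C++ rather than a use of the UTF32Encoding class; `to_utf32_be` and `to_utf32_le` are hypothetical names chosen for illustration.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helpers for illustration; not part of the .NET API.

// Serialize one code point with the most significant byte first.
std::array<std::uint8_t, 4> to_utf32_be(std::uint32_t cp) {
    assert(cp <= 0x10FFFF);  // Unicode code points end at U+10FFFF
    std::array<std::uint8_t, 4> out{{
        static_cast<std::uint8_t>((cp >> 24) & 0xFFu),
        static_cast<std::uint8_t>((cp >> 16) & 0xFFu),
        static_cast<std::uint8_t>((cp >> 8) & 0xFFu),
        static_cast<std::uint8_t>(cp & 0xFFu)}};
    return out;
}

// Serialize one code point with the least significant byte first.
std::array<std::uint8_t, 4> to_utf32_le(std::uint32_t cp) {
    assert(cp <= 0x10FFFF);
    std::array<std::uint8_t, 4> out{{
        static_cast<std::uint8_t>(cp & 0xFFu),
        static_cast<std::uint8_t>((cp >> 8) & 0xFFu),
        static_cast<std::uint8_t>((cp >> 16) & 0xFFu),
        static_cast<std::uint8_t>((cp >> 24) & 0xFFu)}};
    return out;
}
```

Calling `to_utf32_be(0x41)` yields the bytes 00 00 00 41, and `to_utf32_le(0x41)` yields 41 00 00 00, matching the listing above.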
It is generally more efficient to store Unicode characters using the native byte order. For example, it is better to use the little endian byte order on little endian platforms, such as Intel computers.
The GetPreamble method retrieves an array of bytes that can include the byte order mark (BOM). If this byte array is prefixed to an encoded stream, it helps the decoder to identify the encoding format used.
For more information on byte order and the byte order mark, see The Unicode Standard at the Unicode home page.
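To make the role of the byte order mark concrete, the sketch below prefixes a UTF-32 BOM to a payload and later detects the byte order from it. The BOM is the code point U+FEFF serialized in the chosen byte order: 00 00 FE FF for big endian and FF FE 00 00 for little endian. The code is portable C++ and does not call GetPreamble; `Utf32Order`, `with_preamble`, and `detect_utf32_bom` are hypothetical names.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical types and helpers for illustration; not part of the .NET API.
enum class Utf32Order { Unknown, BigEndian, LittleEndian };

// Prefix the UTF-32 byte order mark (U+FEFF) for the given byte order.
std::vector<std::uint8_t> with_preamble(Utf32Order order,
                                        const std::vector<std::uint8_t>& payload) {
    assert(order != Utf32Order::Unknown);
    std::vector<std::uint8_t> out =
        (order == Utf32Order::BigEndian)
            ? std::vector<std::uint8_t>{0x00, 0x00, 0xFE, 0xFF}
            : std::vector<std::uint8_t>{0xFF, 0xFE, 0x00, 0x00};
    out.insert(out.end(), payload.begin(), payload.end());
    return out;
}

// Inspect the first four bytes; report Unknown when no UTF-32 BOM is present.
Utf32Order detect_utf32_bom(const std::vector<std::uint8_t>& data) {
    if (data.size() < 4) return Utf32Order::Unknown;
    if (data[0] == 0x00 && data[1] == 0x00 && data[2] == 0xFE && data[3] == 0xFF)
        return Utf32Order::BigEndian;
    if (data[0] == 0xFF && data[1] == 0xFE && data[2] == 0x00 && data[3] == 0x00)
        return Utf32Order::LittleEndian;
    return Utf32Order::Unknown;
}
```

A decoder that finds no mark has to learn the byte order some other way, which is why prefixing the preamble to a stream is helpful.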
| Note |
|---|
| To enable error detection and to make the class instance more secure, the application should use the UTF32Encoding constructor that takes a throwOnInvalidCharacters parameter, and set that parameter to true. With error detection, a method that detects an invalid sequence of characters or bytes throws an ArgumentException. Without error detection, no exception is thrown, and the invalid sequence is generally ignored. |
The following code example demonstrates the behavior of UTF32Encoding, both with error detection enabled and without.
```cpp
using namespace System;
using namespace System::Text;

void PrintDecodedString( array<Byte>^bytes, Encoding^ enc );

int main()
{
   // Create an instance of UTF32Encoding using little-endian byte order.
   // This will be used for encoding.
   UTF32Encoding^ u32LE = gcnew UTF32Encoding( false,true );

   // Create two instances of UTF32Encoding using big-endian byte order:
   // one with error detection and one without.
   // These will be used for decoding.
   UTF32Encoding^ u32withED = gcnew UTF32Encoding( true,true,true );
   UTF32Encoding^ u32noED = gcnew UTF32Encoding( true,true,false );

   // Create byte arrays from the same string containing the following characters:
   //    Latin Small Letter Z (U+007A)
   //    Latin Small Letter A (U+0061)
   //    Combining Breve (U+0306)
   //    Latin Small Letter AE With Acute (U+01FD)
   //    Greek Small Letter Beta (U+03B2)
   //    a high surrogate (U+D8FF) followed by a low surrogate (U+DCFF)
   String^ myStr = L"za\u0306\u01FD\u03B2\xD8FF\xDCFF";

   // Encode the string using little-endian byte order.
   array<Byte>^myBytes = gcnew array<Byte>(u32LE->GetByteCount( myStr ));
   u32LE->GetBytes( myStr, 0, myStr->Length, myBytes, 0 );

   // Decode the little-endian bytes as big-endian, with error detection.
   Console::WriteLine( "Decoding with error detection:" );
   PrintDecodedString( myBytes, u32withED );

   // Decode the same bytes as big-endian, without error detection.
   Console::WriteLine( "Decoding without error detection:" );
   PrintDecodedString( myBytes, u32noED );
}

// Decode the bytes and display the string.
void PrintDecodedString( array<Byte>^bytes, Encoding^ enc )
{
   try
   {
      Console::WriteLine( "   Decoded string: {0}", enc->GetString( bytes, 0, bytes->Length ) );
   }
   catch ( System::ArgumentException^ e )
   {
      Console::WriteLine( e );
   }

   Console::WriteLine();
}
```
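The example above decodes little-endian bytes as big-endian, which produces 32-bit values that are not valid code points. The following portable C++ sketch shows why, and how a decoder might react with and without error detection. It is an illustration only, not the UTF32Encoding implementation: `decode_utf32_be` is a hypothetical helper, `std::invalid_argument` stands in for ArgumentException, and dropping invalid values when detection is off is an assumption that follows the "generally ignored" wording of the note.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical helper for illustration; not part of the .NET API.
// Decodes big-endian UTF-32 bytes into code points.
std::vector<std::uint32_t> decode_utf32_be(const std::vector<std::uint8_t>& bytes,
                                           bool throw_on_invalid) {
    std::vector<std::uint32_t> out;
    std::size_t i = 0;
    for (; i + 4 <= bytes.size(); i += 4) {
        std::uint32_t cp = (static_cast<std::uint32_t>(bytes[i]) << 24) |
                           (static_cast<std::uint32_t>(bytes[i + 1]) << 16) |
                           (static_cast<std::uint32_t>(bytes[i + 2]) << 8) |
                           static_cast<std::uint32_t>(bytes[i + 3]);
        // Valid scalar values are at most U+10FFFF and are not surrogates.
        bool valid = cp <= 0x10FFFFu && !(cp >= 0xD800u && cp <= 0xDFFFu);
        if (valid) {
            out.push_back(cp);
        } else if (throw_on_invalid) {
            throw std::invalid_argument("invalid UTF-32 value");
        }
        // Assumption of this sketch: without detection, the value is dropped.
    }
    if (i != bytes.size() && throw_on_invalid)
        throw std::invalid_argument("truncated UTF-32 input");
    assert(out.size() <= bytes.size() / 4);
    return out;
}
```

For instance, the letter z (U+007A) encoded as little endian is 7A 00 00 00. Read as big endian, that is 0x7A000000, far above U+10FFFF, so the strict call throws while the lenient call returns an empty result.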
Supported platforms: Windows 7, Windows Vista SP1 or later, Windows XP SP3, Windows XP SP2 x64 Edition, Windows Server 2008 (Server Core not supported), Windows Server 2008 R2 (Server Core supported with SP1 or later), Windows Server 2003 SP2
The .NET Framework does not support all versions of every platform. For a list of the supported versions, see .NET Framework System Requirements.
