2.2.3.13 Chars8Text Record (0x98)

This structure represents attribute or element content.


0


1


2


3


4


5


6


7


8


9

1
0


1


2


3


4


5


6


7


8


9

2
0


1


2


3


4


5


6


7


8


9

3
0


1

Length

Bytes (variable)

...

Length (1 byte): This is the length in bytes of the UTF-8 [RFC2279]-encoded string and is represented as UINT8.

Bytes (variable): The string encoded as UTF-8 [RFC2279] bytes.

For example, if length is 3 and bytes = { 0x41, 0x42, 0x43 }, this record is interpreted as the following characters.

 abc

UTF-8 [RFC2279]-encoded sequences MUST be fully formed. There MUST not be any partial UTF-8 [RFC2279] sequences within a record. UTF-8 [RFC2279] sequences that expand to a low surrogate character MUST be paired with a high surrogate character. (For more information on surrogate characters, see [UNICODE].)