2.2.2 Supported Codepage Data Files

The mapping of UTF-16 strings to codepages relies on codepage data files to provide conversion data. These codepage data files map Unicode characters to characters in a single-byte character set (SBCS) or double-byte character set (DBCS).

The data files of supported system codepages are published as specified in [CODEPAGEFILES], [UNICODE], and [UNICODE-BESTFIT]. The location identification uses a simple file-naming convention, which is bestfitxxxx.txt, where xxxx is the codepage number. For example, bestfit950.txt contains the data for codepage 950, and bestfit1252.txt contains the data for codepage 1252.

The pseudocode assumes all these codepage files are available.