Values for code page:

Value | Description
---|---
0 | Reduced 7-bit ASCII [NetWare]
37 | EBCDIC: US/Canada English (CECP) [Windows NT 3.51+]
38 | EBCDIC: International (old)
111 | Greek
112 | Turkish
113 | Yugoslavian
161 | Arabic [Linux]
162 | Arabic [Linux]
163 | Arabic [Linux]
164 | Arabic [Linux]
165 | Arabic [Linux]
237 | EBCDIC: Germany (CECP)
273 | EBCDIC: ??? (CECP)
274 | EBCDIC: Belgium
275 | EBCDIC: Brazilian
277 | EBCDIC: Danish/Norwegian (CECP)
278 | EBCDIC: Finnish/Swedish (CECP)
280 | EBCDIC: Italian (CECP)
281 | EBCDIC: Japanese-E
284 | EBCDIC: Latin-American/Spanish (CECP)
285 | EBCDIC: UK English (CECP)
290 | EBCDIC: Japanese Kana
297 | EBCDIC: French (CECP)
367 | US-ASCII (ISO 646-US, 7-bit)
420 | EBCDIC: Arabic 1
423 | EBCDIC: Greek
424 | EBCDIC: Hebrew
437 | US / English / PC-8 / IBM-2
500 | EBCDIC: Belgium/Switzerland (CECP)
500 | EBCDIC: International
646 | (??? reserved for ISO 646 7-bit codepages)
667 | Eastern Europe (Polish)
668 | Eastern Europe (Slavic)
708 | Arabic/Middle East
737 | Greek (2)
775 | Baltic / Baltic Rim
819 | Latin-1 (ISO 8859-1)
850 | Multilingual (Latin-1)
851 | Greek
852 | Slavic/Eastern Europe (Latin-2) [DOS 5+]
853 | Turkish (Latin-2)
854 | Spanish
855 | Cyrillic (1)
857 | Turkish
860 | Portuguese
861 | Icelandic
862 | Hebrew
863 | French Canadian
864 | Arabic/Middle East
865 | Nordic (Norwegian/Danish)
866 | Russian (Cyrillic 2)
867 | Czech
868 | Arabic
869 | Greek (1)
870 | EBCDIC: Yugoslavia (Roece)
871 | EBCDIC: Icelandic
874 | Thailand
875 | EBCDIC: Greek
880 | Russian (Cyrillic GOST)
880 | EBCDIC: Cyrillic
881 | Latin 1 (ISO 8859-1)
882 | Latin 2 (ISO 8859-2)
883 | Latin 3 (ISO 8859-3)
884 | Latin 4 (ISO 8859-4)
885 | Latin 5 (ISO 8859-5)
891 | unknown
897 | Japanese (Shift-JIS)
903 | unknown
904 | unknown
905 | EBCDIC: Turkish
912 | Latin 2 (ISO 8859-2: Eastern Europe)
913 | (??? reserved for Latin 3)
914 | (??? reserved for Latin 4)
915 | Cyrillic (ISO 8859-5: Latin/Cyrillic)
916 | (??? reserved for ISO 8859-6: Latin/Arabic)
917 | (??? reserved for ISO 8859-7: Latin/Greek)
918 | EBCDIC: Arabic 2
919 | (??? reserved for ISO 8859-9: Latin 5)
920 | (??? reserved for ISO 8859-10: Latin 6/Sami)
932 | DBCS: Japanese (Shift-JIS)
934 | DBCS: Korean
936 | DBCS: Chinese (PRC/ROC, Simplified/xGB)
938 | DBCS: Taiwan
938 | DBCS: Chinese (PRC/ROC)
942 | DBCS: Japanese SAA
944 | DBCS: Korean SAA
948 | DBCS: Chinese SAA (PRC/ROC)
949 | Korean (Unified Hangul; Extended Wansung)
950 | Chinese Traditional, Big5 (Taiwan, Hong Kong)
966 | Saudi Arabian
972 | Hebrew (Israeli VT100)
999 | reserved for user-definable codepages
1004 | Desktop Publishing
1026 | EBCDIC: Turkish (Latin 5)
1047 | EBCDIC: International (CECP, de-facto EBCDIC-US)
1250 | MSWIN: Eastern Europe (Latin 2)
1251 | MSWIN: Cyrillic
1252 | MSWIN: English/W. Europe/Standard (Latin 1)
1253 | MSWIN: Greek (GRC)
1254 | MSWIN: Turkish
1255 | MSWIN: Hebrew
1256 | MSWIN: Arabic
1257 | MSWIN: Baltic (Estonian, Latvian, Lithuanian)
1258 | MSWIN: Vietnamese
1361 | ANSI???: Korean (Johab)
10000 | MAC: International/Standard (Roman)
10006 | MAC: Greek
10007 | MAC: Cyrillic
10029 | MAC: Latin 2
10079 | MAC: Icelandic
10081 | MAC: Turkish
10646 | (should be reserved for the future ISO 10646 32-bit codepage???)
65400 | OS/2: reserved for Glyphs


Note:

Not all code pages are available in all versions of DOS or DOS-compatibles, and many (particularly EBCDIC) have not been implemented for *any* DOS to date.

Unicode (UCS-2) is a 16-bit character code set covering all commonly used characters from almost any language. Not all definitions are fixed at the time of this writing. Unicode is expected to be the dominant character coding for the foreseeable future, but it is only the "Basic Multilingual Plane" (BMP) subset of the 32-bit ISO 10646 codes (UCS-4), a single character set standard intended to cover the requirements of all countries and languages, which is still under construction.

The MS Windows 'ANSI' codepage 1252 (based on the MS Windows 3.0+ implementation) appears to be 100% compatible with the code sets used by Amiga OS and Acorn Archimedes RISC-OS and, outside the Windows-specific characters in the range 80h-9Fh, is also a linear subset of the 16-bit Unicode code set (UCS-2); the actual ANSI codepage is defined by ISO 8859-1 (Latin 1).

At least applications for the OS/2 Warp 3 Presentation Manager can use EBCDIC codepages, but the codepage ID assignments for EBCDIC codepages under OS/2 are not known. OS/2 SAA codepages are not supported in CONFIG.SYS. Codepage 65400 "Glyphs" is not actually a codepage, but a way to directly access the first 256 of the 383 glyphs from the current font set.

The undocumented codepage 853 of Novell DOS 7/DR DOS 6/Caldera OpenDOS does not necessarily match the undocumented codepage 853 of MS-DOS. Undocumented codepages 667 and 668 can be found in the DISPLAY.CPI of the Russian PTS/DOS 6.51 and S/DOS 1.x and contain some Eastern European characters.

Novell NetWare 3.xx clients support Unicode and codepages 437, 850, 860, 863, 865, 897, 932, and 1252 (possibly more). NetWare 4.xx clients also support 1250, 1251, and 1256. Personal NetWare 1.0 (PNW), as it was distributed in Europe, supports Unicode and codepages 437, 850, and 1252. Novell's Client32 for DOS/Windows supports 874, 932, 936, 949, 950, and 1250-1257. For code sets not yet available, Novell offers reduced 7-bit ASCII support through a codepage 0 used as a translation table to Unicode, which supports characters 32-127 except 92 ('\').
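
The reduced 7-bit translation described for codepage 0 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Novell's implementation: the names `build_cp0_table` and `cp0_to_unicode` are hypothetical, and mapping unsupported byte values to U+FFFD is an assumption, since the note only says which character codes are supported, not what happens to the rest.

```python
# Sketch of a "codepage 0"-style translation table to Unicode.
# Assumptions (not from the original note):
#   - supported bytes map to the Unicode code point with the same value
#   - unsupported bytes are replaced with U+FFFD

REPLACEMENT = "\ufffd"  # Unicode REPLACEMENT CHARACTER (assumed fallback)


def build_cp0_table():
    """Return a 256-entry list mapping each byte value to a Unicode character."""
    table = [REPLACEMENT] * 256
    for code in range(32, 128):      # characters 32-127 are supported ...
        if code != 92:               # ... except 92 (the backslash)
            table[code] = chr(code)
    return table


CP0_TABLE = build_cp0_table()


def cp0_to_unicode(data: bytes) -> str:
    """Translate a byte string through the reduced 7-bit table."""
    return "".join(CP0_TABLE[b] for b in data)


if __name__ == "__main__":
    # The backslash (92) and any byte outside 32-127 fall back to U+FFFD.
    print(repr(cp0_to_unicode(b"A:\\DIR")))
```

A precomputed 256-entry table keeps the per-byte translation to a single lookup, which is in the spirit of the "translation table" wording in the note; a real client may well handle the excluded characters differently.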