Values for code page

[ Back ] [ Next ]

Offset	Description
0		Reduced 7-bit ASCII	[NetWare]
37	EBCDIC:	US/Canada English (CECP)			[Windows NT 3.51+]
38	EBCDIC:	International (old)
111		Greek
112		Turkish
113		Yugoslavian
161		Arabic		[Linux]
162		Arabic		[Linux]
163		Arabic		[Linux]
164		Arabic		[Linux]
165		Arabic		[Linux]
237	EBCDIC:	Germany (CECP)
273	EBCDIC:	??? (CECP)
274	EBCDIC:	Belgium
275	EBCDIC:	Brazilian
277	EBCDIC:	Danish/Norwegian (CECP)
278	EBCDIC:	Finnish/Swedish (CECP)
280	EBCDIC:	Italian (CECP)
281	EBCDIC:	Japanese-E
284	EBCDIC:	Latin-American/Spanish (CECP)
285	EBCDIC:	UK English (CECP)
290	EBCDIC:	Japanese Kana
297	EBCDIC:	French (CECP)
367		US-ASCII (ISO 646-US, 7-bit)
420	EBCDIC:	Arabic 1
423	EBCDIC:	Greek
424	EBCDIC:	Hebrew
437		US / English / PC-8 / IBM-2
500	EBCDIC:	Belgium/Switzerland (CECP)
500	EBCDIC:	International
646		(??? reserved for ISO 646 7-bit codepages)
667		Eastern Europe (Polish)
668		Eastern Europe (Slavic)
708		Arabic/Middle East
737		Greek (2)
775		Baltic / Baltic Rim
819		Latin-1 (ISO 8859-1)
850		Multilingual (Latin-1)
851		Greek
852		Slavic/Easter Europe (Latin-2)		[DOS 5+]
853		Turkish (Latin-2)
854		Spanish
855		Cyrillic (1)
857		Turkish
860		Portuguese
861		Icelandic
862		Hebrew
863		French Canadian
864		Arabic/Middle East
865		Nordic (Norwegian/Danish)
866		Russian (Cyrillic 2)
867		Czech
868		Arabic
869		Greek (1)
870	EBCDIC:	Yugoslavia (Roece)
871	EBCDIC:	Icelandic
874		Thailand
875	EBCDIC:	Greek
880		Russian (Cyrillic GOST)
880	EBCDIC:	Cyrillic
881		Latin 1 (ISO 8859-1)
882		Latin 2 (ISO 8859-2)
883		Latin 3 (ISO 8859-3)
884		Latin 4 (ISO 8859-4)
885		Latin 5 (ISO 8859-5)
891		unknown
897		Japanese (Shift-JIS)
903		unknown
904		unknown
905	EBCDIC:	Turkish
912		Latin 2 (ISO 8859-2: Eastern Europe)
913		(??? reserved for Latin 3)
914		(??? reserved for Latin 4)
915		Cyrillic (ISO 8859-5: Latin/Cyrillic)
916		(??? reserved for ISO 8859-6: Latin/Arabic)
917		(??? reserved for ISO 8859-7: Latin/Greek)
918	EBCDIC:	Arabic 2
919		(??? reserved for ISO 8859-9: Latin 5)
920		(??? reserved for ISO 8859-10: Latin 6/Sami)
932	DBCS:	Japanese (Shift-JIS)
934	DBCS:	Korean
936	DBCS:	Chinese (PRC/ROC, Simplified/xGB)
938	DBCS:	Taiwan
938	DBCS:	Chinese (PRC/ROC)
942	DBCS:	Japanese SAA
944	DBCS:	Korean SAA
948	DBCS:	Chinese SAA (PRC/ROC)
949		Korean (Unified Hangul; Extended Wansung)
950		Chinese Traditional, Big5 (Taiwan, Hong Kong)
966		Saudi Arabian
972		Hebrew (Israeli VT100)
999		reserved for user-definable codepages
1004		Desktop Publishing
1026	EBCDIC:	Turkish (Latin 5)
1047	EBCDIC:	International (CECP, de-facto EBCDIC-US)
1250	MSWIN:	Eastern Europe (Latin 2)
1251	MSWIN:	Cyrillic
1252	MSWIN:	English/W. Europe/Standard (Latin 1)
1253	MSWIN:	Greek (GRC)
1254	MSWIN:	Turkish
1255	MSWIN:	Hebrew
1256	MSWIN:	Arabic
1257	MSWIN:	Baltic (Estonian, Latvian, Lithuanian)
1258	MSWIN:	Vietnamese
1361	ANSI???:	Korean (Johab)
10000	MAC:	International/Standard (Roman)
10006	MAC:	Greek
10007	MAC:	Cyrillic
10029	MAC:	Latin 2
10079	MAC:	Icelandic
10081	MAC:	Turkish
10646		(should be reserved for the future ISO 10646 32-bit codepage???)
65400	OS/2:	reserved for Glyphs

Note:

Not all code pages are available in all versions of DOS or DOS-compatibles, and many (particularly EBCDIC) have not been implemented for *any* DOS to date.
CECP = 'Country Extended CodePage' by IBM.

Unicode (UCS-2) is a 16-bit character codeset, covering all commonly used characters from almost any language. Not all definitions are fixed at the time of this writing. Unicode will be the future of character coding for the foreseeable future, but is only the "basic multilingual plane" (BMP) subset of 32-bit ISO 10646 codes (UCS-4), a single character set standard covering requirements for all countries and languages, which is still under construction.

The MS Windows 'ANSI' codepage 1252 (based on the MS Windows 3.0+ implementation) appears to be 100% compatible with the code sets used by Amiga OS and Acorn Archimedes RISC-OS and is also a linear subset of the 16bit UniCode code set (UCS-2); the actual ANSI codepage is defined by ISO 8859-1 (Latin 1).

At least applications for OS/2 Warp 3 Presentation Manager can use EBCDIC codepages, but the codepage ID assignments for EBCDIC codepages are not known for OS/2. OS/2 SAA codepages are not supported in CONFIG.SYS. Codepage 65400 "Glyphs" is not actually a codepage, but a way to directly access the first 256 of the 383 glyphs from the current font set.

Novell DOS 7/DR DOS 6/Caldera OpenDOS undocumented codepage 853 does not necessarily match with MS-DOS' undocumented codepage 853.

Undocumented codepages 667 and 668 can be found in Russian's PTS/DOS 6.51 and S/DOS 1.x DISPLAY.CPI and contain some Eastern European characters.

Novell NetWare 3.xx clients support UniCode and codepages 437, 850, 860, 863, 865, 897, 932, and 1252 (possibly more). NetWare 4.xx clients also support 1250, 1251, 1256. Personal NetWare 1.0 (PNW), as it was distributed in Europe, supports UniCode and codepages 437, 850 and 1252. Novell's Client32 for DOS/Windows supports 874, 932, 936, 949, 950, 1250 - 1257. For codesets not yet available, Novell offers a reduced ASCII 7-bit support through a codepage 0 used as a translation table to UniCode, that supports characters 32-127 except 92 ('\').