Up ] Docs Home ]

Values for code page

Back ] Next ]

Offset Description

0

  Reduced 7-bit ASCII [NetWare]

37

EBCDIC: US/Canada English (CECP) [Windows NT 3.51+]

38

EBCDIC: International (old)

111

  Greek

112

  Turkish

113

  Yugoslavian

161

  Arabic [Linux]

162

  Arabic [Linux]

163

  Arabic [Linux]

164

  Arabic [Linux]

165

  Arabic [Linux]

237

EBCDIC: Germany (CECP)

273

EBCDIC: ??? (CECP)

274

EBCDIC: Belgium

275

EBCDIC: Brazilian

277

EBCDIC: Danish/Norwegian (CECP)

278

EBCDIC: Finnish/Swedish (CECP)

280

EBCDIC: Italian (CECP)

281

EBCDIC: Japanese-E

284

EBCDIC: Latin-American/Spanish (CECP)

285

EBCDIC: UK English (CECP)

290

EBCDIC: Japanese Kana

297

EBCDIC: French (CECP)

367

  US-ASCII (ISO 646-US, 7-bit)

420

EBCDIC: Arabic 1

423

EBCDIC: Greek

424

EBCDIC: Hebrew

437

  US / English / PC-8 / IBM-2

500

EBCDIC: Belgium/Switzerland (CECP)

500

EBCDIC: International

646

  (??? reserved for ISO 646 7-bit codepages)

667

  Eastern Europe (Polish)

668

  Eastern Europe (Slavic)

708

  Arabic/Middle East

737

  Greek (2)

775

  Baltic / Baltic Rim

819

  Latin-1 (ISO 8859-1)

850

  Multilingual (Latin-1)

851

  Greek

852

  Slavic/Easter Europe (Latin-2) [DOS 5+]

853

  Turkish (Latin-2)

854

  Spanish

855

  Cyrillic (1)

857

  Turkish

860

  Portuguese

861

  Icelandic

862

  Hebrew

863

  French Canadian

864

  Arabic/Middle East

865

  Nordic (Norwegian/Danish)

866

  Russian (Cyrillic 2)

867

  Czech

868

  Arabic

869

  Greek (1)

870

EBCDIC: Yugoslavia (Roece)

871

EBCDIC: Icelandic

874

  Thailand

875

EBCDIC: Greek

880

  Russian (Cyrillic GOST)

880

EBCDIC: Cyrillic

881

  Latin 1 (ISO 8859-1)

882

  Latin 2 (ISO 8859-2)

883

  Latin 3 (ISO 8859-3)

884

  Latin 4 (ISO 8859-4)

885

  Latin 5 (ISO 8859-5)

891

  unknown

897

  Japanese (Shift-JIS)

903

  unknown

904

  unknown

905

EBCDIC: Turkish

912

  Latin 2 (ISO 8859-2: Eastern Europe)

913

  (??? reserved for Latin 3)

914

  (??? reserved for Latin 4)

915

  Cyrillic (ISO 8859-5: Latin/Cyrillic)

916

  (??? reserved for ISO 8859-6: Latin/Arabic)

917

  (??? reserved for ISO 8859-7: Latin/Greek)

918

EBCDIC: Arabic 2

919

  (??? reserved for ISO 8859-9: Latin 5)

920

  (??? reserved for ISO 8859-10: Latin 6/Sami)

932

DBCS: Japanese (Shift-JIS)

934

DBCS: Korean

936

DBCS: Chinese (PRC/ROC, Simplified/xGB)

938

DBCS: Taiwan

938

DBCS: Chinese (PRC/ROC)

942

DBCS: Japanese SAA

944

DBCS: Korean SAA

948

DBCS: Chinese SAA (PRC/ROC)

949

  Korean (Unified Hangul; Extended Wansung)

950

  Chinese Traditional, Big5 (Taiwan, Hong Kong)

966

  Saudi Arabian

972

  Hebrew (Israeli VT100)

999

  reserved for user-definable codepages

1004

  Desktop Publishing

1026

EBCDIC: Turkish (Latin 5)

1047

EBCDIC: International (CECP, de-facto EBCDIC-US)

1250

MSWIN: Eastern Europe (Latin 2)

1251

MSWIN: Cyrillic

1252

MSWIN: English/W. Europe/Standard (Latin 1)

1253

MSWIN: Greek (GRC)

1254

MSWIN: Turkish

1255

MSWIN: Hebrew

1256

MSWIN: Arabic

1257

MSWIN: Baltic (Estonian, Latvian, Lithuanian)

1258

MSWIN: Vietnamese

1361

ANSI???: Korean (Johab)

10000

MAC: International/Standard (Roman)

10006

MAC: Greek

10007

MAC: Cyrillic

10029

MAC: Latin 2

10079

MAC: Icelandic

10081

MAC: Turkish

10646

  (should be reserved for the future ISO 10646 32-bit codepage???)

65400

OS/2: reserved for Glyphs

Note:

Not all code pages are available in all versions of DOS or DOS-compatibles, and many (particularly EBCDIC) have not been implemented for *any* DOS to date.
CECP = 'Country Extended CodePage' by IBM.

Unicode (UCS-2) is a 16-bit character codeset, covering all commonly used characters from almost any language. Not all definitions are fixed at the time of this writing. Unicode will be the future of character coding for the foreseeable future, but is only the "basic multilingual plane" (BMP) subset of 32-bit ISO 10646 codes (UCS-4), a single character set standard covering requirements for all countries and languages, which is still under construction.

The MS Windows 'ANSI' codepage 1252 (based on the MS Windows 3.0+ implementation) appears to be 100% compatible with the code sets used by Amiga OS and Acorn Archimedes RISC-OS and is also a linear subset of the 16bit UniCode code set (UCS-2); the actual ANSI codepage is defined by ISO 8859-1 (Latin 1).

At least applications for OS/2 Warp 3 Presentation Manager can use EBCDIC codepages, but the codepage ID assignments for EBCDIC codepages are not known for OS/2. OS/2 SAA codepages are not supported in CONFIG.SYS. Codepage 65400 "Glyphs" is not actually a codepage, but a way to directly access the first 256 of the 383 glyphs from the current font set.

Novell DOS 7/DR DOS 6/Caldera OpenDOS undocumented codepage 853 does not necessarily match with MS-DOS' undocumented codepage 853.

Undocumented codepages 667 and 668 can be found in Russian's PTS/DOS 6.51 and S/DOS 1.x DISPLAY.CPI and contain some Eastern European characters.

Novell NetWare 3.xx clients support UniCode and codepages 437, 850, 860, 863, 865, 897, 932, and 1252 (possibly more). NetWare 4.xx clients also support 1250, 1251, 1256. Personal NetWare 1.0 (PNW), as it was distributed in Europe, supports UniCode and codepages 437, 850 and 1252. Novell's Client32 for DOS/Windows supports 874, 932, 936, 949, 950, 1250 - 1257. For codesets not yet available, Novell offers a reduced ASCII 7-bit support through a codepage 0 used as a translation table to UniCode, that supports characters 32-127 except 92 ('\').