Language Character Sets

Note: This page covers an older method of character encoding that uses a single byte per character. These encodings were common in pre-HTML5 browsers. Current HTML5-compatible browsers use Unicode, typically encoded as UTF-8, a variable-length encoding that uses one to four bytes per character.
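As a quick illustration of how UTF-8 byte counts vary, the following minimal Python sketch encodes a few sample characters and records how many bytes each one needs. The sample characters and the variable name utf8_lengths are chosen here for illustration only.

```python
# Sample characters picked for illustration: ASCII, accented Latin, euro sign, emoji.
samples = ["A", "é", "€", "😀"]

# Encode each character as UTF-8 and record how many bytes it occupies.
utf8_lengths = {ch: len(ch.encode("utf-8")) for ch in samples}

for ch, n in utf8_lengths.items():
    print(f"U+{ord(ch):04X} takes {n} byte(s) in UTF-8")
```

A single-byte character set, by contrast, always stores exactly one byte per character, which is why it can represent at most 256 distinct codes.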

The table below lists the ISO and Windows code pages for single-byte character sets (SBCS). A single-byte character set has 2^8 = 256 codes, numbered 0 to 255. The first 128 codes (0 to 127) are identical to 7-bit ASCII. Codes 128 to 255 represent the characters of the set's additional language or script, such as Greek, Hebrew, or Turkish, along with additional special characters.
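The split between the shared lower half and the language-specific upper half can be checked with a short Python sketch. It assumes Python's built-in codec names for four ISO-8859 parts; the byte 0xF0 is an arbitrary example from the upper range, and the variable names are illustrative.

```python
# Python's standard codec names for four ISO-8859 parts (an illustrative selection).
CODECS = ["iso8859_1", "iso8859_2", "iso8859_5", "iso8859_9"]

# Codes 0..127: compare each codec's decoding against plain ASCII.
ascii_range = bytes(range(128))
ascii_text = ascii_range.decode("ascii")
lower_half_matches = {
    name: ascii_range.decode(name) == ascii_text for name in CODECS
}

# Codes 128..255: the same byte value maps to a different character per set.
high_byte = bytes([0xF0])
upper_half = {name: high_byte.decode(name) for name in CODECS}

for name in CODECS:
    ch = upper_half[name]
    print(f"{name}: lower half matches ASCII = {lower_half_matches[name]}, "
          f"0xF0 decodes to U+{ord(ch):04X} ({ch})")
```

Because the byte itself carries no information about which set it belongs to, text in a single-byte encoding can only be read correctly when the character set is declared or known in advance.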

Each link in the ISO code column opens a page that lists the Unicode code points for the characters in that ISO-8859 set.

| ISO code | Win. codepage | Alias | Used for Languages |
|---|---|---|---|
| ISO-8859-1 | 1252 | Latin-1 | Western European languages: Albanian, Basque, Catalan, Danish, Dutch, English, Faroese, Finnish, French, Galician, German, Icelandic, Irish, Italian, Malay, Norwegian, Portuguese, Spanish, Swedish. African languages Afrikaans and Swahili. Southeast Asian Indonesian. |
| ISO-8859-2 | 1250 | Latin-2 | Central Europe languages: Bosnian, Croatian, Czech, Hungarian, Polish, Romanian, Serbian (Latin), Slovak, Slovenian |
| ISO-8859-3 | | Latin-3 | Superseded by ISO-8859-9 for Turkish |
| ISO-8859-4 | 1257 | Latin-4 | Baltic languages: Estonian, Latvian, Lithuanian. |
| ISO-8859-5 | 1251 | Latin/Cyrillic | Cyrillic languages: Azeri, Belarusian, Bulgarian, Macedonian, Kazakh, Kyrgyz, Mongolian, Russian, Serbian, Tatar, Ukrainian, Uzbek |
| ISO-8859-6 | 1256 | Latin/Arabic | Arabic, Farsi, Urdu |
| ISO-8859-9 | 1254 | Latin-5 | Turkic languages: Azeri (Latin), Turkish, Uzbek (Latin) |
| ISO-8859-10 | | Latin-6 | Nordic languages |
| ISO-8859-12 | | | Not used |
| ISO-8859-13 | | Latin-7 | Baltic rim |
| ISO-8859-15 | | Latin-9 | Revision of ISO-8859-1 |
| ISO-8859-16 | | Latin-10 | Intended for Albanian, Croatian, Hungarian, Italian, Polish, Romanian and Slovene, but also Finnish, French, German and Irish Gaelic |
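An ISO set and the Windows codepage listed beside it are related but generally not byte-for-byte identical. The Python sketch below, which assumes Python's built-in codec names and uses a hypothetical helper called differing_bytes, counts how many codes in the range 128 to 255 decode to different characters for each pair listed in the table.

```python
# ISO-8859 part -> Windows codepage, as paired in the table (Python codec names).
PAIRS = {
    "iso8859_1": "cp1252",
    "iso8859_2": "cp1250",
    "iso8859_4": "cp1257",
    "iso8859_5": "cp1251",
    "iso8859_6": "cp1256",
    "iso8859_9": "cp1254",
}

def differing_bytes(iso_codec, win_codec):
    """Return the byte values in 128..255 that decode to different characters."""
    diffs = []
    for value in range(128, 256):
        raw = bytes([value])
        # errors="replace" turns unassigned bytes into U+FFFD instead of raising.
        iso_char = raw.decode(iso_codec, errors="replace")
        win_char = raw.decode(win_codec, errors="replace")
        if iso_char != win_char:
            diffs.append(value)
    return diffs

diff_counts = {iso: len(differing_bytes(iso, win)) for iso, win in PAIRS.items()}

for iso, win in PAIRS.items():
    print(f"{iso} vs {win}: {diff_counts[iso]} of 128 upper codes differ")
```

For example, byte 0x80 is the euro sign in Windows codepage 1252 but a control code in ISO-8859-1, so text should be decoded with the exact character set it was written in rather than with its nearest relative.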