ISO/IEC 8859

A series of international standards defining 8-bit single-byte character encodings for different language groups, widely used before the adoption of Unicode.

ISO/IEC 8859 was developed to extend ASCII beyond English. ASCII uses only 7 bits, covering 128 characters, enough for basic English but not for accented letters, non-Latin scripts, or language-specific symbols. ISO/IEC 8859 used the full 8-bit byte to add 96 more characters in the upper range, bringing the total to 256 usable positions per part.

The series is divided into numbered parts, each defining a distinct character set for a specific group of languages. These parts are not versions of the same standard. They are parallel standards sharing the same structure but differing in what characters occupy the upper 128 positions. No single part can represent more than one language family at a time, which made multilingual content impossible within the system.

🌍 Who is it for? #️⃣

ISO/IEC 8859 is relevant to developers and localization engineers working with legacy systems, older databases, or software built before widespread Unicode adoption. Understanding the standard helps teams diagnose encoding errors, migrate legacy content to UTF-8, and handle files that still declare an ISO 8859 charset.

☝️ Why does it matter for localization? #️⃣

For most of the 1990s, ISO/IEC 8859 was the dominant encoding framework for software and web content across Europe and the Middle East. Localization work from that era required choosing the correct part for each target language, and mixing parts within the same file was not possible.

This fragmentation is the root cause of many legacy encoding problems teams still encounter today. A file written in ISO 8859-1 cannot also contain Cyrillic or Greek characters. Mismatches between parts, or between ISO 8859 and Unicode systems, produce mojibake that is difficult to trace and fix.

📌 Key points about ISO/IEC 8859 #️⃣

  • The series has 15 active parts (ISO 8859-12 was never published).
  • All parts share the lower 128 positions with ASCII. Only the upper 128 positions differ.
  • The parts are parallel standards, not revisions of each other — ISO 8859-2 is not an update to ISO 8859-1.
  • East Asian languages are not covered, as Chinese, Japanese, and Korean require far more than 256 character positions.
  • ISO 8859-15 was introduced in 1999 as a practical update to ISO 8859-1, adding the euro sign (€) and a few characters the original omitted.
  • The working group responsible for the standard disbanded in 2004. No new parts are being developed.
  • UTF-8 now accounts for around 99% of all web pages and has effectively replaced ISO/IEC 8859 for all modern use.

Common parts at a glance:

Part Coverage
ISO 8859-1 Western European languages
ISO 8859-2 Central and Eastern European languages
ISO 8859-5 Cyrillic (Russian, Bulgarian, Serbian)
ISO 8859-6 Arabic
ISO 8859-7 Greek
ISO 8859-8 Hebrew
ISO 8859-9 Turkish
ISO 8859-15 Western European with euro sign

Note: All parts of ISO/IEC 8859 are published by ISO/IEC JTC 1/SC 2. Most parts were last confirmed in 2020. The standard is no longer actively maintained. Always check the official ISO catalogue for the current status of specific parts.

Curious about software localization beyond the terminology?

⚡ Manage your translations with Localazy! 🌍