ISO/IEC 8859-14 – Wikipedia

ISO/IEC 8859-14
MIME / IANA	ISO-8859-14
Alias(es)	iso-ir-199, latin8, iso-celtic, l8
Language(s)	Irish, Manx, Scottish Gaelic, Welsh, Cornish, Breton, English
Standard	ISO/IEC 8859-14:1998
Classification	ISO/IEC 8859 (Extended ASCII, ISO/IEC 4873 level 1)
Extends	US-ASCII
Based on	ISO-IR-182

8-bit character set

ISO/IEC 8859-14:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic), is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1998. It is informally referred to as Latin-8 or Celtic. It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton.

ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604 a.k.a. Windows-28604 to ISO-8859-14.^[2]

Table of Contents

History[edit]

ISO-8859-14 was originally proposed for the Sami languages.^[3]ISO 8859-12 was proposed for Celtic.^[4] Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to ISO 8859-15,^[5] but it got rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197.^[6]

The original proposal used a different arrangement of points 0xA1–BF.^[4] At the committee draft stage of the specification, a dotless i was included at 0xAE,^[7] which was changed to a registered trademark sign (matching ISO-8859-1) in the final publication.

ISO-IR-182, an earlier (registered in 1994) modification of ISO-8859-1, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for Welsh language use.^[8] The final published version of ISO-8859-14 includes these letters in the same positions which they appear at in ISO-IR-182.

Codepage layout[edit]

Differences from ISO-8859-1 have the Unicode code point number below the character.

Draft layout[edit]

The first draft had positions A0-BF different. It did not include the pilcrow sign, but included the cent sign instead at its Latin-1 position. Later, it was ruled that the pilcrow sign was more common, so the pilcrow sign remains at its Latin-1 position, and the cent sign was removed instead.

Differences from ISO-8859-14 have the Unicode code point below them.

References[edit]

^ Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
^ “SheetJS/js-codepage”. GitHub. 12 October 2021.
^ Everson, Michael. “Proposed ISO 8859-14 (later 15)”.
^ ^a ^b ^c Everson, Michael. “Proposed ISO 8859-12 (later 14)”.
^ Everson, Michael (1996-06-19). Proposal for a new part of ISO/IEC 8859: Latin alphabet No. 9 (Sámi).
^ Swedish Institute for Standards (1997-01-24). ISO-IR-197: Sami supplementary Latin set (PDF). ITSCJ/IPSJ. Archived from the original (PDF) on 2022-03-10.
^ Everson, Michael (1997-05-05). “ISO/IEC CD 8859-14:1997 — Latin alphabet No. 8 (Celtic)” (Committee Draft).
^ British Standards Institution (1994-03-16). ISO-IR-182: Welsh variant of Latin Alphabet No. 1 (right-hand part) (PDF). ITSCJ/IPSJ. Archived from the original (PDF) on 2022-03-10.
^ Kuhn, Markus; Whistler, Ken (1999-07-27). “ISO/IEC 8859-14:1998 to Unicode”. 8859 to Unicode mapping tables. Unicode, Inc.
^ International Components for Unicode (ICU), iso-8859_14-1998.ucm, 1999-07-27

External links[edit]

[1] Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12

[2] “SheetJS/js-codepage”. GitHub. 12 October 2021.

[3] Everson, Michael. “Proposed ISO 8859-14 (later 15)”.

[eversonold-4] Everson, Michael. “Proposed ISO 8859-12 (later 14)”.

[5] Everson, Michael (1996-06-19). Proposal for a new part of ISO/IEC 8859: Latin alphabet No. 9 (Sámi).

[6] Swedish Institute for Standards (1997-01-24). ISO-IR-197: Sami supplementary Latin set (PDF). ITSCJ/IPSJ. Archived from the original (PDF) on 2022-03-10.

[7] Everson, Michael (1997-05-05). “ISO/IEC CD 8859-14:1997 — Latin alphabet No. 8 (Celtic)” (Committee Draft).

[8] British Standards Institution (1994-03-16). ISO-IR-182: Welsh variant of Latin Alphabet No. 1 (right-hand part) (PDF). ITSCJ/IPSJ. Archived from the original (PDF) on 2022-03-10.

[9] Kuhn, Markus; Whistler, Ken (1999-07-27). “ISO/IEC 8859-14:1998 to Unicode”. 8859 to Unicode mapping tables. Unicode, Inc.

[10] International Components for Unicode (ICU), iso-8859_14-1998.ucm, 1999-07-27

ISO/IEC 8859-14 – Wikipedia

History[edit]

Codepage layout[edit]

Draft layout[edit]

References[edit]

External links[edit]

Recent Posts

Recent Comments

Archives

Categories

Meta