ISO 639-2
This article relies largely or entirely on a single source. (August 2020) |
ISO 639-2:1998, Codes for the representation of names of languages — Part 2: Alpha-3 code, is the second part of the ISO 639 standard, which lists codes for the representation of the names of languages. The three-letter codes given for each language in this part of the standard are referred to as "Alpha-3" codes. There are 487 entries in the list of ISO 639-2 codes.
The US Library of Congress is the registration authority for ISO 639-2 (referred to as ISO 639-2/RA). As registration authority, the LOC receives and reviews proposed changes; they also have representation on the ISO 639-RA Joint Advisory Committee responsible for maintaining the ISO 639 code tables.
Find a language |
---|
Enter an ISO 639-2 code to find the corresponding language article. |
History and relationship to other ISO 639 standards
[edit]Work was begun on the ISO 639-2 standard in 1989, because the ISO 639-1 standard, which uses only two-letter codes for languages, is not able to accommodate a sufficient number of languages. The ISO 639-2 standard was first released in 1998.
In practice, ISO 639-2 has largely been superseded by ISO 639-3 (2007), which includes codes for all the individual languages in ISO 639-2 plus many more. It also includes the special and reserved codes, and is designed not to conflict with ISO 639-2. ISO 639-3, however, does not include any of the collective languages in ISO 639-2; most of these are included in ISO 639-5.
B and T codes
[edit]While most languages are given one code by the standard, twenty of the languages described have two three-letter codes, a "bibliographic" code (ISO 639-2/B), which is derived from the English name for the language and was a necessary legacy feature, and a "terminological" code (ISO 639-2/T), which is derived from the native name for the language and resembles the language's two-letter code in ISO 639-1. There were originally 22 B codes; scc and scr are now deprecated.
In general the T codes are favored; ISO 639-3 uses ISO 639-2/T.
Scopes and types
[edit]The codes in ISO 639-2 have a variety of "scopes of denotation", or types of meaning and use, some of which are described in more detail below.
- Individual languages
- Macrolanguages (see ISO 639 macrolanguage)
- Collections of languages
- Dialects
- Reserved for local use
- Special situations
For a definition of macrolanguages and collective languages, see ISO 639-3/RA: Scope of denotation for language identifiers.
Individual languages are further classified as to type:
- Living languages
- Extinct languages
- Ancient languages
- Historic languages
- Constructed languages
Collections of languages
[edit]Some ISO 639-2 codes that are commonly used for languages do not precisely represent a particular language or some related languages (as the above macrolanguages). They are regarded as collective language codes and are excluded from ISO 639-3.
The collective language codes in ISO 639-2 are listed below. Some language groups are noted to be remainder groups, that is excluding languages with their own codes, while other are not. Remainder groups are afa
, alg
, art
, ath
, bat
, ber
, bnt
, cai
, cau
, cel
, crp
, cus
, dra
, fiu
, gem
, inc
, ine
, ira
, khi
, kro
, map
, mis
, mkh
, mun
, nai
, nic
, paa
, roa
, sai
, sem
, sio
, sit
, sla
, ssa
, tai
and tut
, while inclusive groups are apa
, arn
, arw
, aus
, bad
, bai
, bih
, cad
, car
, chb
, cmc
, cpe
, cpf
, cpp
, dua
, hmn
, iro
, mno
, mul
, myn
, nub
, oto
, phi
, sgn
, wak
, wen
, ypk
and znd
.[1]
The following code is identified as a collective code in ISO 639-2 but is (at present) missing from ISO 639-5:
- him Himachali
Codes registered for 639-2 that are listed as collective codes in ISO 639-5 (and collective codes by name in ISO 639-2):
- afa Afro-Asiatic languages
- alg Algonquian languages
- apa Apache languages
- art artificial languages
- ath Athapascan languages
- aus Australian languages
- bad Banda languages
- bai Bamileke languages
- bat Baltic languages
- ber Berber languages
- bih Bihari languages
- bnt Bantu languages
- btk Batak languages
- cai Central American Indian languages
- cau Caucasian languages
- cel Celtic languages
- cmc Chamic languages
- cpe creoles and pidgins, English-based
- cpf creoles and pidgins, French-based
- cpp creoles and pidgins, Portuguese-based
- crp creoles and pidgins
- cus Cushitic languages
- day Land Dayak languages
- dra Dravidian languages
- fiu Finno-Ugrian languages
- gem Germanic languages
- ijo Ijo languages
- inc Indic languages
- ine Indo-European languages
- ira Iranian languages
- iro Iroquoian languages
- kar Karen languages
- khi Khoisan languages
- kro Kru languages
- map Austronesian languages
- mkh Mon–Khmer languages
- mno Manobo languages
- mun Munda languages
- myn Mayan languages
- nah Nahuatl languages
- nai North American Indian languages
- nic Niger-Kordofanian languages
- nub Nubian languages
- oto Otomian languages
- paa Papuan languages
- phi Philippine languages
- pra Prakrit languages
- roa Romance languages
- sai South American Indian languages
- sal Salishan languages
- sem Semitic languages
- sgn sign languages
- sio Siouan languages
- sit Sino-Tibetan languages
- sla Slavic languages
- smi Sami languages
- son Songhai languages
- ssa Nilo-Saharan languages
- tai Tai languages
- tup Tupi languages
- tut Altaic languages
- wak Wakashan languages
- wen Sorbian languages
- ypk Yupik languages
- znd Zande languages
Reserved for local use
[edit]The interval from qaa to qtz is "reserved for local use" and is not used in ISO 639-2 nor in ISO 639-3. These codes are typically used privately for languages not (yet) in either standard. Microsoft Windows uses the qps language code for pseudo-locales generated automatically from English strings, designed for testing software localization.[2]
Special situations
[edit]There are four generic codes for special situations:
- mis is listed as "uncoded languages" (originally an abbreviation for "miscellaneous")
- mul (for "multiple languages") is applied when several languages are used and it is not practical to specify all the appropriate language codes
- und (for "undetermined") is used in situations in which a language or languages must be indicated but the language cannot be identified.
- zxx is listed in the code list as "no linguistic content", e.g. animal sounds (code added on 11 January 2006)
These four codes are also used in ISO 639-3.
See also
[edit]References
[edit]- ^ "ISO 639-2 Language Code List - Codes for the representation of names of languages". Library of Congress.
- ^ "Pseudo-Locales - Win32 apps". Microsoft Learn. 7 January 2021. Retrieved 31 August 2023.
External links
[edit]- Official website
- ISO 639-2/RA Change Notice at the Library of Congress
- ISO 639-2:1998: Codes for the representation of names of languages—Part 2: Alpha-3 code at the International Organization for Standardization