Variable: locale-language-names
locale-language-names is a variable defined in mule-cmds.el.gz.
Value
Large value
(("aa_DJ" . "Latin-1")
("aa" . "UTF-8")
("af" . "Latin-1")
("am" "Ethiopic" utf-8)
("an" . "Latin-9")
("arn" . "UTF-8")
("ar" . "Arabic")
("as" . "UTF-8")
("az" . "UTF-8")
("ba" . "UTF-8")
("be" "Belarusian" cp1251)
("bg" "Bulgarian" cp1251)
("bn" "Bengali" utf-8)
("bo" . "Tibetan")
("br" . "Latin-1")
("bs" . "Latin-2")
("byn" . "UTF-8")
("ca" "Catalan" iso-8859-1)
("co" . "UTF-8")
("cs" "Czech" iso-8859-2)
("cy" "Welsh" iso-8859-14)
("da" . "Latin-1")
("de" "German" iso-8859-1)
("dv" . "UTF-8")
("ee" . "Latin-4")
("el" "Greek" iso-8859-7)
("en_IN" "English" utf-8)
("en" "English" iso-8859-1)
("eo" . "Esperanto")
("es" "Spanish" iso-8859-1)
("et" . "Latin-9")
("eu" . "Latin-1")
("fa" "Persian" utf-8)
("fil" . "UTF-8")
("fpo" . "UTF-8")
("fi" . "Latin-9")
("fj" . "Latin-1")
("fo" . "Latin-1")
("fr" "French" iso-8859-1)
("fy" . "Latin-1")
("ga" . "Latin-1")
("gd" . "Latin-9")
("gez" "Ethiopic" utf-8)
("gla" . "Latin-9")
("gl" . "Latin-1")
("gu" "Gujarati" utf-8)
("gv" . "Latin-1")
("he" "Hebrew" iso-8859-8)
("hi" "Devanagari" utf-8)
("hni_IN" . "UTF-8")
("hr" "Croatian" iso-8859-2)
("hu" . "Latin-2")
("hy" . "UTF-8")
("id" . "Latin-1")
("ig" . "UTF-8")
("ibo" . "UTF-8")
("is" . "Latin-1")
("it" "Italian" iso-8859-1)
("iw" "Hebrew" iso-8859-8)
("ja" "Japanese" euc-jp)
("kal" . "Latin-1")
("ka" "Georgian" georgian-ps)
("kk" . "UTF-8")
("kl" . "Latin-1")
("km" "Khmer" utf-8)
("knk" "Devanagari" utf-8)
("kok" "Devanagari" utf-8)
("kn" "Kannada" utf-8)
("ko" "Korean" euc-kr)
("ks" . "UTF-8")
("kw" . "Latin-1")
("ky" . "UTF-8")
("lao" "Lao" utf-8)
("la" . "Latin-1")
("lb" . "Latin-1")
("lg" . "Latin-6")
("lo" "Lao" utf-8)
("lt" "Lithuanian" iso-8859-13)
("lv" "Latvian" iso-8859-13)
("mi" . "Latin-7")
("mk" "Cyrillic-ISO" iso-8859-5)
("mlt" . "Latin-3")
("ml" "Malayalam" utf-8)
("mn" . "UTF-8")
("mri" . "Latin-7")
("mr" "Devanagari" utf-8)
("ms" . "Latin-1")
("mt" . "Latin-3")
("mym" "Malayalam" utf-8)
("my" "Burmese" utf-8)
("nb" . "Latin-1")
("ne" "Devanagari" utf-8)
("nl" "Dutch" iso-8859-1)
("nn" . "Latin-1")
("non" . "Latin-1")
("no" . "Latin-1")
("nr_ZA" . "UTF-8")
("nso_ZA" . "UTF-8")
("oc" . "Latin-1")
("om_ET" . "UTF-8")
("om" . "Latin-1")
("or" "Oriya" utf-8)
("pa" "Punjabi" utf-8)
("pl" "Polish" iso-8859-2)
("ps" . "UTF-8")
("pas" . "UTF-8")
("pt_BR" "Brazilian Portuguese" iso-8859-1)
("pt" . "Latin-1")
("rm" . "Latin-1")
("ro" "Romanian" iso-8859-2)
("ru_RU.koi8r" "Cyrillic-KOI8" koi8-r)
("ru_RU" "Russian" iso-8859-5)
("ru_UA" "Russian" koi8-u)
("rw" . "UTF-8")
("sa" . "Devanagari")
("se" . "UTF-8")
("sh" . "Latin-2")
("si" "Sinhala" utf-8)
("sid" . "UTF-8")
("sk" "Slovak" iso-8859-2)
("sl" "Slovenian" iso-8859-2)
("so_ET" "UTF-8")
("so" "Latin-1")
("sq" . "Latin-2")
("sr" . "Latin-2")
("st" . "Latin-1")
("sv" "Swedish" iso-8859-1)
("sw" . "Latin-1")
("taj" "Tajik" koi8-t)
("ta" "Tamil" utf-8)
("te" "Telugu" utf-8)
("tg" "Tajik" koi8-t)
("th_TH.tis620" "Thai" tis-620)
("th_TH.TIS-620" "Thai" tis-620)
("th_TH" "Thai" iso-8859-11)
("th" "Thai" iso-8859-11)
("ti" "Ethiopic" utf-8)
("tig_ER" . "UTF-8")
("tk" . "Latin-5")
("tuk" . "Latin-5")
("tl" . "Latin-1")
("tn" . "Latin-9")
("tr" "Turkish" iso-8859-9)
("tsn" . "Latin-9")
("ts" . "Latin-1")
("tt" . "UTF-8")
("ug" . "UTF-8")
("uk" "Ukrainian" koi8-u)
("ur" . "UTF-8")
("uz_UZ@cyrillic" . "UTF-8")
("uz" . "Latin-1")
("ve" . "UTF-8")
("vi" "Vietnamese" utf-8)
("wa" . "Latin-1")
("wo" . "UTF-8")
("xh" . "Latin-1")
("yi" . "Windows-1255")
("yo" . "UTF-8")
("zh_HK" . "Chinese-Big5")
("zh_TW" . "Chinese-Big5")
("zh_CN.GB2312" "Chinese-GB")
("zh_CN.GBK" "Chinese-GBK")
("zh_CN.GB18030" "Chinese-GB18030")
("zh_CN.UTF-8" . "Chinese-GBK")
("zh_CN" . "Chinese-GB")
("zhh" . "Chinese-Big5")
("zhi" . "Chinese-GBK")
("zhm" . "Chinese-Big5")
("zh" . "Chinese-GB")
("zu" . "Latin-1")
("c$" . "ASCII")
("posix$" . "ASCII")
("ipa$" . "IPA")
("cz" . "Czech")
("iw" . "Hebrew")
("sp" . "Cyrillic-ISO")
("su" . "Latin-1")
("jp" . "Japanese")
("chs" . "Chinese-GBK")
("cht" . "Chinese-BIG5")
("gbz" . "UTF-8")
("div" . "UTF-8")
("wee" . "Latin-2")
("wen" . "Latin-2")
("ind" . "Latin-1")
("sme" . "UTF-8")
("smf" . "UTF-8")
("smg" . "UTF-8")
("kdi" "Kannada" utf-8)
("mar" "Devanagari" utf-8)
("khm" "Khmer" utf-8)
("iri" . "Latin-1")
("uig" . "UTF-8")
("kin" . "UTF-8"))
Documentation
Alist of locale regexps vs the corresponding languages and coding systems.
Each element has this form:
(LOCALE-REGEXP LANG-ENV CODING-SYSTEM)
The first element whose LOCALE-REGEXP matches the start of a
downcased locale specifies the LANG-ENV (language environment)
and CODING-SYSTEM corresponding to that locale. If there is no
appropriate language environment, the element may have this form:
(LOCALE-REGEXP . LANG-ENV)
In this case, LANG-ENV is one of generic language environments for an
specific encoding such as "Latin-1" and "UTF-8".
Probably introduced at or before Emacs version 21.1.
Source Code
;; Defined in /usr/src/emacs/lisp/international/mule-cmds.el.gz
;; The following definitions might as well be marked as constants and
;; purecopied, since they're normally used on startup, and probably
;; should reflect the facilities of the base Emacs.
(defconst locale-language-names
(purecopy
'(
;; Locale names of the form LANGUAGE[_TERRITORY][.CODESET][@MODIFIER]
;; as specified in the Single Unix Spec, Version 2.
;; LANGUAGE is a language code taken from ISO 639:1988 (E/F)
;; with additions from ISO 639/RA Newsletter No.1/1989;
;; see Internet RFC 2165 (1997-06) and
;; https://www.evertype.com/standards/iso639/iso639-en.html
;; TERRITORY is a country code taken from ISO 3166
;; http://www.din.de/gremien/nas/nabd/iso3166ma/codlstp1/en_listp1.html.
;; CODESET and MODIFIER are implementation-dependent.
;; Language names for which there are no locales (yet) are
;; commented out.
;; jasonr comments: MS Windows uses three letter codes for
;; languages instead of the two letter ISO codes that POSIX
;; uses. In most cases the first two letters are the same, so
;; most of the regexps in locale-language-names work. Japanese,
;; Chinese, and some others are exceptions, which are listed in the
;; non-standard section at the bottom of locale-language-names, or
;; in the main section, if otherwise we would pick up the wrong
;; entry (because the first matching entry is used).
("aa_DJ" . "Latin-1") ; Afar
("aa" . "UTF-8")
;; ab Abkhazian
("af" . "Latin-1") ; Afrikaans
("am" "Ethiopic" utf-8) ; Amharic
("an" . "Latin-9") ; Aragonese
("arn" . "UTF-8") ; MS-Windows Mapudungun, Mapuche
("ar" . "Arabic")
("as" . "UTF-8") ; Assamese
; ay Aymara
("az" . "UTF-8") ; Azerbaijani
("ba" . "UTF-8") ; Bashkir, Cyrillic script
("be" "Belarusian" cp1251) ; Belarusian [Byelorussian until early 1990s]
("bg" "Bulgarian" cp1251) ; Bulgarian
; bh Bihari
; bi Bislama
("bn" "Bengali" utf-8) ; Bengali, Bangla
("bo" . "Tibetan")
("br" . "Latin-1") ; Breton
("bs" . "Latin-2") ; Bosnian
("byn" . "UTF-8") ; Bilin; Blin
("ca" "Catalan" iso-8859-1) ; Catalan
("co" . "UTF-8") ; Corsican
("cs" "Czech" iso-8859-2)
("cy" "Welsh" iso-8859-14)
("da" . "Latin-1") ; Danish
("de" "German" iso-8859-1)
("dv" . "UTF-8") ; Divehi
; dz Bhutani
("ee" . "Latin-4") ; Ewe
("el" "Greek" iso-8859-7)
;; Users who specify "en" explicitly typically want Latin-1, not ASCII.
;; That's actually what the GNU locales define, modulo things like
;; en_IN -- fx.
("en_IN" "English" utf-8) ; glibc uses utf-8 for English in India
("en" "English" iso-8859-1) ; English
("eo" . "Esperanto") ; Esperanto
("es" "Spanish" iso-8859-1)
("et" . "Latin-9") ; Estonian
("eu" . "Latin-1") ; Basque
("fa" "Persian" utf-8) ; Persian
("fil" . "UTF-8") ; Filipino
("fpo" . "UTF-8") ; MS-Windows Filipino
("fi" . "Latin-9") ; Finnish
("fj" . "Latin-1") ; Fiji
("fo" . "Latin-1") ; Faroese
("fr" "French" iso-8859-1) ; French
("fy" . "Latin-1") ; Frisian
("ga" . "Latin-1") ; Irish Gaelic (new orthography)
("gd" . "Latin-9") ; Scots Gaelic
("gez" "Ethiopic" utf-8) ; Geez
("gla" . "Latin-9") ; MS-Windows Scots Gaelic
("gl" . "Latin-1") ; Gallegan; Galician
; gn Guarani
("gu" "Gujarati" utf-8) ; Gujarati
("gv" . "Latin-1") ; Manx Gaelic
; ha Hausa
("he" "Hebrew" iso-8859-8)
("hi" "Devanagari" utf-8) ; Hindi
("hni_IN" . "UTF-8") ; Chhattisgarhi
("hr" "Croatian" iso-8859-2) ; Croatian
("hu" . "Latin-2") ; Hungarian
("hy" . "UTF-8") ; Armenian
; ia Interlingua
("id" . "Latin-1") ; Indonesian
; ie Interlingue
("ig" . "UTF-8") ; Igbo (Nigeria)
("ibo" . "UTF-8") ; MS-Windows Igbo
; ik Inupiak, Inupiaq
("is" . "Latin-1") ; Icelandic
("it" "Italian" iso-8859-1) ; Italian
; iu Inuktitut
("iw" "Hebrew" iso-8859-8)
("ja" "Japanese" euc-jp)
; jw Javanese
("kal" . "Latin-1") ; MS-Windows Greenlandic
("ka" "Georgian" georgian-ps) ; Georgian
("kk" . "UTF-8") ; Kazakh
("kl" . "Latin-1") ; Greenlandic
("km" "Khmer" utf-8) ; Cambodian, Khmer
("knk" "Devanagari" utf-8) ; MS-Windows Konkani
("kok" "Devanagari" utf-8) ; Konkani
("kn" "Kannada" utf-8)
("ko" "Korean" euc-kr)
("ks" . "UTF-8") ; Kashmiri
; ku Kurdish
("kw" . "Latin-1") ; Cornish
("ky" . "UTF-8") ; Kirghiz
("lao" "Lao" utf-8) ; MS-Windows Lao
("la" . "Latin-1") ; Latin
("lb" . "Latin-1") ; Luxemburgish
("lg" . "Latin-6") ; Ganda, a.k.a. Luganda
; ln Lingala
("lo" "Lao" utf-8) ; Laothian
("lt" "Lithuanian" iso-8859-13)
("lv" "Latvian" iso-8859-13) ; Latvian, Lettish
; mg Malagasy
("mi" . "Latin-7") ; Maori
("mk" "Cyrillic-ISO" iso-8859-5) ; Macedonian
("mlt" . "Latin-3") ; MS-Windows Maltese
("ml" "Malayalam" utf-8)
("mn" . "UTF-8") ; Mongolian
; mo Moldavian (retired)
("mri" . "Latin-7") ; MS-Windows Maori
("mr" "Devanagari" utf-8) ; Marathi
("ms" . "Latin-1") ; Malay
("mt" . "Latin-3") ; Maltese
("mym" "Malayalam" utf-8) ; MS-Windows Malayalam
("my" "Burmese" utf-8) ; Burmese
; na Nauru
("nb" . "Latin-1") ; Norwegian
("ne" "Devanagari" utf-8) ; Nepali
("nl" "Dutch" iso-8859-1)
("nn" . "Latin-1") ; Norwegian Nynorsk
("non" . "Latin-1") ; MS-Windows Norwegian Nynorsk
("no" . "Latin-1") ; Norwegian
("nr_ZA" . "UTF-8") ; South Ndebele
("nso_ZA" . "UTF-8") ; Pedi
("oc" . "Latin-1") ; Occitan
("om_ET" . "UTF-8") ; (Afan) Oromo
("om" . "Latin-1") ; (Afan) Oromo
("or" "Oriya" utf-8)
("pa" "Punjabi" utf-8) ; Punjabi
("pl" "Polish" iso-8859-2) ; Polish
("ps" . "UTF-8") ; Pashto, Pushto
("pas" . "UTF-8") ; MS-Windows Pashto
("pt_BR" "Brazilian Portuguese" iso-8859-1) ; Brazilian Portuguese
("pt" . "Latin-1") ; Portuguese
; qu Quechua
("rm" . "Latin-1") ; Rhaeto-Romanic
; rn Kirundi
("ro" "Romanian" iso-8859-2)
("ru_RU.koi8r" "Cyrillic-KOI8" koi8-r)
("ru_RU" "Russian" iso-8859-5)
("ru_UA" "Russian" koi8-u)
("rw" . "UTF-8") ; Kinyarwanda
("sa" . "Devanagari") ; Sanskrit
; sd Sindhi
("se" . "UTF-8") ; Northern Sami
; sg Sangho
("sh" . "Latin-2") ; Serbo-Croatian
("si" "Sinhala" utf-8) ; Sinhalese
("sid" . "UTF-8") ; Sidamo
("sk" "Slovak" iso-8859-2)
("sl" "Slovenian" iso-8859-2)
; sm Samoan
; sn Shona
("so_ET" "UTF-8") ; Somali
("so" "Latin-1") ; Somali
("sq" . "Latin-2") ; Albanian
("sr" . "Latin-2") ; Serbian (Latin alphabet)
; ss Siswati
("st" . "Latin-1") ; Sesotho
; su Sundanese
("sv" "Swedish" iso-8859-1) ; Swedish
("sw" . "Latin-1") ; Swahili
("taj" "Tajik" koi8-t) ; MS-Windows Tajik w/Cyrillic script
("ta" "Tamil" utf-8)
("te" "Telugu" utf-8) ; Telugu
("tg" "Tajik" koi8-t)
("th_TH.tis620" "Thai" tis-620)
("th_TH.TIS-620" "Thai" tis-620)
("th_TH" "Thai" iso-8859-11)
("th" "Thai" iso-8859-11)
("ti" "Ethiopic" utf-8) ; Tigrinya
("tig_ER" . "UTF-8") ; Tigre
("tk" . "Latin-5") ; Turkmen
("tuk" . "Latin-5") ; MS-Windows Turkmen
("tl" . "Latin-1") ; Tagalog
("tn" . "Latin-9") ; Setswana, Tswana
; to Tonga
("tr" "Turkish" iso-8859-9)
("tsn" . "Latin-9") ; MS-Windows Tswana
("ts" . "Latin-1") ; Tsonga
("tt" . "UTF-8") ; Tatar
; tw Twi
("ug" . "UTF-8") ; Uighur
("uk" "Ukrainian" koi8-u)
("ur" . "UTF-8") ; Urdu
("uz_UZ@cyrillic" . "UTF-8"); Uzbek
("uz" . "Latin-1") ; Uzbek
("ve" . "UTF-8") ; Venda
("vi" "Vietnamese" utf-8)
; vo Volapuk
("wa" . "Latin-1") ; Walloon
("wo" . "UTF-8") ; Wolof
("xh" . "Latin-1") ; Xhosa
("yi" . "Windows-1255") ; Yiddish
("yo" . "UTF-8") ; Yoruba
; za Zhuang
("zh_HK" . "Chinese-Big5")
; zh_HK/BIG5-HKSCS \
("zh_TW" . "Chinese-Big5")
("zh_CN.GB2312" "Chinese-GB")
("zh_CN.GBK" "Chinese-GBK")
("zh_CN.GB18030" "Chinese-GB18030")
("zh_CN.UTF-8" . "Chinese-GBK")
("zh_CN" . "Chinese-GB")
("zhh" . "Chinese-Big5") ; MS-Windows Chinese (Hong Kong S.A.R.)
("zhi" . "Chinese-GBK") ; MS-Windows Chinese (Singapore)
("zhm" . "Chinese-Big5") ; MS-Windows Chinese (Macao S.A.R.)
("zh" . "Chinese-GB")
("zu" . "Latin-1") ; Zulu
;; ISO standard locales
("c$" . "ASCII")
("posix$" . "ASCII")
;; The "IPA" Emacs language environment does not correspond
;; to any ISO 639 code, so let it stand for itself.
("ipa$" . "IPA")
;; Nonstandard or obsolete language codes
("cz" . "Czech") ; e.g. Solaris 2.6
("iw" . "Hebrew") ; e.g. X11R6.4
("sp" . "Cyrillic-ISO") ; Serbian (Cyrillic alphabet), e.g. X11R6.4
("su" . "Latin-1") ; Finnish, e.g. Solaris 2.6
("jp" . "Japanese") ; e.g. MS Windows
("chs" . "Chinese-GBK") ; MS Windows Chinese Simplified (PRC)
("cht" . "Chinese-BIG5") ; MS Windows Chinese Traditional (Taiwan)
("gbz" . "UTF-8") ; MS Windows Dari Persian
("div" . "UTF-8") ; MS Windows Divehi (Maldives)
("wee" . "Latin-2") ; MS Windows Lower Sorbian
("wen" . "Latin-2") ; MS Windows Upper Sorbian
("ind" . "Latin-1") ; MS-Windows Indonesian
("sme" . "UTF-8") ; MS-Windows Northern Sami (Norway)
("smf" . "UTF-8") ; MS-Windows Northern Sami (Sweden)
("smg" . "UTF-8") ; MS-Windows Northern Sami (Finland)
("kdi" "Kannada" utf-8) ; MS-Windows Kannada
("mar" "Devanagari" utf-8) ; MS-Windows Marathi
("khm" "Khmer" utf-8) ; MS-Windows Khmer
("iri" . "Latin-1") ; MS-Windows Irish Gaelic
; mwk MS-Windows Mohawk (Canada)
("uig" . "UTF-8") ; MS-Windows Uighur
("kin" . "UTF-8") ; MS-Windows Kinyarwanda
))
"Alist of locale regexps vs the corresponding languages and coding systems.
Each element has this form:
(LOCALE-REGEXP LANG-ENV CODING-SYSTEM)
The first element whose LOCALE-REGEXP matches the start of a
downcased locale specifies the LANG-ENV \(language environment)
and CODING-SYSTEM corresponding to that locale. If there is no
appropriate language environment, the element may have this form:
(LOCALE-REGEXP . LANG-ENV)
In this case, LANG-ENV is one of generic language environments for an
specific encoding such as \"Latin-1\" and \"UTF-8\".")