Variable: `locale-language-names`

locale-language-names is a variable defined in mule-cmds.el.gz.

Value

Large value

(("aa_DJ" . "Latin-1") ("aa" . "UTF-8") ("af" . "Latin-1")
 ("am" "Ethiopic" utf-8) ("an" . "Latin-9") ("arn" . "UTF-8")
 ("ar" . "Arabic") ("as" . "UTF-8") ("az" . "UTF-8") ("ba" . "UTF-8")
 ("be" "Belarusian" cp1251) ("bg" "Bulgarian" cp1251)
 ("bn" "Bengali" utf-8) ("bo" . "Tibetan") ("br" . "Latin-1")
 ("bs" . "Latin-2") ("byn" . "UTF-8") ("ca" "Catalan" iso-8859-1)
 ("co" . "UTF-8") ("cs" "Czech" iso-8859-2) ("cy" "Welsh" iso-8859-14)
 ("da" . "Latin-1") ("de" "German" iso-8859-1) ("dv" . "UTF-8")
 ("ee" . "Latin-4") ("el" "Greek" iso-8859-7)
 ("en_IN" "English" utf-8) ("en" "English" iso-8859-1)
 ("eo" . "Esperanto") ("es" "Spanish" iso-8859-1) ("et" . "Latin-9")
 ("eu" . "Latin-1") ("fa" "Persian" utf-8) ("fil" . "UTF-8")
 ("fpo" . "UTF-8") ("fi" . "Latin-9") ("fj" . "Latin-1")
 ("fo" . "Latin-1") ("fr" "French" iso-8859-1) ("fy" . "Latin-1")
 ("ga" . "Latin-1") ("gd" . "Latin-9") ("gez" "Ethiopic" utf-8)
 ("gla" . "Latin-9") ("gl" . "Latin-1") ("gu" "Gujarati" utf-8)
 ("gv" . "Latin-1") ("he" "Hebrew" iso-8859-8)
 ("hi" "Devanagari" utf-8) ("hni_IN" . "UTF-8")
 ("hr" "Croatian" iso-8859-2) ("hu" . "Latin-2") ("hy" . "UTF-8")
 ("id" . "Latin-1") ("ig" . "UTF-8") ("ibo" . "UTF-8")
 ("is" . "Latin-1") ("it" "Italian" iso-8859-1)
 ("iw" "Hebrew" iso-8859-8) ("ja" "Japanese" euc-jp)
 ("kal" . "Latin-1") ("ka" "Georgian" georgian-ps) ("kk" . "UTF-8")
 ("kl" . "Latin-1") ("km" "Khmer" utf-8) ("knk" "Devanagari" utf-8)
 ("kok" "Devanagari" utf-8) ("kn" "Kannada" utf-8)
 ("ko" "Korean" euc-kr) ("ks" . "UTF-8") ("kw" . "Latin-1")
 ("ky" . "UTF-8") ("lao" "Lao" utf-8) ("la" . "Latin-1")
 ("lb" . "Latin-1") ("lg" . "Latin-6") ("lo" "Lao" utf-8)
 ("lt" "Lithuanian" iso-8859-13) ("lv" "Latvian" iso-8859-13)
 ("mi" . "Latin-7") ("mk" "Cyrillic-ISO" iso-8859-5)
 ("mlt" . "Latin-3") ("ml" "Malayalam" utf-8) ("mn" . "UTF-8")
 ("mri" . "Latin-7") ("mr" "Devanagari" utf-8) ("ms" . "Latin-1")
 ("mt" . "Latin-3") ("mym" "Malayalam" utf-8) ("my" "Burmese" utf-8)
 ("nb" . "Latin-1") ("ne" "Devanagari" utf-8)
 ("nl" "Dutch" iso-8859-1) ("nn" . "Latin-1") ("non" . "Latin-1")
 ("no" . "Latin-1") ("nr_ZA" . "UTF-8") ("nso_ZA" . "UTF-8")
 ("oc" . "Latin-1") ("om_ET" . "UTF-8") ("om" . "Latin-1")
 ("or" "Oriya" utf-8) ("pa" "Punjabi" utf-8)
 ("pl" "Polish" iso-8859-2) ("ps" . "UTF-8") ("pas" . "UTF-8")
 ("pt_BR" "Brazilian Portuguese" iso-8859-1) ("pt" . "Latin-1")
 ("rm" . "Latin-1") ("ro" "Romanian" iso-8859-2)
 ("ru_RU.koi8r" "Cyrillic-KOI8" koi8-r) ("ru_RU" "Russian" iso-8859-5)
 ("ru_UA" "Russian" koi8-u) ("rw" . "UTF-8") ("sa" . "Devanagari")
 ("se" . "UTF-8") ("sh" . "Latin-2") ("si" "Sinhala" utf-8)
 ("sid" . "UTF-8") ("sk" "Slovak" iso-8859-2)
 ("sl" "Slovenian" iso-8859-2) ("so_ET" "UTF-8") ("so" "Latin-1")
 ("sq" . "Latin-2") ("sr" . "Latin-2") ("st" . "Latin-1")
 ("sv" "Swedish" iso-8859-1) ("sw" . "Latin-1") ("taj" "Tajik" koi8-t)
 ("ta" "Tamil" utf-8) ("te" "Telugu" utf-8) ("tg" "Tajik" koi8-t)
 ("th_TH.tis620" "Thai" tis-620) ("th_TH.TIS-620" "Thai" tis-620)
 ("th_TH" "Thai" iso-8859-11) ("th" "Thai" iso-8859-11)
 ("ti" "Ethiopic" utf-8) ("tig_ER" . "UTF-8") ("tk" . "Latin-5")
 ("tuk" . "Latin-5") ("tl" . "Latin-1") ("tn" . "Latin-9")
 ("tr" "Turkish" iso-8859-9) ("tsn" . "Latin-9") ("ts" . "Latin-1")
 ("tt" . "UTF-8") ("ug" . "UTF-8") ("uk" "Ukrainian" koi8-u)
 ("ur" . "UTF-8") ("uz_UZ@cyrillic" . "UTF-8") ("uz" . "Latin-1")
 ("ve" . "UTF-8") ("vi" "Vietnamese" utf-8) ("wa" . "Latin-1")
 ("wo" . "UTF-8") ("xh" . "Latin-1") ("yi" . "Windows-1255")
 ("yo" . "UTF-8") ("zh_HK" . "Chinese-Big5")
 ("zh_TW" . "Chinese-Big5") ("zh_CN.GB2312" "Chinese-GB")
 ("zh_CN.GBK" "Chinese-GBK") ("zh_CN.GB18030" "Chinese-GB18030")
 ("zh_CN.UTF-8" . "Chinese-GBK") ("zh_CN" . "Chinese-GB")
 ("zhh" . "Chinese-Big5") ("zhi" . "Chinese-GBK")
 ("zhm" . "Chinese-Big5") ("zh" . "Chinese-GB") ("zu" . "Latin-1")
 ("c$" . "ASCII") ("posix$" . "ASCII") ("ipa$" . "IPA")
 ("cz" . "Czech") ("iw" . "Hebrew") ("sp" . "Cyrillic-ISO")
 ("su" . "Latin-1") ("jp" . "Japanese") ("chs" . "Chinese-GBK")
 ("cht" . "Chinese-BIG5") ("gbz" . "UTF-8") ("div" . "UTF-8")
 ("wee" . "Latin-2") ("wen" . "Latin-2") ("ind" . "Latin-1")
 ("sme" . "UTF-8") ("smf" . "UTF-8") ("smg" . "UTF-8")
 ("kdi" "Kannada" utf-8) ("mar" "Devanagari" utf-8)
 ("khm" "Khmer" utf-8) ("iri" . "Latin-1") ("uig" . "UTF-8")
 ("kin" . "UTF-8"))

Documentation

Alist of locale regexps vs the corresponding languages and coding systems.

Each element has this form:
(LOCALE-REGEXP LANG-ENV CODING-SYSTEM)
The first element whose LOCALE-REGEXP matches the start of a downcased locale specifies the LANG-ENV (language environment) and CODING-SYSTEM corresponding to that locale. If there is no appropriate language environment, the element may have this form:
(LOCALE-REGEXP . LANG-ENV)
In this case, LANG-ENV is one of generic language environments for an specific encoding such as "Latin-1" and "UTF-8".

View in manual

Probably introduced at or before Emacs version 21.1.

Source Code

;; Defined in /usr/src/emacs/lisp/international/mule-cmds.el.gz
(defconst locale-language-names
  '(
     ;; Locale names of the form LANGUAGE[_TERRITORY][.CODESET][@MODIFIER]
     ;; as specified in the Single Unix Spec, Version 2.
     ;; LANGUAGE is a language code taken from ISO 639:1988 (E/F)
     ;; with additions from ISO 639/RA Newsletter No.1/1989;
     ;; see Internet RFC 2165 (1997-06) and
     ;; https://www.evertype.com/standards/iso639/iso639-en.html
     ;; TERRITORY is a country code taken from ISO 3166
     ;; http://www.din.de/gremien/nas/nabd/iso3166ma/codlstp1/en_listp1.html.
     ;; CODESET and MODIFIER are implementation-dependent.

     ;; Language names for which there are no locales (yet) are
     ;; commented out.

     ;; jasonr comments: MS Windows uses three letter codes for
     ;; languages instead of the two letter ISO codes that POSIX
     ;; uses.  In most cases the first two letters are the same, so
     ;; most of the regexps in locale-language-names work.  Japanese,
     ;; Chinese, and some others are exceptions, which are listed in the
     ;; non-standard section at the bottom of locale-language-names, or
     ;; in the main section, if otherwise we would pick up the wrong
     ;; entry (because the first matching entry is used).

    ("aa_DJ" . "Latin-1") ; Afar
    ("aa" . "UTF-8")
    ;; ab Abkhazian
    ("af" . "Latin-1") ; Afrikaans
    ("am" "Ethiopic" utf-8) ; Amharic
    ("an" . "Latin-9") ; Aragonese
    ("arn" . "UTF-8") ; MS-Windows Mapudungun, Mapuche
    ("ar" . "Arabic")
    ("as" . "UTF-8") ; Assamese
    ; ay Aymara
    ("az" . "UTF-8") ; Azerbaijani
    ("ba" . "UTF-8") ; Bashkir, Cyrillic script
    ("be" "Belarusian" cp1251) ; Belarusian [Byelorussian until early 1990s]
    ("bg" "Bulgarian" cp1251) ; Bulgarian
    ; bh Bihari
    ; bi Bislama
    ("bn" "Bengali" utf-8) ; Bengali, Bangla
    ("bo" . "Tibetan")
    ("br" . "Latin-1") ; Breton
    ("bs" . "Latin-2") ; Bosnian
    ("byn" . "UTF-8")  ; Bilin; Blin
    ("ca" "Catalan" iso-8859-1) ; Catalan
    ("co" . "UTF-8") ; Corsican
    ("cs" "Czech" iso-8859-2)
    ("cy" "Welsh" iso-8859-14)
    ("da" . "Latin-1") ; Danish
    ("de" "German" iso-8859-1)
    ("dv" . "UTF-8") ; Divehi
    ; dz Bhutani
    ("ee" . "Latin-4") ; Ewe
    ("el" "Greek" iso-8859-7)
    ;; Users who specify "en" explicitly typically want Latin-1, not ASCII.
    ;; That's actually what the GNU locales define, modulo things like
    ;; en_IN -- fx.
    ("en_IN" "English" utf-8) ; glibc uses utf-8 for English in India
    ("en" "English" iso-8859-1) ; English
    ("eo" . "Esperanto") ; Esperanto
    ("es" "Spanish" iso-8859-1)
    ("et" . "Latin-9") ; Estonian
    ("eu" . "Latin-1") ; Basque
    ("fa" "Persian" utf-8) ; Persian
    ("fil" . "UTF-8") ; Filipino
    ("fpo" . "UTF-8") ; MS-Windows Filipino
    ("fi" . "Latin-9") ; Finnish
    ("fj" . "Latin-1") ; Fiji
    ("fo" . "Latin-1") ; Faroese
    ("fr" "French" iso-8859-1) ; French
    ("fy" . "Latin-1") ; Frisian
    ("ga" . "Latin-1") ; Irish Gaelic (new orthography)
    ("gd" . "Latin-9") ; Scots Gaelic
    ("gez" "Ethiopic" utf-8) ; Geez
    ("gla" . "Latin-9") ; MS-Windows Scots Gaelic
    ("gl" . "Latin-1") ; Gallegan; Galician
    ; gn Guarani
    ("gu" "Gujarati" utf-8) ; Gujarati
    ("gv" . "Latin-1") ; Manx Gaelic
    ; ha Hausa
    ("he" "Hebrew" iso-8859-8)
    ("hi" "Devanagari" utf-8) ; Hindi
    ("hni_IN" . "UTF-8") ; Chhattisgarhi
    ("hr" "Croatian" iso-8859-2) ; Croatian
    ("hu" . "Latin-2") ; Hungarian
    ("hy" . "UTF-8") ;  Armenian
    ; ia Interlingua
    ("id" . "Latin-1") ; Indonesian
    ; ie Interlingue
    ("ig" . "UTF-8") ; Igbo (Nigeria)
    ("ibo" . "UTF-8") ; MS-Windows Igbo
    ; ik Inupiak, Inupiaq
    ("is" . "Latin-1") ; Icelandic
    ("it" "Italian" iso-8859-1) ; Italian
    ; iu Inuktitut
    ("iw" "Hebrew" iso-8859-8)
    ("ja" "Japanese" euc-jp)
    ; jw Javanese
    ("kal" . "Latin-1") ; MS-Windows Greenlandic
    ("ka" "Georgian" georgian-ps) ; Georgian
    ("kk" . "UTF-8") ; Kazakh
    ("kl" . "Latin-1") ; Greenlandic
    ("km" "Khmer" utf-8) ; Cambodian, Khmer
    ("knk" "Devanagari" utf-8) ; MS-Windows Konkani
    ("kok" "Devanagari" utf-8) ; Konkani
    ("kn" "Kannada" utf-8)
    ("ko" "Korean" euc-kr)
    ("ks" . "UTF-8") ; Kashmiri
    ; ku Kurdish
    ("kw" . "Latin-1") ; Cornish
    ("ky" . "UTF-8") ; Kirghiz
    ("lao" "Lao" utf-8) ; MS-Windows Lao
    ("la" . "Latin-1") ; Latin
    ("lb" . "Latin-1") ; Luxemburgish
    ("lg" . "Latin-6") ; Ganda, a.k.a. Luganda
    ; ln Lingala
    ("lo" "Lao" utf-8) ; Laothian
    ("lt" "Lithuanian" iso-8859-13)
    ("lv" "Latvian" iso-8859-13) ; Latvian, Lettish
    ; mg Malagasy
    ("mi" . "Latin-7") ; Maori
    ("mk" "Cyrillic-ISO" iso-8859-5) ; Macedonian
    ("mlt" . "Latin-3") ; MS-Windows Maltese
    ("ml" "Malayalam" utf-8)
    ("mn" . "UTF-8") ; Mongolian
    ; mo Moldavian (retired)
    ("mri" . "Latin-7") ; MS-Windows Maori
    ("mr" "Devanagari" utf-8) ; Marathi
    ("ms" . "Latin-1") ; Malay
    ("mt" . "Latin-3") ; Maltese
    ("mym" "Malayalam" utf-8) ; MS-Windows Malayalam
    ("my" "Burmese" utf-8) ; Burmese
    ; na Nauru
    ("nb" . "Latin-1") ; Norwegian
    ("ne" "Devanagari" utf-8) ; Nepali
    ("nl" "Dutch" iso-8859-1)
    ("nn" . "Latin-1") ; Norwegian Nynorsk
    ("non" . "Latin-1") ; MS-Windows Norwegian Nynorsk
    ("no" . "Latin-1") ; Norwegian
    ("nr_ZA" . "UTF-8") ; South Ndebele
    ("nso_ZA" . "UTF-8") ; Pedi
    ("oc" . "Latin-1") ; Occitan
    ("om_ET" . "UTF-8") ; (Afan) Oromo
    ("om" . "Latin-1") ; (Afan) Oromo
    ("or" "Oriya" utf-8)
    ("pa" "Punjabi" utf-8) ; Punjabi
    ("pl" "Polish" iso-8859-2) ; Polish
    ("ps" . "UTF-8") ; Pashto, Pushto
    ("pas" . "UTF-8") ; MS-Windows Pashto
    ("pt_BR" "Brazilian Portuguese" iso-8859-1) ; Brazilian Portuguese
    ("pt" . "Latin-1") ; Portuguese
    ; qu Quechua
    ("rm" . "Latin-1") ; Rhaeto-Romanic
    ; rn Kirundi
    ("ro" "Romanian" iso-8859-2)
    ("ru_RU.koi8r" "Cyrillic-KOI8" koi8-r)
    ("ru_RU" "Russian" iso-8859-5)
    ("ru_UA" "Russian" koi8-u)
    ("rw" . "UTF-8") ; Kinyarwanda
    ("sa" . "Devanagari") ; Sanskrit
    ; sd Sindhi
    ("se" . "UTF-8") ; Northern Sami
    ; sg Sangho
    ("sh" . "Latin-2") ; Serbo-Croatian
    ("si" "Sinhala" utf-8) ; Sinhalese
    ("sid" . "UTF-8") ; Sidamo
    ("sk" "Slovak" iso-8859-2)
    ("sl" "Slovenian" iso-8859-2)
    ; sm Samoan
    ; sn Shona
    ("so_ET" "UTF-8") ; Somali
    ("so" "Latin-1") ; Somali
    ("sq" . "Latin-2") ; Albanian
    ("sr" . "Latin-2") ; Serbian (Latin alphabet)
    ; ss Siswati
    ("st" . "Latin-1") ;  Sesotho
    ; su Sundanese
    ("sv" "Swedish" iso-8859-1)		; Swedish
    ("sw" . "Latin-1") ; Swahili
    ("taj" "Tajik" koi8-t) ; MS-Windows Tajik w/Cyrillic script
    ("ta" "Tamil" utf-8)
    ("te" "Telugu" utf-8) ; Telugu
    ("tg" "Tajik" koi8-t)
    ("th_TH.tis620" "Thai" tis-620)
    ("th_TH.TIS-620" "Thai" tis-620)
    ("th_TH" "Thai" iso-8859-11)
    ("th" "Thai" iso-8859-11)
    ("ti" "Ethiopic" utf-8) ; Tigrinya
    ("tig_ER" . "UTF-8") ; Tigre
    ("tk" . "Latin-5") ; Turkmen
    ("tuk" . "Latin-5") ; MS-Windows Turkmen
    ("tl" . "Latin-1") ; Tagalog
    ("tn" . "Latin-9") ; Setswana, Tswana
    ; to Tonga
    ("tr" "Turkish" iso-8859-9)
    ("tsn" . "Latin-9") ; MS-Windows Tswana
    ("ts" . "Latin-1") ; Tsonga
    ("tt" . "UTF-8") ; Tatar
    ; tw Twi
    ("ug" . "UTF-8") ; Uighur
    ("uk" "Ukrainian" koi8-u)
    ("ur" . "UTF-8") ; Urdu
    ("uz_UZ@cyrillic" . "UTF-8"); Uzbek
    ("uz" . "Latin-1") ; Uzbek
    ("ve" . "UTF-8") ; Venda
    ("vi" "Vietnamese" utf-8)
    ; vo Volapuk
    ("wa" . "Latin-1") ; Walloon
    ("wo" . "UTF-8") ; Wolof
    ("xh" . "Latin-1") ; Xhosa
    ("yi" . "Windows-1255") ; Yiddish
    ("yo" . "UTF-8") ; Yoruba
    ; za Zhuang
    ("zh_HK" . "Chinese-Big5")
    ; zh_HK/BIG5-HKSCS \
    ("zh_TW" . "Chinese-Big5")
    ("zh_CN.GB2312" "Chinese-GB")
    ("zh_CN.GBK" "Chinese-GBK")
    ("zh_CN.GB18030" "Chinese-GB18030")
    ("zh_CN.UTF-8" . "Chinese-GBK")
    ("zh_CN" . "Chinese-GB")
    ("zhh" . "Chinese-Big5") ; MS-Windows Chinese (Hong Kong S.A.R.)
    ("zhi" . "Chinese-GBK") ; MS-Windows Chinese (Singapore)
    ("zhm" . "Chinese-Big5") ; MS-Windows Chinese (Macao S.A.R.)
    ("zh" . "Chinese-GB")
    ("zu" . "Latin-1") ; Zulu

    ;; ISO standard locales
    ("c$" . "ASCII")
    ("posix$" . "ASCII")

    ;; The "IPA" Emacs language environment does not correspond
    ;; to any ISO 639 code, so let it stand for itself.
    ("ipa$" . "IPA")

    ;; Nonstandard or obsolete language codes
    ("cz" . "Czech") ; e.g. Solaris 2.6
    ("iw" . "Hebrew") ; e.g. X11R6.4
    ("sp" . "Cyrillic-ISO") ; Serbian (Cyrillic alphabet), e.g. X11R6.4
    ("su" . "Latin-1") ; Finnish, e.g. Solaris 2.6
    ("jp" . "Japanese") ; e.g. MS Windows
    ("chs" . "Chinese-GBK") ; MS Windows Chinese Simplified (PRC)
    ("cht" . "Chinese-BIG5") ; MS Windows Chinese Traditional (Taiwan)
    ("gbz" . "UTF-8") ; MS Windows Dari Persian
    ("div" . "UTF-8") ; MS Windows Divehi (Maldives)
    ("wee" . "Latin-2") ; MS Windows Lower Sorbian
    ("wen" . "Latin-2") ; MS Windows Upper Sorbian
    ("ind" . "Latin-1") ; MS-Windows Indonesian
    ("sme" . "UTF-8") ; MS-Windows Northern Sami (Norway)
    ("smf" . "UTF-8") ; MS-Windows Northern Sami (Sweden)
    ("smg" . "UTF-8") ; MS-Windows Northern Sami (Finland)
    ("kdi" "Kannada" utf-8) ; MS-Windows Kannada
    ("mar" "Devanagari" utf-8) ; MS-Windows Marathi
    ("khm" "Khmer" utf-8) ; MS-Windows Khmer
    ("iri" . "Latin-1") ; MS-Windows Irish Gaelic
    ; mwk  MS-Windows Mohawk (Canada)
    ("uig" . "UTF-8") ; MS-Windows Uighur
    ("kin" . "UTF-8") ;  MS-Windows Kinyarwanda
    )
  "Alist of locale regexps vs the corresponding languages and coding systems.
Each element has this form:
  (LOCALE-REGEXP LANG-ENV CODING-SYSTEM)
The first element whose LOCALE-REGEXP matches the start of a
downcased locale specifies the LANG-ENV \(language environment)
and CODING-SYSTEM corresponding to that locale.  If there is no
appropriate language environment, the element may have this form:
  (LOCALE-REGEXP . LANG-ENV)
In this case, LANG-ENV is one of generic language environments for an
specific encoding such as \"Latin-1\" and \"UTF-8\".")

Variable: locale-language-names

Value

Documentation

Source Code

Variable: `locale-language-names`