Language support for Language Detection
Use this article to learn which natural languages that language detection supports.
The Language Detection feature can detect a wide range of languages, variants, dialects, and some regional/cultural languages, and return detected languages with their name and code. The returned language code parameters conform to BCP-47 standard with most of them conforming to ISO-639-1 identifiers.
If you have content expressed in a less frequently used language, you can try Language Detection to see if it returns a code. The response for languages that can't be detected is unknown
.
Languages supported by Language Detection
Language | Language Code |
---|---|
Afrikaans | af |
Albanian | sq |
Amharic | am |
Arabic | ar |
Armenian | hy |
Assamese | as |
Azerbaijani | az |
Bashkir | ba |
Basque | eu |
Belarusian | be |
Bengali | bn |
Bosnian | bs |
Bulgarian | bg |
Burmese | my |
Catalan | ca |
Central Khmer | km |
Chinese | zh |
Chinese Simplified | zh_chs |
Chinese Traditional | zh_cht |
Chuvash | cv |
Corsican | co |
Croatian | hr |
Czech | cs |
Danish | da |
Dari | prs |
Divehi | dv |
Dutch | nl |
English | en |
Esperanto | eo |
Estonian | et |
Faroese | fo |
Fijian | fj |
Finnish | fi |
French | fr |
Galician | gl |
Georgian | ka |
German | de |
Greek | el |
Gujarati | gu |
Haitian | ht |
Hausa | ha |
Hebrew | he |
Hindi | hi |
Hmong Daw | mww |
Hungarian | hu |
Icelandic | is |
Igbo | ig |
Indonesian | id |
Inuktitut | iu |
Irish | ga |
Italian | it |
Japanese | ja |
Javanese | jv |
Kannada | kn |
Kazakh | kk |
Kinyarwanda | rw |
Kirghiz | ky |
Korean | ko |
Kurdish | ku |
Lao | lo |
Latin | la |
Latvian | lv |
Lithuanian | lt |
Luxembourgish | lb |
Macedonian | mk |
Malagasy | mg |
Malay | ms |
Malayalam | ml |
Maltese | mt |
Maori | mi |
Marathi | mr |
Mongolian | mn |
Nepali | ne |
Norwegian | no |
Norwegian Nynorsk | nn |
Odia | or |
Pasht | ps |
Persian | fa |
Polish | pl |
Portuguese | pt |
Punjabi | pa |
Queretaro Otomi | otq |
Romanian | ro |
Russian | ru |
Samoan | sm |
Serbian | sr |
Shona | sn |
Sindhi | sd |
Sinhala | si |
Slovak | sk |
Slovenian | sl |
Somali | so |
Spanish | es |
Sundanese | su |
Swahili | sw |
Swedish | sv |
Tagalog | tl |
Tahitian | ty |
Tajik | tg |
Tamil | ta |
Tatar | tt |
Telugu | te |
Thai | th |
Tibetan | bo |
Tigrinya | ti |
Tongan | to |
Turkish | tr |
Turkmen | tk |
Upper Sorbian | hsb |
Uyghur | ug |
Ukrainian | uk |
Urdu | ur |
Uzbek | uz |
Vietnamese | vi |
Welsh | cy |
Xhosa | xh |
Yiddish | yi |
Yoruba | yo |
Yucatec Maya | yua |
Zulu | zu |
Romanized Indic Languages supported by Language Detection
Language | Language Code |
---|---|
Assamese | as |
Bengali | bn |
Gujarati | gu |
Hindi | hi |
Kannada | kn |
Malayalam | ml |
Marathi | mr |
Odia | or |
Punjabi | pa |
Tamil | ta |
Telugu | te |
Urdu | ur |
Script detection
Language | Script code | Scripts |
---|---|---|
Bengali (Bengali-Assamese) | as |
Latn , Beng |
Bengali (Bangla) | bn |
Latn , Beng |
Gujarati | gu |
Latn , Gujr |
Hindi | hi |
Latn , Deva |
Kannada | kn |
Latn , Knda |
Malayalam | ml |
Latn , Mlym |
Marathi | mr |
Latn , Deva |
Oriya | or |
Latn , Orya |
Gurmukhi | pa |
Latn , Guru |
Tamil | ta |
Latn , Taml |
Telugu | te |
Latn , Telu |
Arabic | ur |
Latn , Arab |
Cyrillic | tt |
Latn , Cyrl |
Serbian sr |
Latn , Cyrl |
|
Unified Canadian Aboriginal Syllabics | iu |
Latn , Cans |
Next steps
Phản hồi
https://aka.ms/ContentUserFeedback.
Sắp ra mắt: Trong năm 2024, chúng tôi sẽ dần gỡ bỏ Sự cố với GitHub dưới dạng cơ chế phản hồi cho nội dung và thay thế bằng hệ thống phản hồi mới. Để biết thêm thông tin, hãy xem:Gửi và xem ý kiến phản hồi dành cho