Language and region support for the Text Analytics API

This article explains which languages are supported for each operation: sentiment analysis, key phrase extraction, language detection, and named entity recognition.

Language Detection

The Text Analytics API can detect a wide range of languages, variants, dialects, and some regional/cultural languages. Language Detection returns the "script" of a language rather than a region-specific code. For instance, for the phrase "I have a dog" it returns en instead of en-US. The only special case is Chinese: Language Detection returns zh_CHS or zh_CHT if it can determine the script from the text provided. When a specific script cannot be identified for a Chinese document, it returns simply zh.

We don't publish the exact list of languages for this feature, but it can detect a wide range of languages, variants, dialects, and some regional/cultural languages.

If you have content expressed in a less frequently used language, you can try Language Detection to see if it returns a code. The response for languages that cannot be detected is unknown.
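The return values described in this section can be handled with a small helper. The following is a minimal sketch, not part of the API: `Detection` and `interpret_detected_code` are hypothetical names, and the sketch assumes the detected code has already been pulled out of the response as a plain string.

```python
from typing import NamedTuple, Optional


class Detection(NamedTuple):
    language: Optional[str]  # e.g. "en" or "zh"; None when nothing was detected
    script: Optional[str]    # "Simplified" or "Traditional" for Chinese, else None


# The two Chinese script codes named in this section
# (CHS = Chinese Simplified, CHT = Chinese Traditional).
CHINESE_SCRIPTS = {"zh_CHS": "Simplified", "zh_CHT": "Traditional"}


def interpret_detected_code(code: str) -> Detection:
    """Split a detected code into a language and, for Chinese, a script."""
    cleaned = code.strip()
    # Assumption: the "unknown" marker is compared case-insensitively.
    if cleaned.lower() == "unknown":
        return Detection(None, None)
    if cleaned in CHINESE_SCRIPTS:
        return Detection("zh", CHINESE_SCRIPTS[cleaned])
    # Everything else, including plain "zh", carries no script information.
    return Detection(cleaned, None)
```

A caller can check `language is None` to decide whether to skip a document, and check `script` only when the language is `zh`.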

Sentiment Analysis, Key Phrase Extraction, and Named Entity Recognition

For sentiment analysis, key phrase extraction, and entity recognition, the list of supported languages is more selective, because the analyzers are refined to accommodate the linguistic rules of each additional language. In Named Entity Recognition v2, support for the full set of entity types is currently limited to the following languages:

  • English
  • Chinese-Simplified
  • French
  • German
  • Spanish

Only the Person, Location, and Organization named entities are returned for the other languages.
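The rule above can be captured as a simple lookup. This is an illustrative sketch only: the function name is hypothetical, the language codes are the ones listed in the table later in this article, and it assumes the language is supported by Named Entity Recognition v2 in the first place.

```python
# Languages with full entity-type support in Named Entity Recognition v2:
# English, Chinese-Simplified, French, German, Spanish.
FULL_ENTITY_LANGUAGES = {"en", "zh-hans", "fr", "de", "es"}

# Entity types returned for every other language.
BASIC_ENTITY_TYPES = ("Person", "Location", "Organization")


def limited_to_basic_entities(language_code: str) -> bool:
    """Return True when only Person, Location and Organization are returned."""
    # Assumption: codes are matched case-insensitively.
    return language_code.strip().lower() not in FULL_ENTITY_LANGUAGES
```

A client could use such a check to avoid waiting for entity types that will never appear for a given language.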

Language list and status

Support for each language is initially rolled out in preview and later graduates to generally available (GA) status, independently of other languages and of the Text Analytics service overall. A language can remain in preview even after the Text Analytics API itself becomes generally available.

Note

For detailed language support for the Named Entity Recognition (NER) v3 public preview, see Named entity types.

Language  Language code  Sentiment  Key phrases  Named Entity Recognition  Entity linking  Notes
Arabic ar ✔ *
Czech cs ✔ *
Chinese-Simplified zh-hans ✔ **
Chinese-Traditional zh-hant ✔ **
Danish da ✔ * ✔ *
Dutch nl ✔ ** ✔ *
English en ✔ ** ✔ ** ✔ **
Finnish fi ✔ * ✔ *
French fr ✔ **
German de ✔ **
Greek el ✔ *
Hungarian hu ✔ *
Italian it ✔ ** ✔ *
Japanese ja ✔ ** ✔ *
Korean ko ✔ *
Norwegian (Bokmål) no ✔ * ✔ *
Polish pl ✔ * ✔ *
Portuguese (Portugal) pt-PT ✔ ** ✔ * pt also accepted
Portuguese (Brazil) pt-BR ✔ *
Russian ru ✔ * ✔ *
Spanish es ✔ ** ✔ * ✔ **
Swedish sv ✔ * ✔ *
Turkish tr ✔ * ✔ *

* Language support is in preview

** Also available in the Sentiment Analysis v3 and/or Named Entity Recognition v3 public previews.
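Before sending documents, it can help to check a language code against the list above. The following is a hedged sketch: `normalize_language_code` is a hypothetical helper, the case-insensitive matching is an assumption rather than documented API behavior, and the lookup records only which codes appear in the table, not which operations each language supports.

```python
from typing import Optional

# Language codes listed in the table above.
LANGUAGE_CODES = [
    "ar", "cs", "zh-hans", "zh-hant", "da", "nl", "en", "fi", "fr", "de",
    "el", "hu", "it", "ja", "ko", "no", "pl", "pt-PT", "pt-BR", "ru", "es",
    "sv", "tr",
]

# The table notes that "pt" is also accepted for Portuguese (Portugal).
ALIASES = {"pt": "pt-PT"}

# Lower-cased lookup so that "PT-br" resolves to the listed form "pt-BR".
_BY_LOWER = {code.lower(): code for code in LANGUAGE_CODES}


def normalize_language_code(code: str) -> Optional[str]:
    """Return the code as listed in the table, or None if it is not listed."""
    key = code.strip().lower()
    key = ALIASES.get(key, key).lower()
    return _BY_LOWER.get(key)
```

Returning `None` for unlisted codes leaves it to the caller to decide whether to fall back to Language Detection or reject the input.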

See also

Cognitive Services Documentation page
Cognitive Services Product page