Language and region support for the Text Analytics API
This article explains which languages are supported for each operation: sentiment analysis, key phrase extraction, and language detection.
The Text Analytics API can detect up to 120 different languages. Language Detection returns the "script" of a language. For instance, for the phrase "I have a dog" it will return
en instead of
en-US. The only special case is Chinese, where the language detection capability will return
zh_CHT if it can determine the script given the text provided. In situations where a specific script cannot be identified for a Chinese document, it will return simply
Sentiment Analysis, Key Phrase Extraction, and Entity Recognition
For sentiment analysis, key phrase extraction, and entity recognition, the list of supported languages is more selective as the analyzers are refined to accommodate the linguistic rules of additional languages.
Language list and status
Language support is initially rolled out in preview, graduating to generally available (GA) status, independently of each other and of the Text Analytics service overall. It's possible for languages to remain in preview, even while Text Analytics API transitions to generally available.
|Language||Language code||Sentiment||Key phrases||Entity Recognition||Notes|
* indicates language support in preview
** Entity extraction for Spanish is only available in (version 2.1-preview)