Supported languages in Text Analytics API
This article explains which languages are supported for each operation: sentiment analysis, key phrase extraction, and language detection.
Text Analytics API can detect up to 120 different languages. Language Detection returns the "script" of a language. For instance, for the phrase "I have a dog" it will return
en instead of
en-US. The only special case is Chinese, where the language detection capability will return
zh_CHT if it can determine the script given the text provided. In situations where a specific script cannot be identified for a Chinese document, it will return simply
Sentiment Analysis and Key Phrase Extraction
For sentiment analysis and key phrase extraction, the list of supported languages is more selective as we refine the analyzers to accommodate the linguistic rules of additional languages.
Language list and status
Language support is initially rolled out in preview, graduating to generally available (GA) status, independently of each other and of the Text Analytics service overall. It's possible for languages to remain in preview, even while Text Analytics API transitions to generally available.
|Language||Language code||Sentiment||Key phrases||Notes|
* indicates language support in preview