
NaveenRangarajan-6563 asked YutongTie-MSFT commented

Azure Cognitive Services - Text Analytics - Detect Language feature: unable to detect Ukrainian and Vietnamese. Shows (Unknown) for detected language name and iso6391Name

Hi Guys,


I am using the Language Detection feature of Azure Cognitive Services Text Analytics in my application. However, the API fails to detect Ukrainian and Vietnamese: it returns (Unknown) for both the detected language name and iso6391Name.

According to the link below, both languages are supported.

https://docs.microsoft.com/en-us/azure/cognitive-services/text-analytics/language-support?tabs=language-detection

Please let me know why it is not able to detect these two languages. The resource was created in the Central India region under a Visual Studio Enterprise subscription.

Attached are Postman screenshots for four languages, including Ukrainian and Vietnamese, for comparison. Please let me know if any more details are needed.


French
107586-french-detected.png

Turkish
107652-turkish-detected.png

Ukrainian
107643-ukrainian-notdetected.png

Vietnamese
107653-vietnamese-notdetected.png







azure-cognitive-services azure-text-analytics

Hello,

I am sorry, I cannot see your screenshots; they show as broken. Could you please share the input with us so we can investigate? Thanks.
107851-image.png
Regards,
Yutong



I have posted the JSON objects for various languages as an answer. Please let me know if you need any other data.


1 Answer

RNaveen-8887 answered YutongTie-MSFT commented

Hi,

Below are the details requested. I have added Italian and Japanese, which are working.

Italian and Japanese

Request Body
{
  "documents": [
    {
      "id": "1",
      "text": "Metodi diagnostici convenzionali in patologia dopo aver prelevato un campione di tessuto da un paziente"
    },
    {
      "id": "2",
      "text": "蛍光抗体検査、電子のためのグルタルアルデヒドでの固定顕微鏡学"
    }
  ]
}


Output

{
  "documents": [
    {
      "id": "1",
      "detectedLanguage": {
        "name": "Italian",
        "iso6391Name": "it",
        "confidenceScore": 1.0
      },
      "warnings": []
    },
    {
      "id": "2",
      "detectedLanguage": {
        "name": "Japanese",
        "iso6391Name": "ja",
        "confidenceScore": 1.0
      },
      "warnings": []
    }
  ],
  "errors": [],
  "modelVersion": "2021-01-05"
}
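
In case it helps with reproducing this outside Postman, here is a minimal Python sketch that builds the same kind of request body. It is only an illustration under assumptions: the post does not state which API version was called, so the v3.0 path below is a guess based on the confidenceScore field in the output, and the endpoint host and key are placeholders. build_request and detect are hypothetical helper names, not part of any SDK.

```python
import json
import urllib.request

# Assumption: the v3.0 REST path. Adjust it to the version your resource uses.
LANGUAGES_PATH = "/text/analytics/v3.0/languages"


def build_request(endpoint, key, texts):
    """Build (but do not send) a language detection request for the given texts."""
    body = {
        "documents": [
            # Ids are assigned 1, 2, 3, ... as strings, as in the request bodies above.
            {"id": str(i), "text": text}
            for i, text in enumerate(texts, start=1)
        ]
    }
    return urllib.request.Request(
        url=endpoint.rstrip("/") + LANGUAGES_PATH,
        # ensure_ascii=False keeps non-Latin text as UTF-8 instead of \u escapes.
        data=json.dumps(body, ensure_ascii=False).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json; charset=utf-8",
        },
        method="POST",
    )


def detect(request):
    """Send the request and return the parsed JSON response (needs a real resource)."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a real endpoint and key, detect(build_request(endpoint, key, [text1, text2])) should return a dictionary shaped like the output above.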


Ukrainian and Vietnamese, which are not working

Request Body

{
  "documents": [
    {
      "id": "1",
      "text": "Зусилля з метою комерціалізації біоконверсії лінгоцелюлозних сировинних ресурсів можуть потенційно"
    },
    {
      "id": "2",
      "text": "Sự đa dạng về các đặc điểm hóa lý của sinh khối xenlulo cho thấy nhu cầu về công nghệ tiền xử lý"
    }
  ]
}


Output

{
  "documents": [
    {
      "id": "1",
      "detectedLanguage": {
        "name": "(Unknown)",
        "iso6391Name": "(Unknown)",
        "confidenceScore": 0.0
      },
      "warnings": []
    },
    {
      "id": "2",
      "detectedLanguage": {
        "name": "(Unknown)",
        "iso6391Name": "(Unknown)",
        "confidenceScore": 0.0
      },
      "warnings": []
    }
  ],
  "errors": [],
  "modelVersion": "2021-01-05"
}


Both name and iso6391Name come back as (Unknown).
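
For checking many documents at once, a small Python sketch like the one below could pick out the ids that come back undetected. It assumes only the response shape shown above; undetected_ids is a hypothetical helper name, and the sample is a trimmed copy of the failing output.

```python
import json

UNKNOWN = "(Unknown)"


def undetected_ids(response):
    """Return the ids of documents whose iso6391Name is reported as (Unknown)."""
    return [
        doc["id"]
        for doc in response.get("documents", [])
        if doc.get("detectedLanguage", {}).get("iso6391Name") == UNKNOWN
    ]


# Trimmed copy of the Ukrainian/Vietnamese output from this post.
sample = json.loads("""
{
  "documents": [
    {"id": "1", "detectedLanguage": {"name": "(Unknown)", "iso6391Name": "(Unknown)", "confidenceScore": 0.0}, "warnings": []},
    {"id": "2", "detectedLanguage": {"name": "(Unknown)", "iso6391Name": "(Unknown)", "confidenceScore": 0.0}, "warnings": []}
  ],
  "errors": [],
  "modelVersion": "2021-01-05"
}
""")

print(undetected_ids(sample))
```

Run against the Italian and Japanese output instead, the same helper returns an empty list, since neither iso6391Name is (Unknown).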




Thanks for the details, I am trying on my side.


Regards,
Yutong
