Maximum Size of Array Element for Detect Language API in Microsoft Translator Services is 10000 or 50000?

Satya Sarika Sunkara 1 Reputation point
2020-09-01T07:43:32.15+00:00

As per the below link, the maximum size of each array element for Detect is shown as 10000.

https://learn.microsoft.com/en-us/azure/cognitive-services/translator/request-limits#character-and-array-limits-per-request

However we observed that Detect is accepting maximum of 50000 characters as size of array element.
For example: We have sent an array of 2 elements each of 25000 characters and it accepted. We have sent an array of 1 element of size 50000 characters and it accepted.

Can someone please confirm what's the maximum character limit allowed for each array element?

Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
339 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,354 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. YutongTie-MSFT 46,091 Reputation points
    2020-09-01T17:38:23.027+00:00

    Hello,

    Thanks for the feedback, I am checking with the author and engineering team. Will let you know the result and update the document to make it clear and consistent.

    Regards,
    Yutong

    0 comments No comments

  2. YutongTie-MSFT 46,091 Reputation points
    2020-09-06T03:50:14.453+00:00

    Hi,

    Thanks for the feedback. We have created a work item for this document.

    As a suggestion, it doesn't make a lot of sense to make large Detect requests. Better strategy is to select the relevant portion of the text and detect just that. For instance the body of the email, or the comment on the web page, or the first or second paragraph of the document.

    You may break the document at natural breakpoints, for instance paragraph marks or section headers, and then determine the language for each individually.

    When you send a large document, you get a single set of probabilities for the entire long document: Not that useful.

    Regards,
    Yutong

    0 comments No comments