question

JeanBaptisteRanson-9753 avatar image
0 Votes"
JeanBaptisteRanson-9753 asked JeanBaptisteRanson-9753 commented

Azure Linguistic Analysis API / sentences extraction / tokenization

Hello, In the context of the rewriting of a Swift / MacOS application to C# / .NET Core, that extensively uses the Apple NLP Framework tokenization API https://developer.apple.com/documentation/naturallanguage, I was looking for similar API in Azure Cognitive Services. In the following documentation, the Azure Linguistic Analysis API are mentioned in the "Low-level natural language processing capabilities": https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/natural-language-processing. I don't find them in any Azure Cognitive Services API. Where have they gone? What would be alternative solutions for such a requirements (string tokenization with sentence / paragraph recognition), knowing that some documents will be previously processed by OCR? Regards, Jean-Baptiste.

azure-cognitive-services
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

romungi-MSFT avatar image
0 Votes"
romungi-MSFT answered JeanBaptisteRanson-9753 commented

@JeanBaptisteRanson-9753 For text analysis or NLP of text you should look at the text analytics API. It provides features like Sentiment analysis, Key phrase extraction, Language detection, Named entity recognition and others.

If you are looking to use OCR to process images and extract text then Azure computer vision is the right service to start off and then use the text analytics API to use the features mentioned above. I hope this helps!!


· 3
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@romungi-MSFT
Thanks for your answer.
What I need here is a string tokenizer that can extract Word / Sentence Paragraph cleverly. Not advanced features referred to "High-level natural language processing capabilities in the following link: https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/natural-language-processing.79709-features.png


0 Votes 0 ·
features.png (204.2 KiB)
romungi-MSFT avatar image romungi-MSFT JeanBaptisteRanson-9753 ·

@JeanBaptisteRanson-9753 Linguistic Analysis API is currently not available as the preview version of this service did not make it to GA. The documentation in the above link needs to be updated where references to this service needs to be removed.

The closest available option that matches this feature is the key phrase extraction of Text Analytics API which evaluates unstructured text, and for each JSON document, returns a list of key phrases. For example, given input text "The food was delicious and there were wonderful staff", the service returns the main talking points: "food" and "wonderful staff".


0 Votes 0 ·

@romungi-MSFT
Thanks for this information.
It is unfortunate, but at least the reason is clear. Sad that the Linguistic Analysis API didn't make it to GA.
Regards,
Jean-Baptiste.

0 Votes 0 ·