question

ShwethaKumariAnantha-3767 avatar image
0 Votes"
ShwethaKumariAnantha-3767 asked ShwethaKumariAnantha-3767 answered

Arabic Support for Form Recogniser


Does Form Recognizer support Arabic text/table extraction?

If not is this on the roadmap?

I see that I am able to extract content from Arabic Digital PDF's. But the output for Scanned Arabic PDF's are worse. The Arabic text is returned in gibberish English and the tables are not extracted at all.

Would appreciate a quick answer.

azure-form-recognizer
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

SathyamoorthyVijayakumar-MSFT avatar image
1 Vote"
SathyamoorthyVijayakumar-MSFT answered

Hello @ShwethaKumariAnantha-3767 ,

Thanks for your question. Currently, Form Recognizer doesn't support Arabic Language.

For the list of supported languages you could refer the below article.

https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/language-support

Unfortunately, there is no timeline for the Arabic support at this point of time. Having said that, I would recommend you to voice out your requirement here - Azure Feedback - Cognitive Services. This is where the Product Groups look for features to add.




5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

ShwethaKumariAnantha-3767 avatar image
0 Votes"
ShwethaKumariAnantha-3767 answered

Thank you @SathyamoorthyVijayakumar-MSFT for the reply.

Have submitted the idea in the Azure feedback site as suggested.

People who find this feature necessary please do vote for the idea.

Link:
https://feedback.azure.com/forums/932041-azure-cognitive-services/suggestions/43396797-arabic-language-support-for-form-recognizer

5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.