Azure Document Intelligence crashing when trying to read unknown characters

PrivaC 0 Reputation points
2024-05-08T03:52:55.18+00:00

Hi, I'm trying to parse data from an image that's in Bengali, the native language of Bangladesh. While using Document Intelligence to parse the information, I'm getting the following error:

UnicodeEncodeError: 'charmap' codec can't encode characters in position 76-84: character maps to <undefined>

This made me believe that it isn't recognizing the Bengali characters, so I tried a fully English image and it worked fine. The document I want to read has both Bengali and Latin characters. Is there a way to ignore the unknown Bengali characters while parsing? Thank you in advance.
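
For anyone hitting the same message: a 'charmap' UnicodeEncodeError is normally raised by Python itself when text is encoded to a legacy code page such as cp1252, for example when printing to a Windows console or writing to a file opened without an explicit encoding. So the failure may be happening after the service has already returned the Bengali text, not during recognition. The snippet below is a minimal sketch under that assumption. The sample string and the helper names are hypothetical and are not part of the Document Intelligence SDK.

```python
# Hypothetical stand-in for text returned by the service (mixed Latin and Bengali).
SAMPLE_TEXT = "Name: বাংলাদেশ"


def encode_strict(text: str, encoding: str = "cp1252") -> bytes:
    """Encode with the default 'strict' handler, as print() or file.write() would.

    Raises UnicodeEncodeError if the code page has no mapping for a character.
    """
    return text.encode(encoding)


def drop_unencodable(text: str, encoding: str = "cp1252") -> str:
    """Silently discard characters the target code page cannot represent."""
    return text.encode(encoding, errors="ignore").decode(encoding)


def roundtrip_utf8(text: str) -> str:
    """Encode and decode as UTF-8, which can represent every character."""
    return text.encode("utf-8").decode("utf-8")
```

If this is indeed the cause, keeping the Bengali text is usually preferable to dropping it: open output files with `open(path, "w", encoding="utf-8")`, or call `sys.stdout.reconfigure(encoding="utf-8")` (Python 3.7+) before printing. Dropping characters with `errors="ignore"` matches what was asked, but it loses data.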

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.