question

MichalMuller-7040 avatar image
0 Votes"
MichalMuller-7040 asked MichalMuller-7040 commented

Using Azure OCR, some of the text is considered part of a table instead of on the same line but separated by large spaces. Can the tables be disabled ?

I'm using the endpoint https://test-ocr-mike.cognitiveservices.azure.com/ and supplying an image that is all text with the exception of a logo.

In the body of the image the data seems to be apparently visually separated by columns, no column lines are used.
However, the data is not meant to be a table, only that the separation has large spaces between the Label and the Value that is meant to be on the same line.

Before I go and write my own "line reconstruction" routines based upon y-axis values that are <= ~3 pixels, I'm hoping that there is a way to disable the table-ization of the data as opposed to recognizing that it is all on the same line ?

thank you.

azure-computer-vision
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

romungi-MSFT avatar image
0 Votes"
romungi-MSFT answered MichalMuller-7040 commented

@MichalMuller-7040 The OCR API does not have a parameter to pass to ignore values from the result. The only option available is to detect orientation.
The result also depends on the image, It would easy to provide advice if the image could be shared?

I would also suggest to try the read API which is much more accurate in reading any printed or handwritten text in images which runs asynchronously. So, the result needs to be retrieved using the operation id of the actual read call. With this sample image the response seems to detect lines with accuracy which seems to be on expected lines as per image. I hope this helps.


· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

romungi-MSFT,

Thank you for the response.

I'll try the Read API and see if the offers a different result.

Mike

0 Votes 0 ·