question

johnjohn-0472 avatar image
0 Votes"
johnjohn-0472 asked sadomovalex commented

Integration between ABBYY and SharePoint online custom list attachments



We want to OCR images uploaded to SharePoint, and the OCR capabilities provided by Power Automate will have a lot of limitations, mainly the inability to understand Signatures & most importantly the Checkboxes. So we are planning to go with ABBYY

So can we integrate SharePoint with ABBYY using this scenario:-

  • We have multiple documents that we need to extract for each record

  • We will create a SharePoint custom list which contain all the data that we need to extract from all the images

  • Attached multiple files to the SharePoint list item >> add the new item

  • Then call AbbYY either using Microsoft Power Automate or by writing remote event receivers >> to extract the data from the images (which are the attachments inside the list item) and populate SharePoint online list item’s fields.

My question is if the above is achievable or not?

Thanks

office-sharepoint-online
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

sadomovalex avatar image
2 Votes"
sadomovalex answered sadomovalex commented

not sure whether ABBYY library may work directly with SPO. But in general if it may work with images represented e.g. as bytes array (MemoryStream) then you may first read images from SPO list item's attachments via CSOM and pass them to ABBYY lib. I recommend first to create simple console app which will do these steps - and if it will work plan how to trigger this code from SPO (e.g. install it as Azure function or Azure web job which runs periodically by scheduler).

· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@sadomovalex thanks for the hint and help,, but what do you exactly mean by and pass them to ABBYY lib how i can do so?
Thanks

0 Votes 0 ·

like I wrote I'm not sure how exactly ABBYY API is implemented: it can be web service or dll. If it is dll it may have class and method which has byte[] as param which represents image. Here is pseudocode:

 public class ABBYY
 {
     string DoOCR(byte[] image) { ... }
 }

Then you need to read images from list items attachments to byte[] and pass them to ABBYY.DoOCR() method. Here is example how you may read images attached to list items via CSOM: Read SharePoint List Item Attachment (image) and Convert Into Base64 String Using CSOM Through App Only Authentication.


0 Votes 0 ·