Is it possible to use Azure Form Recognizer to import data (line items) from a PDF into rows in Excel?

Akesserwani 1 Reputation point
2022-05-04T21:05:28.527+00:00

Is it possible to use Azure Form Recognizer to import data (line items) from a PDF into rows in Excel? If there is an automated tool that does the extraction, I am confident I can supply the logic (a bit of direction) needed to perform that task, at least in my case. It would save a lot of time and make the entire process efficient and worth doing.

Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.

1 answer

  1. romungi-MSFT 41,861 Reputation points Microsoft Employee
    2022-05-05T12:05:18.907+00:00

    @Akesserwani It is not possible to convert a PDF document directly to an Excel file. However, with the Cognitive Services Computer Vision service you can extract the text of a PDF file as a JSON response, using the Read API to return the text of each page in the document. You can then convert this JSON to Excel by processing it with any standard library.
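As a rough illustration of the "process the JSON" step, here is a minimal Python sketch. It assumes the Read result has already been downloaded and parsed into a dict shaped like `analyzeResult.readResults[].lines[].text`; check that layout against the response your API version actually returns. The function names and the sample data are hypothetical, and the output is CSV (which Excel opens directly) so that only the standard library is needed.

```python
import csv
import io


def read_result_to_rows(result):
    """Flatten a parsed Read result into (page, line_number, text) rows.

    Assumes the layout analyzeResult -> readResults -> lines -> text.
    """
    rows = []
    for page in result.get("analyzeResult", {}).get("readResults", []):
        for line_no, line in enumerate(page.get("lines", []), start=1):
            rows.append((page.get("page"), line_no, line.get("text", "")))
    return rows


def rows_to_csv(rows, header):
    """Serialize rows to CSV text that Excel can open."""
    buf = io.StringIO(newline="")
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()


# Hypothetical sample standing in for a real Read response.
sample_read_result = {
    "status": "succeeded",
    "analyzeResult": {
        "readResults": [
            {"page": 1, "lines": [{"text": "Widget A 2 10.00"},
                                  {"text": "Widget B 1 5.50"}]},
            {"page": 2, "lines": [{"text": "Total 25.50"}]},
        ]
    },
}

rows = read_result_to_rows(sample_read_result)
csv_text = rows_to_csv(rows, ["page", "line", "text"])
print(csv_text)
```

Each detected text line becomes one row, so splitting a line item into separate item, quantity, and price columns is still up to your own parsing logic.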

    Form Recognizer is another service that can extract data from forms, but you need to either custom-train a model to pick out specific data or use a prebuilt model that extracts data from certain document types, such as a standard receipt. The response is again returned as JSON, which needs to be processed and converted to Excel. I hope this helps!
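If the Form Recognizer response contains tables, the conversion could look like the sketch below. It assumes each table is a dict with a `cells` list whose entries carry `rowIndex`, `columnIndex`, and the cell text under `content` (some API versions use `text` instead); verify these field names against your own response. The helper name and sample data are hypothetical.

```python
import csv
import io


def table_to_grid(table):
    """Rebuild a 2-D grid of strings from a table's flat cell list.

    Grid size is derived from the cells themselves, so no row or
    column count field is required on the table.
    """
    cells = table.get("cells", [])
    if not cells:
        return []
    n_rows = max(c["rowIndex"] for c in cells) + 1
    n_cols = max(c["columnIndex"] for c in cells) + 1
    grid = [[""] * n_cols for _ in range(n_rows)]
    for c in cells:
        # Assumption: text is under "content", falling back to "text".
        grid[c["rowIndex"]][c["columnIndex"]] = c.get("content", c.get("text", ""))
    return grid


# Hypothetical sample standing in for one table from a real response.
sample_table = {
    "cells": [
        {"rowIndex": 0, "columnIndex": 0, "content": "Item"},
        {"rowIndex": 0, "columnIndex": 1, "content": "Qty"},
        {"rowIndex": 0, "columnIndex": 2, "content": "Price"},
        {"rowIndex": 1, "columnIndex": 0, "content": "Widget A"},
        {"rowIndex": 1, "columnIndex": 1, "content": "2"},
        {"rowIndex": 1, "columnIndex": 2, "content": "10.00"},
        {"rowIndex": 2, "columnIndex": 0, "content": "Widget B"},
        {"rowIndex": 2, "columnIndex": 2, "content": "5.50"},
    ]
}

grid = table_to_grid(sample_table)

# Write the grid as CSV, which Excel opens directly.
buf = io.StringIO(newline="")
csv.writer(buf).writerows(grid)
table_csv = buf.getvalue()
print(table_csv)
```

Cells missing from the response are left as empty strings, so the rows stay aligned. To produce a native .xlsx file instead of CSV, the same grid rows could be appended to a worksheet with a library such as openpyxl.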

