question

BenManningetherFAX-9247 avatar image
0 Votes"
BenManningetherFAX-9247 asked GiftA-MSFT answered

parsing from multiple forms in a single payload

I need to extract data from each individual form that is received when multiple forms are sent in a single payload. For example, for an Accounts Receivable department that batch sends me their daily forms. They may have 50 forms sent in one batch (such as sending me a large fax). I need to extract the data using MS Forms Recognizer for each individual form in the batch into JSON or XML.

How do I achieve this if I have a training model for the single form, but receive multiple forms at a time sent to me in one payload?

azure-form-recognizer
· 4
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Hi, can you confirm which feature you are using for your scenario (custom model, prebuilt model, or layout api)?


0 Votes 0 ·

MS Forms Recognizer with custom model

0 Votes 0 ·
GiftA-MSFT avatar image GiftA-MSFT BenManningetherFAX-9247 ·

To train a custom model, you'll need a minimum of five completed forms. Your forms must be of the same type of document and follow the input requirements. Here's a useful tutorial on how to extract form data in bulk using Azure Data Factory. Let me know if this is the solution you're looking for.


0 Votes 0 ·

that's not the question. I have uploaded the form I need to my training model. And I've trained on the form. The data extraction works perfect.

Where I need help is in the real-world example. If I receive a fax with just one form, the Form Recognizer custom model perfectly extracts the data from the one form.

If someone faxes me 10 forms in a single fax, Forms Recognizer is only extracting the data from the first form. How do I parse each subsequent form, such that I end up with 10 outputs, one from each form?

0 Votes 0 ·

1 Answer

GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered

Thanks for clarifying, however, this scenario isn't supported. Currently, the analyze api only supports 1 form. My suggestion would be to create a heuristic where you implement logic in your code to process multiple files with several api calls (with time interval). Another workaround would be to use Azure Data Factory as mentioned above. Hope this helps.


5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.