question

VittorioTison-1960 avatar image
0 Votes"
VittorioTison-1960 asked VittorioTison-1960 commented

Form recognizer and non-us invoices without pre-defined structure: best practice?

Hello everybody!

I would like to use the form recognizer service to analyze invoices. However, the invoices don't follow a predefined structure.
Do you have any similar cases? are there any best practices you know to handle such situations? Would composite models work in this case? i.e. does the cognitive service "learn" a bit from each and tries to find a solution accordingly or it finds which of the models forming the "composition" best suits the case and then uses it?

Thank you and best regards,

Vittorio

azure-form-recognizer
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

YutongTie-MSFT avatar image
0 Votes"
YutongTie-MSFT answered VittorioTison-1960 commented

Hello,

I think you are asking custom model for your business. Besides prebuilt model, you can use custom model. A custom model is a machine learning program trained to recognize form fields within your distinct content and extract key-value pairs and table data. You only need five examples of the same form type to get started and your custom model can be trained with or without labeled datasets. Custom models are trained for your distinct data and use cases.

Reference: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/label-tool
https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/concept-custom?tabs=fott

Regards,
Yutong

· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Hi, thank you for your answer.
Yes, I am definitely going with the custom model.
As far as I have understood with the "composed" model the AI makes a kind of "preselection" of the model to use, i.e. if an invoice with a totally different layout in respect of the various models forming the "composed model", it will most likely not return the correct values, is that right?
My question is aimed more to know if anybody has a "best practice" when the documents will have more or less the same contents (invoices), but the layout is almost always variable.

Thank you!

Vittorio

0 Votes 0 ·