
DiogoSilva1180611-9620 asked romungi-MSFT edited

Training a Custom Model without using labeling tool

I have an application that uses the Form Recognizer .NET SDK to validate a number of different documents. I now want to add another custom model to a composed model. From what I found, training a custom model with labels requires a <filename>.pdf.labels.json file for each document in my blob container. How can I generate these files without using the labeling tool?

azure-form-recognizer

1 Answer

romungi-MSFT answered romungi-MSFT edited

@DiogoSilva1180611-9620 It is possible to create a labels.json file and use it in training without the labeling tool. However, to get a file format that works for your training, you first need to use the tool to run OCR on your files and generate labels, which creates the OCR and label files (ocr.json and labels.json) in your blob container. Before you hit the training button, you can copy these files and then edit them for reuse in subsequent trainings. For newer documents, though, the bounding box coordinates in the label files must be updated correctly, either with the tool or with your own application that can create such files.

Here is an example of one such user who was able to follow this method to train models by creating their own label files.

The documentation does not go into much detail on this, but the guidance is to use the tool to generate these files, even if you want to label the files manually later for training. I hope this helps!
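If you go the "your own application" route, here is a rough Python sketch of what writing a label file could look like. Treat it as an illustration only: the field names ("document", "labels", "label", "key", "value", "page", "text", "boundingBoxes") and the idea that each bounding box is stored as eight numbers scaled to the 0..1 range by page width and height are assumptions based on files the labeling tool typically produces, so compare against a labels.json the tool generated for your own documents before relying on it. The helper names build_labels and normalize_box, the document name, and the sample coordinates are all made up for the example.

```python
import json


def normalize_box(box, page_width, page_height):
    """Scale an 8-number polygon [x1, y1, ..., x4, y4] into the 0..1 range.

    Assumption: the label file stores coordinates relative to the page size,
    while the OCR output stores them in page units (inches or pixels).
    """
    return [
        round(v / (page_width if i % 2 == 0 else page_height), 6)
        for i, v in enumerate(box)
    ]


def build_labels(document_name, page_width, page_height, fields):
    """Build the content of a label file for one document.

    fields maps a label name to a list of (page, text, box) tuples, where
    box is the polygon of the matching word taken from the OCR output.
    The key names below are assumed, not taken from official documentation.
    """
    labels = []
    for name, words in fields.items():
        value = [
            {
                "page": page,
                "text": text,
                "boundingBoxes": [normalize_box(box, page_width, page_height)],
            }
            for page, text, box in words
        ]
        labels.append({"label": name, "key": None, "value": value})
    return {"document": document_name, "labels": labels}


# Hypothetical input: one "Total" field on page 1 of an 8.5 x 11 inch page.
example = build_labels(
    "invoice-001.pdf",
    8.5,
    11.0,
    {"Total": [(1, "$100.00", [6.8, 9.9, 7.65, 9.9, 7.65, 10.12, 6.8, 10.12])]},
)

# This string is what you would upload as invoice-001.pdf.labels.json.
labels_json = json.dumps(example, indent=2)
print(labels_json)
```

The text and polygon for each field have to match what OCR found on that specific document, which is why the matching ocr.json is still needed as the source of the coordinates.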

