Poor Performance of Document Intelligence on Table Extractions

Lee, Charlie 0 Reputation points
2024-04-30T16:08:04.5666667+00:00

We started exploring the Azure Document Intelligence tool and unfortunately experienced very poor performance on table extraction. The tables we aim to extract have dynamic column names and rows, but they share a pretty similar overall structure (e.g., header, sub-header, and row header). All values except the headers are selection marks, as shown below in the original table. Apparently, Azure DI fails to detect all selection marks within this table. Could you please let us know how we can improve our custom models? We have tried creating ad-hoc labels in selection marks for all undetected selection marks from the source PDF. However, the results were no better than the default model. Any help would be greatly appreciated. Thanks!

User's image

User's image

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,437 questions
{count} votes