Poor Performance of Document Intelligence on Table Extractions
We started exploring the Azure Document Intelligence tool and unfortunately experienced very poor performance on table extraction. The tables we aim to extract have dynamic column names and rows, but they share a pretty similar overall structure (e.g., header, sub-header, and row header). All values except the headers are selection marks, as shown below in the original table. Apparently, Azure DI fails to detect all selection marks within this table. Could you please let us know how we can improve our custom models? We have tried creating ad-hoc labels in selection marks for all undetected selection marks from the source PDF. However, the results were no better than the default model. Any help would be greatly appreciated. Thanks!