I recently upgraded my TSA Throughput Project, which processes TSA Throughput PDF files like this one on the TSA FOIA Reading Room site, to the latest stable version (3.1.1) of the Azure.AI.FormRecognizer client SDK.
This version defaults to the 2.1 version of the Form Recognizer service. Version 2.1 does not behave the same way as version 2.0 when recognizing cells that span more than one row.
Given the figure below
Version 2.0 recognizes the Date and Hour of Day cells in the position immediately before the cell containing "ANC".
Version 2.1 recognizes:
the Date in the cell before the cell containing "MDW"
the Hour of Day in the cell before the "blank" cells and the cell containing "Terminal 7 - Passenger"
This seems to be a less than optimal result, as it doesn't correctly reflect the "Table Layout / Rowspan" coordinates. To work around this, I had to explicitly request version 2.0 using the FormRecognizerClientOptions.
What should happen: Version 2.1 should return the same Table/Cells layout as version 2.0, respecting RowSpan and Colspan layouts.
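To make the expected layout concrete, here is a minimal, language-neutral sketch in Python of how a cell's row and column span could be expanded into a full grid. It is not SDK code: the `Cell` class and `expand_to_grid` helper are hypothetical, with fields chosen to loosely mirror the row index, column index, and span values a recognized table cell carries, and the table contents are placeholders rather than values from the actual PDF.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Cell:
    """Hypothetical stand-in for a recognized table cell."""
    text: str
    row_index: int
    column_index: int
    row_span: int = 1
    column_span: int = 1


def expand_to_grid(cells: List[Cell], row_count: int,
                   column_count: int) -> List[List[Optional[str]]]:
    """Copy each cell's text into every grid position its span covers."""
    grid: List[List[Optional[str]]] = [
        [None] * column_count for _ in range(row_count)
    ]
    for cell in cells:
        for r in range(cell.row_index, cell.row_index + cell.row_span):
            for c in range(cell.column_index,
                           cell.column_index + cell.column_span):
                grid[r][c] = cell.text
    return grid


# Placeholder table: the Date and Hour of Day cells start on the first row
# (the "ANC" row) and span all three rows. The values are illustrative only.
cells = [
    Cell("<date>", row_index=0, column_index=0, row_span=3),
    Cell("<hour>", row_index=0, column_index=1, row_span=3),
    Cell("ANC", row_index=0, column_index=2),
    Cell("MDW", row_index=1, column_index=2),
    Cell("<other>", row_index=2, column_index=2),
]

grid = expand_to_grid(cells, row_count=3, column_count=3)
for row in grid:
    print(row)
```

With a layout anchored this way, every row in the span resolves to the same Date and Hour of Day. If the spanning cells were instead anchored on a later row, the earlier rows would be left without those values, which is the kind of mismatch described above.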