This content applies to:v4.0 (GA)v3.1 (GA)v3.0 (GA)v2.1 (GA)
Document Intelligence service is updated on an ongoing basis. Bookmark this page to stay up to date with release notes, feature enhancements, and our newest documentation.
Important
Preview API versions are retired once the GA API is released. The 2023-02-28-preview API version is retiring. If you're still using the preview API or the associated SDK versions, update your code to target the latest API version 2024-11-30 (GA).
December 2024
Document Intelligence v4.0 programming language SDKs are now generally available (GA)!
Batch API now supports all models, including all read, layout, prebuilt verticals, and custom models.
Batch API supports LIST function to allow users to list batch jobs within past seven days.
Batch API supports DELETE function to explicitly delete batch job for GDPR and privacy compliance.
GetAnalyzeBatchResult supports resultId in response to LIST all resultIds.
🆕 Searchable PDF. The prebuilt read model now supports images formats (JPEG/JPG, PNG, BMP, TIFF, HEIF) and language expansion to include Chinese, Japanese, and Korean for PDF output.
Custom classification model supports incremental training. You can add new samples to existing classes or add new classes by referencing an existing classifier.
With v4.0, custom classification model doesn't split documents by default during analysis. You need to explicitly set 'splitMode' property to auto to preserve the older behavior.
Custom classification model now supports 25,000 pages as new training page limit.
Custom Neural model now supports signature detection.
Custom neural models support paid training for longer duration when you need to train model with a larger labeled dataset. The first 20 training runs in a calendar month continue to be free. Any training operations over 20 is on the paid tier. Learn more details on billing.
Analyze response is stored for 24 hours from when the operation completes for retrieval. For scenarios where you want to delete the response sooner, use the delete analyze response API to delete the response.
The v4.0 API includes cumulative updates from preview releases as listed:
The Document Intelligence 2024-07-31-preview REST API is now available. This preview API introduces new and updated capabilities:
Public preview version 2024-07-31-preview is currently available only in the following Azure regions. The new document field extraction model in Azure AI Foundry portal is only available in North Central US region:
Document Intelligence now adds support for composing model with an explicit custom classification model. Learn more about the benefits of using the new compose capability.
New unified US tax model that can extract from forms such as W-2, 1098, 1099, and 1040.
🆕 Searchable PDF. The prebuilt read model now supports PDF output to download PDFs with embedded text from extraction results, allowing for PDF to be utilized in scenarios such as search copy of contents.
Layout model now supports improved figure detection where figures from documents can now be downloaded as an image file to be used for further figure understanding. The layout model also features improvements to the OCR model for scanned text targeting improvements for single characters, boxed text, and dense text documents.
Document Intelligence now adds support for batch analysis operation to support analyzing a set of documents to simplify developer experience and improve efficiency.
Query fields AI quality of extraction is improved with the latest model.
May 2024
The Document Intelligence Studio adds support for Microsoft Entra (formerly Azure Active Directory) authentication. For more information, seeAuthentication in Document Intelligence Studio.
February 2024
The Document Intelligence 2024-07-31-preview REST API is now available. This preview API introduces new and updated capabilities:
Public preview version 2024-07-31-preview is currently available only in the following Azure regions:
Custom classification model now supported incremental training for scenarios where you need to update the classifier model with added samples or classes. Learn more about incremental training.
Custom classification model adds support for Office document types (.docx, .pptx, and .xls). Learn more about expanded document type support.
Table extraction, reading order, and section heading detection improvements.
With the Document Intelligence 2023-10-31-preview, the general document model (prebuilt-document) is deprecated. Going forward, to extract key-value pairs from documents, use the
prebuilt-layout model with the optional query string parameter features=keyValuePairs enabled.
New 1099 tax model. Supports base 1099 form and the following variations: A, B, C, CAP, DIV, G, H, INT, K, LS, LTC, MISC, NEC, OID, PATR, Q, QA, R, S, SA, SB​.
Query fields are available with the 2023-10-31-preview release.
Add-on capabilities are available within all models excluding the Read model.
Note
With the 2022-08-31 API general availability (GA) release, the associated preview APIs are being deprecated. If you're using the 2021-09-30-preview, 2022-01-30-preview, or 2022-06-30-preview API versions, update your applications to target the 2022-08-31 API version. There are a few minor changes involved, for more information, see the migration guide.