What is Azure AI Document Intelligence?

Article
04/25/2024

Important

Document Intelligence public preview releases provide early access to features that are in active development.
Features, approaches, and processes may change, prior to General Availability (GA), based on user feedback.
The public preview version of Document Intelligence client libraries default to REST API version 2024-02-29-preview.
Public preview version 2024-02-29-preview is currently only available in the following Azure regions:
East US
West US2
West Europe

This content applies to: v4.0 (preview) | Previous versions: v3.1 (GA) v3.0 (GA) v2.1 (GA)

This content applies to: v3.1 (GA) | Latest version: v4.0 (preview) | Previous versions: v3.0 v2.1

This content applies to: v3.0 (GA) | Latest versions: v4.0 (preview) v3.1 | Previous version: v2.1

This content applies to: v2.1 | Latest version: v4.0 (preview)

Note

Form Recognizer is now Azure AI Document Intelligence!

As of July 2023, Azure AI services encompass all of what were previously known as Cognitive Services and Azure Applied AI Services.
There are no changes to pricing.
The names Cognitive Services and Azure Applied AI continue to be used in Azure billing, cost analysis, price list, and price APIs.
There are no breaking changes to application programming interfaces (APIs) or SDKs prior to and including v3.1. Starting from v4.0, APIs and SDKs are updated to Document Intelligence.
Some platforms are still awaiting the renaming update. All mention of Form Recognizer or Document Intelligence in our documentation refers to the same Azure service.

Azure AI Document Intelligence is a cloud-based Azure AI service that enables you to build intelligent document processing solutions. Massive amounts of data, spanning a wide variety of data types, are stored in forms and documents. Document Intelligence enables you to effectively manage the velocity at which data is collected and processed and is key to improved operations, informed data-driven decisions, and enlightened innovation.

| ✔️ Document analysis models | ✔️ Prebuilt models | ✔️ Custom models |

Document analysis models

Document analysis models enable text extraction from forms and documents and return structured business-ready content ready for your organization's action, use, or development.

Read | Extract printed
and handwritten text.

Layout | Extract text, tables,
and document structure.

Read | Extract printed
and handwritten text.

Layout | Extract text, tables,
and document structure.

General document | Extract text,
structure, and key-value pairs.

Prebuilt models

Prebuilt models enable you to add intelligent document processing to your apps and flows without having to train and build your own models.

Invoice | Extract customer and vendor details.

Receipt | Extract sales transaction details.

Identity | Extract verification details.

US mortgage 1003 | Extract loan application details.

US mortgage 1008 | Extract loan transmittal details.

US mortgage disclosure | Extract final closing loan terms.

Health Insurance card | Extract insurance coverage details.

Contract | Extract agreement and party details.

Credit/Debit card | Extract payment card information.

Marriage certificate | Extract certified marriage information.

US Tax W-2 form | Extract taxable compensation details.

US Tax 1098 form | Extract mortgage interest details.

US Tax 1098-E form | Extract student loan interest details.

US Tax 1098-T form | Extract qualified tuition details.

US Tax 1099 form | Extract form 1099 variation details.

US Tax 1040 form | Extract form 1040 variation details.

Invoice | Extract customer
and vendor details.

Receipt | Extract sales
transaction details.

Identity | Extract identification
and verification details.

Health Insurance card | Extract health insurance details.

Business card | Extract business contact details.

Contract | Extract agreement
and party details.

US Tax W-2 form | Extract taxable
compensation details.

US Tax 1098 form | Extract mortgage interest details.

US Tax 1098-E form | Extract student loan interest details.

US Tax 1098-T form | Extract qualified tuition details.

Custom models

Custom models are trained using your labeled datasets to extract distinct data from forms and documents, specific to your use cases.
Standalone custom models can be combined to create composed models.

Extraction models
✔️ Custom extraction models are trained to extract labeled fields from documents.

Custom template | Extract data from static layouts.

Custom neural | Extract data from mixed-type documents.

Custom composed | Extract data using a collection of models.

Classification model
✔️ Custom classifiers identify document types before invoking an extraction model.

Custom classifier | Identify designated document types (classes)
before invoking an extraction model.

Add-on capabilities

Document Intelligence supports optional features that can be enabled and disabled depending on the document extraction scenario. The following add-on capabilities are available for 2023-07-31 (GA) and later releases:

queryFields

Analysis features

Model ID	Content Extraction	Query fields	Paragraphs	Paragraph Roles	Selection Marks	Tables	Key-Value Pairs	Languages	Barcodes	Document Analysis	Formulas*	Style Font*	High Resolution*
prebuilt-read	✓						O	O		O	O	O
prebuilt-layout	✓	✓	✓	✓	✓	✓		O	O		O	O	O
prebuilt-document	✓	✓	✓	✓	✓	✓	✓	O	O		O	O	O
prebuilt-businessCard	✓	✓								✓
prebuilt-contract	✓	✓	✓	✓			O	O	✓	O	O	O
prebuilt-healthInsuranceCard.us	✓	✓						O	O	✓	O	O	O
prebuilt-idDocument	✓	✓						O	O	✓	O	O	O
prebuilt-invoice	✓	✓			✓	✓	O	O	O	✓	O	O	O
prebuilt-receipt	✓	✓						O	O	✓	O	O	O
prebuilt-marriageCertificate.us	✓	✓						O	O	✓	O	O	O
prebuilt-creditCard	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1003	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1008	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.closingDisclosure	✓	✓						O	O	✓	O	O	O
prebuilt-tax.us.w2	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098E	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098T	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1099(variations)	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1040(variations)	✓	✓						O	O	✓	O	O	O
{ customModelName }	✓	✓	✓	✓	✓	✓		O	O	✓	O	O	O

✓ - Enabled
O - Optional
* - Premium features incur extra costs

Models and development options

Note

The following document understanding models and development options are supported by the Document Intelligence service v3.0.

You can use Document Intelligence to automate document processing in applications and workflows, enhance data-driven strategies, and enrich document search capabilities. Use the links in the table to learn more about each model and browse development options.

Read

Screenshot of Read model analysis using Document Intelligence Studio.

Model ID	Description	Automation use cases	Development options
prebuilt-read	● Extract text from documents. ● Data extraction	● Digitizing any document. ● Compliance and auditing. ● Processing handwritten notes before translation.	● Document Intelligence Studio ● REST API ● C# SDK ● Python SDK ● Java SDK ● JavaScript

Model type	Model name
Document analysis model	● Layout analysis model
Prebuilt models	● Invoice model ● Receipt model ● Identity document (ID) model ● Business card model
Custom models	● Custom model ● Composed model

Model	Description	Development options
Layout analysis	Extraction and analysis of text, selection marks, tables, and bounding box coordinates, from forms and documents.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Custom model	Extraction and analysis of data from forms and documents specific to distinct business data and use cases.	● Document Intelligence labeling tool ● REST API ● Sample Labeling Tool ● Document Intelligence Docker container
Invoice model	Automated data processing and extraction of key information from sales invoices.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Receipt model	Automated data processing and extraction of key information from sales receipts.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Identity document (ID) model	Automated data processing and extraction of key information from US driver's licenses and international passports.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Business card model	Automated data processing and extraction of key information from business cards.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container

What is Azure AI Document Intelligence?

Document analysis models

Prebuilt models

Custom models

Add-on capabilities

Analysis features

Models and development options

Read

Layout

General document (deprecated in 2023-10-31-preview)

Invoice

Receipt

Identity (ID)

US mortgage 1003 form

US mortgage 1008 form

US mortgage disclosure form

Health insurance card

Contract model

Credit card model

Marriage certificate model

US Tax W-2 model

US tax 1098 form

US tax 1098-E form

US tax 1098-T form

US tax 1099 (and variations) form

US tax 1040 form

Business card

Custom model overview

Custom template

Custom neural

Custom composed

Custom classification model

Document Intelligence models and development options

Data privacy and security

Next steps

Feedback

Additional resources