文件瞭解和表單處理模型之間的差異Difference between document understanding and form processing models

Microsoft SharePoint Syntex 中的內容瞭解可讓您識別並分類上傳至 SharePoint 文件庫的文件,以及擷取每個檔案的相關資訊。Content understanding in Microsoft SharePoint Syntex allows you to identify and classify documents that are uploaded to SharePoint document libraries, and extract relevant information from each file. 例如,當文件上傳到 SharePoint 文件庫時,所有識別為 [採購單] 的檔案都將被分類,然後顯示在自訂文件庫檢視中。For example, as files are uploaded to a SharePoint document library, all files that are identified as Purchase Orders are classified as such, and then displayed in a custom document library view. 此外,還可以從每個檔案中提取特定資訊(例如,PO 號總額),並將其作為欄顯示在文件庫檢視中。Additionally, you can pull specific information from each file (for example, PO Number and Total) and display it as a column in your document library view.

內容瞭解讓您建立 模型 來識別和擷取所需的資訊。Content understanding lets you create models to identify and extract the information you need. 模型在協助解決搜尋、業務流程、合規性和許多其他方面的業務問題方面具有價值。Models have value in helping to resolve business issues for search, business processes, compliance, and many others.

您可以使用兩種模型類型:There are two model types that you can use:

雖然這兩種模型通常用於相同用途,但以下列出的關鍵差異會影響您可以使用哪種模型。While both models are generally used for the same purpose, the key differences listed below affect which ones you can use.

注意

如需有關表單處理和文件瞭解案例範例的詳細資訊,請參閱 SharePoint Syntex 採用:入門指南See the SharePoint Syntex adoption: Get started guide for more information about form processing and document understanding scenario examples.

結構化、非結構化和半結構化內容Structured versus unstructured and semi-structured content

使用文件瞭解模型從非結構化文件(如信件或合約)中識別和擷取資料,這些文件中要擷取的文字實體位於句子或文件的特定區域中。Use document understanding models to identify and extract data from unstructured documents, such as letters or contracts, where the text entities you want to extract is in sentences or specific regions of the document. 例如,非結構化文件可以是可以用不同方式撰寫的合同續約函。For example, an unstructured document could be a contract renewal letter that can be written in different ways. 不過,資訊會持續存在於每個合同續約文件的本文中,例如文字字串 服務開始日期 後接著實際日期。However, information exists consistently in the body of each contract renewal document, such as the text string Service start date of followed by an actual date.

使用表單處理模型來識別檔案並從結構化或半結構化文件(如表單或發票)中擷取資料。Use form processing models to identify files and extract data from structured or semi-structured documents, such as forms or invoices. 表單處理模型經過訓練,能够從範例文件中瞭解表單的版面配置,並學會查找需要從類似位置擷取的資料。Form processing models are trained to understand the layout of your form from example documents, and learn to look for the data you need to extract from similar locations. 表單通常具有更具結構化的版面配置,其中實體皆位於同一個位置 (例如,在稅務表單中的社會保險號碼)。Forms usually have a more structured layout where entities are in the same location (for example, a social security number in a tax form).

注意

您必須具有内容中心網站的存取權限才能建立文件瞭解模型或將其套用至 SharePoint 文件庫。You must have access to a content center site to create a document understanding model or to apply one to a SharePoint document library.

建立模型的位置Where models are created

文件瞭解模型在 SharePoint 內容中心網站中建立和管理。Document understanding models are created and managed in a SharePoint content center site.

注意

如需有關輸入檔的詳細資訊,請參閱表單處理模型需求和限制For more information about input documents, see Form processing model requirements and limitations.

表單處理模型是在 PowerApps AI Builder中建立,但建立是直接從 SharePoint 文件庫啟動。Form processing models are created in PowerApps AI Builder, but the creation starts directly from a SharePoint document library. 文件庫必須先啟用表單處理模型建立,使用者才能建立表單處理模型。A document library must have form processing model creation enabled before a user can create a form processing model for it. 系統管理員可以在內容瞭解系統管理員設定中啟用表單處理模型建立。Admins can enable form processing model creation in the content understanding admin settings. 表單處理模型使用 PowerAutomate 流程以在文件上傳到文件庫時處理這些檔案。Form processing models use PowerAutomate flows to process files when they're uploaded to the document library.

當您建立文件瞭解模型時,將建立儲存在 SharePoint 內容類型庫中的 新 SharePoint 內容類型When you create a document understanding model, you create a new SharePoint content type that is saved to the SharePoint Content Types gallery. 或者,您可以視需要使用現有內容類型來定義模型。Or you can use existing content types to define your model if needed.

表單處理模型也會建立 新 SharePoint 內容類型,而且也會儲存在 SharePoint 內容類型庫中。Form processing models also create new SharePoint content types, and are also stored in the SharePoint Content Types gallery.

可以套用的位置Where they can be applied

您可以將文件瞭解模型套用至您有權存取的 SharePoint 文件庫。You can apply document understanding models to SharePoint document libraries that you have access to. 使用內容中心建立文件瞭解模型,並將它套用至不同的文件庫。Use the content center to create a document understanding model, and apply it to different document libraries. 内容中心使您能够更集中地控制文件瞭解模型的使用方式和套用位置。The content center gives you a more central control for how document understanding models are used and where they're applied. 注意:此資訊還必須匯總到内容中心。Note this information must also roll up to a content center.

表單處理模型目前只能套用到您建立它們時使用的 SharePoint 文件庫。Form processing models can currently only be applied to the SharePoint document library from which you created them. 這可讓擁有網站存取權的授權使用者建立表單處理模型。This allows licensed users with access to the site to create a form processing model. 請注意,您的系統管理員必須在 SharePoint 文件庫啟用表單處理,以供授權使用者使用之。Note that an admin needs to enable form processing on a SharePoint document library for it to be available to licensed users.

另請參閱See Also

訓練:使用 AI Builder 改善商務效能Training: Improve business performance with AI Builder

文件瞭解概觀Document understanding overview

表單處理概觀Form processing overview

SharePoint Syntex 簡介Introduction to SharePoint Syntex