Windows Vision Skills (預覽)Windows Vision Skills (Preview)

注意

針對發行前產品的部分相關資訊,在產品正式發行時可能會有大幅修改。Some information relates to pre-released product, which may be substantially modified before it's commercially released. 針對此處提供的資訊,Microsoft 不做任何明示或默許的擔保。Microsoft makes no warranties, express or implied, with respect to the information provided here.

實作和整合有效率的機器學習與電腦視覺解決方案,對於開發人員是一件困難的工作。Implementing and integrating efficient machine learning and computer vision solutions is a hard task for developers. 產業正以快速的步調前進,而量身打造的自訂解決方案不斷推陳出新,讓應用程式開發人員要跟上就費盡心力。The industry is moving at a fast pace and the amount of custom-tailored solutions coming out makes it strenuous for application developers to keep up. 現有的 API 和低階架構讓開發人員在有效率利用這些工具之前,先面臨到陡峭的學習曲線。Existing APIs and lower-level frameworks add a steep learning curve before developers can leverage them effectively.

Windows Vision Skills 架構旨在讓利用電腦視覺更加容易。The Windows Vision Skills framework is meant to make it easier to utilize computer vision. 它會將電腦視覺模組放置到 Windows 應用程式 (在本機裝置上執行) 內使用的方式標準化。It standardizes the way computer vision modules are put to use within a Windows application, running on the local device. 它會使用標準化的基本型別,將複雜度精簡到單一程式設計範例,協助開發人員聚焦在建置絕佳的電腦視覺應用程式。It abstracts complexities into a single programming paradigm with standardized primitives, helping developers focus on building awesome computer vision applications.

Windows Vision Skills 如何適配開發堆疊的圖表;從最底層開始 (GPU、CPU、VPU 等等);上面是硬體加速架構 (DirectX、DirectML 等等);下一層是 Windows Vision Skills API,包含 Windows API 和第三方架構;最上層包含 UWP、.NET Core 和 Win32 應用程式

在實作中包含著複雜的詳細資料,並由可延伸的 WinRT API 加以封裝,而該 API 則繼承了 Microsoft.AI.Skills.SkillInterfacePreview 命名空間中的基底類別和介面。The implementation that contains the complex details is encapsulated by an extensible WinRT API that inherits the base classes and interfaces in the Microsoft.AI.Skills.SkillInterfacePreview namespace. 這個 API 可以由所有類型的 Windows 應用程式 (UWP、.NET Core 和 Win32) 擷取。This API can be ingested by all types of Windows apps (UWP, .NET Core, and Win32). 此架構開放給所有開發人員用來當作建置的基礎。This framework is open for all developers to build on top of.

何謂技能**?What is a skill?

在 Windows Vision Skills 的內容中,技能是簡化、模組化的程式碼片段,可以處理輸入並且產生輸出。In the context of Windows Vision Skills, a skill is a streamlined, modular piece of code that processes input and produces output. 技能可以使用單一目的微技能來封裝簡單的功能 (例如邊緣偵測),或者封裝豐富集合的功能,形成案例技能,來解決像是骨架偵測的複雜問題。A skill can encapsulate simple functionalities like edge detection with a single-purpose micro-skill, or rich sets of functionalities forming a scenario skill to address a complex problem like skeletal detection.

優點Benefits

  • 簡單整合:技能可以讓使用立即可用 API 將功能新增到您的應用程式變得更加容易,不需要任何機器學習或電腦視覺專業知識,或者先有低階 API 的 Windows 知識。Simple integration: Skills make it easy to add features to your application with out-of-the-box APIs that don’t require any machine learning or computer vision expertise, or prior Windows knowledge of low-level APIs.

  • 精簡硬體加速:Windows Vision Skills 架構會查詢硬體資產,並且提供 OS 佈建,讓開發人員在執行階段進行有效率的計算決策。Abstracting hardware acceleration: The Windows Vision Skills framework queries the hardware assets and provides OS provisions that allow developers to make efficient compute decisions at runtime.

  • 互通性:架構與 OS 介面和資產 (例如來自相機、相片和影片的影像) 搭配運作,並且可與 Windows Machine Learning API 搭配使用。Interoperability: The framework works with OS interfaces and assets such as image primitives from cameras, photos, and video, and it can be used in conjunction with the Windows Machine Learning APIs.

  • NuGet 套件:Windows Vision Skills 嚴密地建立版本以減緩反覆項目,而不中斷現有應用程式。NuGet packages: Windows Vision Skills is strongly versioned to ease iteration without breaking existing applications. 易於内嵌、方便更新,並且透過授權保護智慧財產權。They are easy to ingest, easy to update, and they preserve intellectual property through licensing.

  • 擴充性: :架構可以輕易地延伸以便與現有機器學習架構和程式庫 (例如 OpenCV) 搭配運作。Extensibility: The framework can be easily extended to work with existing machine learning frameworks and libraries such as OpenCV.

  • 模組化:技能可以在應用程式內結合起來,就像是解決複雜案例的配方。Modularity: Skills can be pieced together in succession within an application just like a recipe to address a complex scenario. 技能也可以由開發人員在單一套件中繫結在一起。Skills can also be bundled together in a single package by the developer.

雖然此預覽著重在視覺導向的案例和基本型別,而 API 是要容納大範圍的輸入和輸出變數 (進行音訊處理、文字處理等等)。While this preview focuses on vision-oriented scenarios and primitives, the API is meant to accommodate a wide range of input and output variables that enable audio processing, text processing, and more.

請參閱See also

注意

使用下列資源取得 Windows Vision Skills 的說明:Use the following resources for help with Windows Vision Skills: