Ingestion Client for the Speech service
The Ingestion Client is a tool released on GitHub that helps you transcribe audio files through the Speech service quickly, with little or no development effort. It wires a dedicated Azure Storage account to custom Azure Functions, which use either the REST API or the SDK to pass transcription requests to the service in a serverless fashion.
Architecture
The tool is for customers who want to gauge transcript quality without investing in development up front. It connects a few resources to transcribe audio files that land in the dedicated Azure Storage container.
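To make the flow concrete, here is a minimal Python sketch of the kind of request a function might prepare when an audio file lands in the container. It is not the Ingestion Client's own code: the function name `build_transcription_request`, the `displayName` value, and the example region and URL are illustrative assumptions. The endpoint path and body fields follow the Speech-to-text REST API v3.0 batch transcription format. The sketch only builds the request and does not send it.

```python
import json
import urllib.request


def build_transcription_request(region, subscription_key, audio_url, locale="en-US"):
    """Build (but do not send) a batch transcription request for one audio file.

    Hypothetical helper for illustration; not part of the Ingestion Client.
    """
    endpoint = (
        f"https://{region}.api.cognitive.microsoft.com"
        "/speechtotext/v3.0/transcriptions"
    )
    body = {
        # URL of the audio blob; in practice usually a SAS URL the service can read.
        "contentUrls": [audio_url],
        "locale": locale,
        "displayName": "ingestion-sketch",  # assumed label, any string works
    }
    return urllib.request.Request(
        endpoint,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": subscription_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_transcription_request(
        "westeurope",
        "<your-speech-key>",
        "https://example.blob.core.windows.net/audio-input/call01.wav",
    )
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would submit the job. A real function would also need to poll for completion and fetch the result files, which this sketch leaves out.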
Internally, the tool uses the Speech-to-text REST API v3.0 (batch transcription) or the Speech SDK, and follows best practices for handling scale-up, retries, and failover. The following schematic describes the resources and connections.
The Getting Started Guide for the Ingestion Client describes how to set up and use the tool.
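The retry handling mentioned above is not spelled out on this page. As a rough illustration only, the following Python sketch shows one common pattern, exponential backoff with jitter. The function name `call_with_retries` and its parameters are hypothetical, and the tool's actual retry logic may differ.

```python
import random
import time


def call_with_retries(operation, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call operation(); on failure, wait with exponential backoff and retry.

    Hypothetical helper for illustration; not taken from the Ingestion Client.
    The sleep function is injectable so the behavior can be tested without waiting.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except Exception:
            # Give up and re-raise once the last attempt has failed.
            if attempt == max_attempts:
                raise
            # The base delay doubles on each attempt; random jitter of up to
            # half the delay keeps many clients from retrying in lockstep.
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 2))
```

A caller would wrap a single service call, for example `call_with_retries(lambda: submit(job))`, where `submit` stands in for whatever function sends the transcription request. Production code would normally retry only on transient errors such as timeouts or throttling responses rather than on every exception.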
Important
Pricing varies depending on the mode of operation (batch versus real time) and on the Azure Function SKU selected. By default, the tool creates a Premium Azure Function SKU to handle large volumes. Visit the Pricing page for more information.
Both the Microsoft Speech SDK and the Speech-to-text REST API v3.0 can be used to obtain transcripts. The choice affects overall cost, as explained in the guide.
Tip
You can use the tool and resulting solution in production to process a high volume of audio.
Tool customization
The tool is built to show customers results quickly. You can customize it to your preferred SKUs and setup. You can edit the SKUs in the Azure portal, and the code itself is available on GitHub.
Note
We suggest creating the resources in the same dedicated resource group to understand and track costs more easily.
Next steps