Test recognition quality of a Custom Speech model
You can inspect the recognition quality of a Custom Speech model in the Speech Studio. You can play back uploaded audio and determine if the provided recognition result is correct. After a test has been successfully created, you can see how a model transcribed the audio dataset, or compare results from two models side by side.
Tip
You can also use the online transcription editor to create and refine labeled audio datasets.
Create a test
Follow these instructions to create a test:
Sign in to the Speech Studio.
Navigate to Speech Studio > Custom Speech and select your project name from the list.
Select Test models > Create new test.
Select Inspect quality (Audio-only data) > Next.
Choose an audio dataset that you'd like to use for testing, and then select Next. If there aren't any datasets available, cancel the setup, and then go to the Speech datasets menu to upload datasets.
Choose one or two models to evaluate and compare accuracy.
Enter the test name and description, and then select Next.
Review your settings, and then select Save and close.
Note
When testing, the system will perform a transcription. This is important to keep in mind, as pricing varies per service offering and subscription level. Always refer to the official Azure Cognitive Services pricing - Speech service for the latest details.
Side-by-side model comparisons
When the test status is Succeeded, select the test item name to see details of the test. This detail page lists all the utterances in your dataset, and shows the recognition results of the two models you are comparing.
To help inspect the side-by-side comparison, you can toggle various error types including insertion, deletion, and substitution. By listening to the audio and comparing recognition results in each column (showing human-labeled transcription and the results of two speech-to-text models), you can decide which model meets your needs and where improvements are needed.
Side-by-side model testing is useful to validate which speech recognition model is best for an application. For an objective measure of accuracy, requiring transcribed audio, see Test model quantitatively.
Next steps
Povratne informacije
Pošalјite i prikažite povratne informacije za