Azure Speech-to-Text REST API

Lynn 1 Reputation point
2021-05-07T23:39:04.513+00:00

I'm using Azure Speech-to-text REST API in python to transcribe audio to text. I set the output format to be "detailed" and expect to get multiple results for a input audio. But I only can get one text. Here is the code I ran:

url = "https://" + location + ".stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?{}".format(
urlencode({
"language": language,
"format": "detailed",
"profanity": profanity
}))

Could you let me know how I can get multiple text results for an input audio using Azure Speech-to-text REST API? Thanks.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,416 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Ramr-msft 17,616 Reputation points
    2021-05-10T15:24:33.907+00:00

    @Lynn Thanks for the question. Can you please share the audio input that you are trying, Since the parameter "format=detailed" returns this in NBest list of JSONs.

    Please follow our documentation for batch transcription: Speech service - Azure Cognitive Services | Microsoft Learn

    0 comments No comments