Hi-6046 asked · romungi-MSFT commented

Speech to text - Diarization Batch API does not work

Hi,

I am using the Speech to Text API v3.0 (endpoint: https://southcentralus.api.cognitive.microsoft.com/speechtotext/v3.0/transcriptions)

I am using the Batch Transcription API since I am working with audio files.
I then retrieve the JSON results, specifically the "display" property from "combinedRecognizedPhrases".

My audio files contain interviews.
I set the property diarizationEnabled to true to distinguish between speakers, but it does not seem to work: I do not see anything in the results that tells me who is speaking.

Does it work with WAV files that have 2 channels?
Do I need to do anything specific?

azure-cognitive-services

@Hi-6046 The API supports diarization and can recognize two speakers on mono-channel recordings. The setup is described in detail in this documentation. Hope this helps.
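
For illustration, here is a minimal sketch of that setup and of where the speaker label is expected to show up. It rests on a few assumptions that should be checked against the v3.0 documentation: that diarization also needs wordLevelTimestampsEnabled set to true, that the speaker number is reported per entry in "recognizedPhrases" (not in "combinedRecognizedPhrases", which only holds the merged text per channel), and that the input is a mono recording. The helper names, the example URL, and the sample result are made up for this sketch; only the request body is built here, and nothing is sent to the service.

```python
import json


def build_transcription_request(content_urls, locale="en-US",
                                display_name="Interview diarization"):
    """Build the JSON body for POST .../speechtotext/v3.0/transcriptions.

    Assumption: v3.0 wants word-level timestamps enabled together with
    diarization, and expects mono audio containing two voices.
    """
    return {
        "contentUrls": list(content_urls),
        "locale": locale,
        "displayName": display_name,
        "properties": {
            "diarizationEnabled": True,
            "wordLevelTimestampsEnabled": True,
        },
    }


def speaker_turns(result):
    """Return (speaker, display text) pairs from a transcription result.

    Assumption: each item in "recognizedPhrases" carries a "speaker"
    number when diarization ran, and "nBest"[0]["display"] holds the text.
    """
    turns = []
    for phrase in result.get("recognizedPhrases", []):
        best = phrase.get("nBest") or [{}]
        turns.append((phrase.get("speaker"), best[0].get("display", "")))
    return turns


# Hypothetical, hand-written result fragment; not real service output.
sample_result = {
    "combinedRecognizedPhrases": [
        {"channel": 0,
         "display": "Hello, thanks for coming in. Thanks for having me."}
    ],
    "recognizedPhrases": [
        {"channel": 0, "speaker": 1,
         "nBest": [{"display": "Hello, thanks for coming in."}]},
        {"channel": 0, "speaker": 2,
         "nBest": [{"display": "Thanks for having me."}]},
    ],
}

if __name__ == "__main__":
    body = build_transcription_request(["https://example.com/interview.wav"])
    print(json.dumps(body, indent=2))
    for speaker, text in speaker_turns(sample_result):
        print(f"Speaker {speaker}: {text}")
```

If the interviews are stereo files with one speaker per channel, the "channel" value on each phrase may already be enough to tell the speakers apart; otherwise, downmixing to mono before submitting is probably the safer route for diarization.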



0 Answers