question

JadeNameless-8330 avatar image
0 Votes"
JadeNameless-8330 asked ramr-msft answered

Word level timestamps for real time transcription

My team needs to synch up words in the transcript with events from another source (button presses, specifically). The final results of transcription have word level timestamps when we use the appropriate config arguments, but intermediate results (associated with Recognizing events) do not. How can we get word level timestamps when doing real time transcription?


dotnet-csharpazure-cognitive-servicesazure-speech
· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

why are 8 people following this question but 0 people are answering?

I am seriously weighing the tradeoffs between paying for a support plan and just switching to amazon transcribe.

@ramr-msft what's the deal.

0 Votes 0 ·

1 Answer

ramr-msft avatar image
0 Votes"
ramr-msft answered

@JadeNameless-8330 Thanks for the question. Can you please share link to the code for transcription and API that you are trying. Please add more details about the intermediate results that you are getting.

Please follow the threads to request word level timestamps in the speech config.
To Generate Timestamps in STT model.


· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Here is the Doc for Format of v3 transcription results.


0 Votes 0 ·