Azure Speech SDK Viseme Audio Offset

현우 오 181 Reputation points
2021-09-09T11:46:35.763+00:00

Hello!

I'm using Azure TTS Viseme events.
(https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-speech-synthesis-viseme?pivots=programming-language-csharp)
When using Azure tts viseme SDK, the audio offset of viseme seems to be wrong.
Some viseme comes before the real audio and some viseme comes after the real audio.
In start part of every sentence is more critical to use.

https://drive.google.com/file/d/1f3I5Lny2iJNv49bi-rG2dsZxcyygZ17u/view?usp=sharing

I attach a image file that summarizes the viseme problems that have been identified so far.

Is there any way to align these viseme audio offset to real audio?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,391 questions
{count} votes