question

DWinkler-0087 avatar image
0 Votes"
DWinkler-0087 asked DWinkler-0087 commented

Viseme rendering service

The documentation of the TTS viseme callbacks states
"With the help of a 2D or 3D rendering engine, you can use the viseme output to control the animation of your avatar" 2356748

I see that you already provide such functionality for bing, e.g.
https://www.bing.com/search?q=Translate%20house%20to%20Italian

but I couldn't find any public service with an API for the generation of visualizations.

Are you planning to introduce this functionality?


azure-speech
· 4
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@DWinkler-0087 Do you mean the REST API version of the feature? If Yes, this is not supported yet by the service and there is no visibility of rolling out the same through REST API.
I believe the bing service is using the JS SDK to enable the feature with bing search.


1 Vote 1 ·

Thanks a lot for your feedback, I am currently using the .net SDK.
I do receive the events, events but the Animation property is empty
.NET SDK JS SDK

Do you think this is a JS only thing, or is there anything else i have to enable?


0 Votes 0 ·

@DWinkler-0087 I think the animation property is only used to pass the format of file used. The actual animation from viseme events is based on viseme ID and the mapping between viseme events and phonemes. The time based output should help map the viseme ids to SVGs. I could not find a working sample that can help run the animation but this mapping can help build custom animations based on SVGs.

Will certainly lookout for any samples published by speech service team that can be used to build the animation.


1 Vote 1 ·
Show more comments

0 Answers