gma-8880 asked gma-8880 commented

Get facial pose events - Text-to-Speech

Hi!

I see that viseme events are only available for en-US-AriaNeural voice for now.
I was wondering if support for other languages (especially French) is planned in the near future?
And if so, is an approximate date known?

Thanks,
gma

azure-speech

@gma-8880 Thanks for the question. Could you add more details: what type of character do you want to build, 2D or 3D?
Do you need real-time visemes? See the Azure Neural Text-to-Speech blog for the supported languages.
We have forwarded your question to the product team to check on support for other languages.



Hi @ramr-msft,
Do you know where I can find the facial pose that corresponds to each viseme ID, rather than the corresponding phoneme?

I tested and saw that this feature now seems to be available for French voices. Is that correct?

Thank you so much,
gma


1 Answer

gma-8880 answered gma-8880 published

Thanks for your answer, @ramr-msft. I am a developer at a company, and we do indeed want to animate 3D characters. We also need real-time visemes.
To be more precise, we are looking for something like this:

{"time":0,"type":"sentence","start":0,"end":23,"value":"Mary had a little lamb."}
{"time":6,"type":"word","start":0,"end":4,"value":"Mary"}
{"time":6,"type":"viseme","value":"p"}
{"time":73,"type":"viseme","value":"E"}
{"time":180,"type":"viseme","value":"r"}
{"time":292,"type":"viseme","value":"i"}
...
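
As a rough illustration of how a stream in this shape could be consumed on the animation side, here is a minimal Python sketch. It assumes the marks arrive as newline-delimited JSON and that `time` is in milliseconds (the sample does not state the unit). The names `viseme_timeline` and `with_durations` are hypothetical helpers, not part of any SDK.

```python
import json

# The sample marks from above, one JSON object per line.
SAMPLE = """\
{"time":0,"type":"sentence","start":0,"end":23,"value":"Mary had a little lamb."}
{"time":6,"type":"word","start":0,"end":4,"value":"Mary"}
{"time":6,"type":"viseme","value":"p"}
{"time":73,"type":"viseme","value":"E"}
{"time":180,"type":"viseme","value":"r"}
{"time":292,"type":"viseme","value":"i"}
"""


def viseme_timeline(text):
    """Return (time, viseme) pairs for every mark whose type is "viseme"."""
    timeline = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between marks
        mark = json.loads(line)
        if mark.get("type") == "viseme":
            timeline.append((mark["time"], mark["value"]))
    return timeline


def with_durations(timeline):
    """Attach a duration to each viseme: the gap until the next viseme.

    The last viseme has no successor, so its duration is None.
    """
    result = []
    for i, (time, viseme) in enumerate(timeline):
        if i + 1 < len(timeline):
            duration = timeline[i + 1][0] - time
        else:
            duration = None
        result.append((time, viseme, duration))
    return result


if __name__ == "__main__":
    for entry in with_durations(viseme_timeline(SAMPLE)):
        print(entry)
```

Sentence and word marks are ignored here; an animation loop would typically hold each mouth shape for the computed duration, or blend towards the next one.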

Your solution for getting facial pose events may interest us, but we need it for more than just the en-US-AriaNeural voice. In particular, French voices are important to us.
Hence my question about an approximate date for this functionality in other languages.
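
For context, here is a small sketch of how viseme events from the Speech SDK might be reshaped into the line format shown above. It assumes each event carries an audio offset in 100-nanosecond ticks and an integer viseme ID (as far as I know, this is what the SDK's viseme event exposes as `audio_offset` and `viseme_id`). The function `to_mark` is a hypothetical helper, and the block deliberately does not call the SDK itself.

```python
import json

TICKS_PER_MS = 10_000  # 1 tick = 100 ns, so 10,000 ticks = 1 ms


def to_mark(audio_offset_ticks, viseme_id):
    """Format one viseme event as a compact JSON line.

    audio_offset_ticks: offset from the start of the audio, in 100-ns ticks
                        (assumed unit of the event's audio offset).
    viseme_id:          integer viseme identifier from the event.
    """
    mark = {
        "time": audio_offset_ticks // TICKS_PER_MS,  # whole milliseconds
        "type": "viseme",
        "value": viseme_id,
    }
    # Compact separators keep the output in the same style as the sample.
    return json.dumps(mark, separators=(",", ":"))


if __name__ == "__main__":
    # Hypothetical events: (offset in ticks, viseme ID).
    for offset, vid in [(500_000, 0), (1_375_000, 12)]:
        print(to_mark(offset, vid))
```

Note that the value here is a numeric viseme ID rather than a letter such as "p" or "E", so a lookup from ID to mouth shape would still be needed on our side.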

Thanks,
gma
