question

Oxueillirep-5637 asked romungi-MSFT edited

TTS - pronunciation of words in a foreign language

I am using de-DE-ConradNeural and de-DE-KatjaNeural to synthesize German text. They seem to recognize some English words and pronounce them properly in English, but not all.

Using the SSML tag <lang xml:lang='en-US'>xxx</lang>, it seems to be possible to force Conrad and Katja to pronounce terms in English. But this "trick" does not seem to work at all with French or Italian words: Conrad and Katja ignore <lang xml:lang='fr-FR'> or <lang xml:lang='it-IT'> and read the foreign words as if they were German.
Is this behaviour correct, and is there any workaround other than spelling the words in IPA phonetics, which is very clumsy to handle?

azure-cognitive-services azure-speech

1 Answer

romungi-MSFT answered romungi-MSFT edited

@Oxueillirep-5637 I think you are using the right tag to include a foreign language in your SSML, as this scenario is covered by the service under adjusting speaking languages. However, per the documentation, the only voice that seems to support it is en-US-JennyMultilingualNeural.

In fact, you could simply try en-US-JennyMultilingualNeural and check if it works for you, because it supports 13 secondary languages. Please check this blog for more details. You could also build your own custom voice on the multi-lingual, multi-speaker & universal model with custom neural voice by submitting the intake form.
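To illustrate the suggestion, here is a minimal Python sketch that assembles an SSML document for en-US-JennyMultilingualNeural, with each segment wrapped in its own <lang> element. The helper name `build_ssml` and the sample sentence are made up for this example, and whether fr-FR and it-IT are among the supported secondary languages should be checked against the current documentation. It only builds the string; no call to the service is made.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(voice, segments, doc_lang="en-US"):
    """Build an SSML string for one voice.

    segments is a list of (locale, text) pairs that are spoken in order.
    build_ssml is a hypothetical helper, not part of any SDK.
    """
    body = "".join(
        f"<lang xml:lang={quoteattr(locale)}>{escape(text)}</lang>"
        for locale, text in segments
    )
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(doc_lang)}>"
        f"<voice name={quoteattr(voice)}>{body}</voice>"
        "</speak>"
    )


# German main text with one French and one Italian term (sample data).
ssml = build_ssml(
    "en-US-JennyMultilingualNeural",
    [
        ("de-DE", "Zum Nachtisch gibt es "),
        ("fr-FR", "crème brûlée"),
        ("de-DE", " und danach einen "),
        ("it-IT", "espresso"),
        ("de-DE", "."),
    ],
)
print(ssml)
```

The resulting string can then be submitted through whatever SSML entry point you already use for synthesis.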

Hope this helps.




Thanks @romungi-MSFT, there is one drawback though: while Katja will recognize (most) English words and pronounce them accordingly, JennyMultilingual will pronounce everything in the main language (German in my case), so every single English term needs to be marked as English. But for a text mixed with words from different languages, Jenny currently seems to be the best option.
How about a male colleague for JennyMultilingual? Is anything coming in the near term?
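Since every foreign term has to be marked by hand, one possible way to reduce the effort is to tag terms from a word list automatically. The sketch below is only an illustration under stated assumptions: `tag_foreign_terms` and the glossary contents are hypothetical, matching is case-sensitive and limited to whole words, and the output is an SSML fragment that still has to be placed inside the main-language <lang> element of the voice.

```python
import re
from xml.sax.saxutils import escape


def tag_foreign_terms(text, glossary):
    """Wrap every glossary term found in text in an SSML <lang> element.

    glossary maps a term to its locale, e.g. {"Download": "en-US"}.
    Hypothetical helper for illustration; returns an SSML fragment.
    """
    if not glossary:
        return escape(text)
    # Try longer terms first so a multi-word term wins over its prefix.
    terms = sorted(glossary, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b"
    )
    out, pos = [], 0
    for match in pattern.finditer(text):
        term = match.group(1)
        out.append(escape(text[pos:match.start()]))
        out.append(f'<lang xml:lang="{glossary[term]}">{escape(term)}</lang>')
        pos = match.end()
    out.append(escape(text[pos:]))
    return "".join(out)


fragment = tag_foreign_terms(
    "Der Download startet nach dem Update.",
    {"Download": "en-US", "Update": "en-US"},
)
print(fragment)
```

A fixed glossary will miss terms it does not contain, so this only helps for recurring vocabulary.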


@Oxueillirep-5637 This is currently in preview, so you can expect more voices once it goes to GA, but that depends mostly on the feedback on the feature.
You would definitely see an announcement or blog post if any new multilingual voices become available. Thanks.
