question

DanielNalesnik-5166 avatar image
0 Votes"
DanielNalesnik-5166 asked GiftA-MSFT edited

REST URL for TTS

I'm trying to use Microsoft Azure's Cognitive Services TTS to convert written Chinese into spoken audio. I have a Chinese-language learning app that includes a dictionary of 120,000+ Chinese words.

I was previously using TTS from Baidu, which let me dynamically reference audio with a URL that looked something like this:

https://tsn.baidu.com/text2audio?tex=战(zhan4)线(xian4)&lan=zh&spd=4&tok={my token that was re-generated every month}

This was useful for three reasons:

  1. I could embed this into an HTML audio tag and play it with JavaScript only when (or if!) I ever needed it.

  2. I did not need to store the audio anywhere -- it was always available through the URL.

  3. As the speech engine improved over the years, the quality of my app's audio improved.

With Microsoft Azure, I've figured out how to create TTS audio, but it seems as if the result must always be downloaded and saved in order for me to do anything with it.

Thus:
1) I have to create millions of audio files and then save them (Amazon S3) ahead of time, even if I may never need them.
2) I will never get updates to the voice quality unless I re-run a batch process to re-create all the audio again.

Am I missing something? Is it true that I can't create URLs for audio I want converted on the fly by Azure TTS?

azure-cognitive-services
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered GiftA-MSFT commented

Hi, thanks for reaching out. Is this what you're looking for Text-to-speech REST API? Let us know if this helps your scenario. Thanks.


· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Thanks GiftA, but this is the document I used originally (and which leads me to my questions above). I still have every question I asked above.

0 Votes 0 ·
GiftA-MSFT avatar image GiftA-MSFT DanielNalesnik-5166 ·

Hi, thanks for following up. Upon further review, I discovered that we currently don't support the option to dynamically reference TTS audio via url. I've also reached out to the product group to verify. Will share updates as soon as possible.

0 Votes 0 ·
GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered GiftA-MSFT edited

Hi, quick follow-up. I confirmed with the product group. We don't currently support this scenario. Our recommendation is to output the audio to blob storage (to generate SAS url). For more information, review Azure Blob Storage. Hope this helps.



5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.