question

PeterMurphy-2264 avatar image
0 Votes"
PeterMurphy-2264 asked cloudstarter-2058 answered

How to improve audio sound quality for Cognitive Services - Text to Speech

Hi there,

I'm learning the Text to Speech service and am trying to improve the sound quality.

My code is here ...
https://gist.github.com/risingBirdSong/6568ba42f13b4eeda6deb1f09e8b38f7

the stock quality using

"X-Microsoft-OutputFormat": "riff-8khz-8bit-mono-alaw",

is poor sound quality...

however when i switch to another option (listed here)

https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-text-to-speech

my audio file (mp3) won't play on and i get an error that the file is corrupt.

Do you have any suggestions for improving the sound quality? thank you :)





azure-cognitive-services
· 1
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Can you try following the steps provided in Quickstart: Convert text-to-speech using Node.js?


0 Votes 0 ·
GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered GiftA-MSFT edited

Please refer to this sample for converting text to speech using Node.js. Thanks.


5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

cloudstarter-2058 avatar image
0 Votes"
cloudstarter-2058 answered

In the case of Python, by using the RIFF format (stated in there), you can get better quality file.


5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.