I'm using the speech synthesis feature. Is there a limit on the length of the input text? Also, can I change the synthesized voice, for example to a male, female, young, or mature voice?
@Kakasoft-7728 The limits on the size of the SSML message and other limits related to text to speech are documented here. Yes, you can choose any voice offered for the selected language, including neural voices. You can also adjust the rate, pitch, intonation, and other parameters to make the audio sound the way you like. An easy way to try this is the Audio Content Creation tool in Speech Studio, or you can try the lite version for free. I hope this helps. Thanks.
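To illustrate the voice and prosody options mentioned in the reply above, here is a minimal sketch that builds an SSML message as a string. The helper name `build_ssml` and its parameters are hypothetical, and the voice names shown (`en-US-JennyNeural`, `en-US-GuyNeural`) are only examples; check the voice list for your language and region before relying on them.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, voice="en-US-JennyNeural", lang="en-US",
               rate="medium", pitch="medium"):
    """Return an SSML document that selects a voice and sets rate and pitch.

    Hypothetical helper: the voice name and prosody values are assumptions,
    not settings taken from the original thread.
    """
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice)}>"
        f"<prosody rate={quoteattr(rate)} pitch={quoteattr(pitch)}>"
        f"{escape(text)}"  # escape &, <, > so the SSML stays well-formed
        "</prosody></voice></speak>"
    )


if __name__ == "__main__":
    # A slower, slightly lower-pitched male voice (example values only).
    print(build_ssml("Hello, world.", voice="en-US-GuyNeural",
                     rate="-10%", pitch="-5%"))
```

The resulting string can be passed to whichever SSML synthesis call your SDK provides. Switching between male, female, or differently styled voices is then a matter of changing the `voice` argument.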
Thank you very much for your reply. What should I do if I want to save the synthesized audio locally? I noticed that the SDK exposes audioData, but will that cause problems when the audio file is large?
@Kakasoft-7728 You can save the file locally. The steps to do so with the SDK are documented here. The maximum audio length produced per request is 10 minutes; this limit is also documented here.
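As a rough sketch of the points in the reply above, the snippet below writes synthesized audio bytes to a local file and splits long text into smaller pieces so that each request can stay under the per-request audio limit. It assumes the SDK hands back the audio as a `bytes` object (the audioData mentioned in the question). The helper names `save_audio` and `split_text` and the `max_chars` value are hypothetical; `max_chars` is not a documented limit, so tune it against the documented quotas.

```python
import re
from pathlib import Path


def save_audio(audio_data, path):
    """Write raw audio bytes to `path` and return the number of bytes written."""
    out = Path(path)
    out.write_bytes(audio_data)
    return out.stat().st_size


def split_text(text, max_chars=2000):
    """Split text at sentence boundaries into chunks of at most `max_chars`.

    A single sentence longer than `max_chars` is kept whole as its own chunk.
    `max_chars` is an assumed value, not an official service limit.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for i, chunk in enumerate(split_text("One. Two! Three? Four.", max_chars=10)):
        # In real use, synthesize `chunk` with the SDK and pass the resulting
        # bytes to save_audio; placeholder bytes are written here instead.
        save_audio(chunk.encode("utf-8"), f"part_{i}.bin")
```

Because each request is capped at 10 minutes of audio, the in-memory buffer stays bounded. For example, assuming 16 kHz, 16-bit mono PCM, 10 minutes is 16,000 × 2 × 600 = 19,200,000 bytes, or roughly 19 MB.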