RaoulW-5169 asked ·

Question on questions

Hi,

I've noticed that for about 50% of the voices, phrasing a sentence as a question has no effect on the intonation. In the example below I used all English voices and had each one say the same sentence with and without a question mark. The problem is mainly in the last word before the question mark. We've tried correcting it with the intonation editor, but in most cases that is very difficult to do.

Please report the issue to the appropriate teams. I have pasted the SSML below for your convenience.

Thanks,

 <!--ID=B7267351-473F-409D-9765-754A8EBCDE05;Version=1|{"VoiceNameToIdMapItems":[{"Id":"e5e4f59b-65c6-42b2-a6e3-5985d1a1ea07","Name":"Microsoft Server Speech Text to Speech Voice (en-US, JennyNeural)","VoiceType":"StandardVoice"},{"Id":"27e2f1c8-cfe0-4324-88e2-cd0bafeffe1b","Name":"Microsoft Server Speech Text to Speech Voice (en-US, AriaNeural)","VoiceType":"StandardVoice"},{"Id":"e0638b39-fbd2-4497-a482-e2f65759412a","Name":"Microsoft Server Speech Text to Speech Voice (en-US, GuyNeural)","VoiceType":"StandardVoice"},{"Id":"e4fbab32-f3f3-4943-b4db-0d8a7469b383","Name":"Microsoft Server Speech Text to Speech Voice (en-GB, LibbyNeural)","VoiceType":"StandardVoice"},{"Id":"2367bbe4-0039-4222-a92a-12b37d66a362","Name":"Microsoft Server Speech Text to Speech Voice (en-GB, MiaNeural)","VoiceType":"StandardVoice"},{"Id":"865ed125-9b77-4022-bf44-142ca2522695","Name":"Microsoft Server Speech Text to Speech Voice (en-GB, RyanNeural)","VoiceType":"StandardVoice"},{"Id":"7db746e5-4da7-41da-8c5a-906f244effb5","Name":"Microsoft Server Speech Text to Speech Voice (en-IE, ConnorNeural)","VoiceType":"StandardVoice"},{"Id":"948c1dbe-75f6-4c3e-a7be-cfa1ecdc2c9c","Name":"Microsoft Server Speech Text to Speech Voice (en-IE, EmilyNeural)","VoiceType":"StandardVoice"},{"Id":"9622ee9e-a68a-444b-ad30-d84db7340f07","Name":"Microsoft Server Speech Text to Speech Voice (en-IN, NeerjaNeural)","VoiceType":"StandardVoice"},{"Id":"637ffba4-caa2-436e-a6d1-fda7a339eafe","Name":"Microsoft Server Speech Text to Speech Voice (en-IN, PrabhatNeural)","VoiceType":"StandardVoice"},{"Id":"f6c86801-3b7f-4cc3-abd7-996a740183fb","Name":"Microsoft Server Speech Text to Speech Voice (en-CA, ClaraNeural)","VoiceType":"StandardVoice"},{"Id":"376a5073-f406-4a9e-bc6f-7c50b23201f8","Name":"Microsoft Server Speech Text to Speech Voice (en-CA, LiamNeural)","VoiceType":"StandardVoice"},{"Id":"40656072-27e0-4599-8cc4-4de109bcb0b1","Name":"Microsoft Server Speech Text to Speech Voice (en-AU, NatashaNeural)","VoiceType":"StandardVoice"},{"Id":"ada99918-f740-4e47-86f7-0d3c8e95c027","Name":"Microsoft Server Speech Text to Speech Voice (en-AU, WilliamNeural)","VoiceType":"StandardVoice"}]}-->
 <speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="en-US"><voice name="Microsoft Server Speech Text to Speech Voice (en-US, JennyNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-US, AriaNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-US, GuyNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-GB, LibbyNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-GB, MiaNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-GB, RyanNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-IE, ConnorNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-IE, EmilyNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-IN, NeerjaNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-IN, PrabhatNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-CA, ClaraNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-CA, LiamNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-AU, NatashaNeural)">Where is my telephone.
 Where is my telephone?</voice>
 <voice name="Microsoft Server Speech Text to Speech Voice (en-AU, WilliamNeural)">Where is my telephone.
 Where is my telephone?</voice></speak>
azure-cognitive-services azure-speech

@RaoulW-5169 Thanks for the feedback. We will pass this on to our team so they can look into improving the voices. I have also reviewed these cases and adjusted some of them to find the best fit. I am attaching a file with the details so you can check whether it works for you as a workaround.

77620-question-on-questions-test.txt



1 Answer

RaoulW-5169 answered ·

Thanks for the help. Your results for custom intonation are the same as mine. In some cases it works quite well, in others it does not. But of course it would be best for this to happen automatically when a question mark is added.
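
For readers who have not tried custom intonation, here is a minimal sketch of what such a manual adjustment can look like in SSML, assuming the standard `prosody` element with its `contour` attribute. The contour values are illustrative assumptions, not the ones from the attached file or from either poster, and they would likely need tuning for each voice.

```xml
<speak xmlns="http://www.w3.org/2001/10/synthesis" version="1.0" xml:lang="en-US">
  <voice name="Microsoft Server Speech Text to Speech Voice (en-US, JennyNeural)">
    <!-- Assumption: illustrative contour values, not taken from the thread.
         Each pair is (position within the wrapped text, pitch change):
         a small offset at the midpoint of the word, a larger rise at its end. -->
    Where is my
    <prosody contour="(50%,+5Hz) (100%,+40Hz)">telephone?</prosody>
  </voice>
</speak>
```

Only the last word is wrapped, since that is where the missing question intonation was reported. Whether a rise of this size sounds natural is something to judge by ear for each voice.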
