Is it possible to stream Groq LLM responses into Azure TTS as they arrive?
Hi! I'm trying to build a real-time LLM conversation bot and need its latency to be as low as possible. I have successfully set up TTS audio output streaming…
How can I use multiple mstts:audioduration elements in a single <speak>?
I'm trying to adjust the duration of individual phrases so that the synthesized voice matches the voice in the original audio. It works perfectly when done like this: <speak xmlns="http://www.w3.org/2001/10/synthesis"…
Is there any way to dub audio while maintaining its original intonation, breaks, and speed?
I have a voice recording with many deeper and higher tones, plus breaks and word emphasis at specific moments, but when I use the "Speech Translation" functionality, the audio loses all of its life (all this complexity),…
Error while trying to train a 20240228 Whisper Large v2 baseline model
When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?
Low Confidence level of Language Identification
Hi, I was testing this file, which is in English, and somehow language identification returned a Low confidence level for the en-US locale. I used both the continuous and recognize-once options. Are there options I can set to always ensure…
Do Text to Speech (TTS) containers provide visemes and blendshapes like the API?
I'm currently using the Speech API and consuming the visemes and blendshapes that are returned. In an effort to reduce latency, I would like to run the speech services locally via the text to speech container. Does the response of the container STT…
How can I use AzureTextToSpeech in PowerApps?
I selected the connector and put a button on the canvas. In the button's OnSelect I placed the following code: Set( _myOutput, AzureTexttospeech.ConvertTextToSpeech("en-US-JennyNeural", 'AddressInput.Language'.'en-US', TextOut.Text)); The…
OpenSSL Issue When Running Azure Speech to Text on Docker
Hey folks, I'm trying to run speech-to-text using Python in a Docker container, but I'm getting an SSL error. I have tried following the steps mentioned here for SSL setup and also installed the required dependencies as mentioned here. However, I'm still…
Exception while running Azure Speech to text SDK with jar file (UnsatisfiedLinkError, setTempDirectory)
Hi team, I'm getting errors while running my Java jar on Windows and CentOS 7; however, the same code runs fine in my Eclipse IDE. The issue occurs when I build the jar and run it in those environments. The error details are below: 2024-05-01 15:50:10…
Customizing a Conversation Model for a Hebrew Car Sales Call Center
Hi, I am looking for guidance on the process of customizing a model to transcribe conversations in a Hebrew car service call center. The conversations predominantly involve Hebrew-specific domain terms and non-dictionary words. Could you provide some…
Why am I getting a quota error?
I'm using Azure TTS and getting the following quota error: "You have reached the quota with your free-tier (F0) Speech resource. To continue to create audios with neural voices, switch to a standard paid resource, or upgrade your free-tier…
Speech Recognition live transcription not detecting any language other than English
Hi, I am using the Speech Recognition resource in my application for live transcription. It works perfectly with English, but when I try speaking in Hindi, it is not detected. I want to build my application for multiple languages used in…
Three errors occur when using the TTS SDK on Android
Hello, the Android version of our app uses Microsoft's TTS SDK "com.microsoft.cognitiveservices.speech:client-sdk:1.34.0", but 3 errors appear frequently: Error 1: {CancellationReason:Error ErrorCode: ServiceTimeout ErrorDetails:USP error: timeout…
I am happy with the results in "Speech Studio" for a sample wav file. How do I scale this up to longer files?
I have run a 1-minute wav file through the Speech Studio sample process and am pleased with the result. I can't figure out how to move forward in the system to process larger speech files. One branch seems to take me into a training setting where I…
zh-CN-XiaochenNeural Abnormal timbre
zh-CN-XiaochenNeural has an abnormal timbre. The same problem occurred in October last year: https://learn.microsoft.com/en-us/answers/questions/1431823/the-timbre-of-the-voice-of-zh-cn-xiaochenneural-ha How long will it take to recover…
Azure Pronunciation Assessment recognition offset lag
I'm using the Pronunciation Assessment with the recognizeOnceAsync method. We are presenting a word for assessment and measuring the response time. Sometimes the offset returned with the recognition corresponds closely with the time reported from the…
Speech Synthesis Language Hebrew not working
Hey, I am reaching out about an issue I have encountered with the speech Synthesis Language (microsoft.cognitiveservices.speech.sdk) functionality in JavaScript. I have noticed that when attempting to use the Hebrew language code (he-IL) for…
Do I have to be on GovCloud in order to connect/use Azure Speech Services hosted on GovCloud US Virginia?
Hi. I am working with a cloud provider's solution that is located in the Amazon us-east-2 region. I am hoping you can help confirm whether the Azure Cognitive STT and TTS integration will/should work with Azure Speech Services hosted on GovCloud US Virginia? …
Azure Speech service bot working in Firefox
Firefox can’t establish a connection to the server at wss://centralindia.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?language=en-US&format=simple&Ocp-Apim-Subscription-
Error when calling "Audio Content Creation" in Speech Service
I (global admin) have assigned the "Cognitive Services Speech Contributor" role to our developer in the Speech service. When he chooses "Audio Content Creation", he gets the message "The role you've assigned for the resource [...] has not…