import user_config_helper
not able to find library to import user_config_helper import user_config_helper
Regarding usage cost calculation using Azure Retail Price API
We are using Azure subscription with the Standard Tier. We have a requirement to calculate the monthly usage cost in JPY (Japanese Yen) of the Azure Speech to Text service and Azure Blob Storage in our application. we analyzed the Azure Retail Price API…
Azure TTS batch synthesis activity logs
Hi there, we're using Azure speech synthesis (batch, since we have content over 10mins). In the Azure Portal, I can see metrics for my speech resource but I can't see any records of past jobs. Is there any way to see these? Thanks, Tim
Multilingual voice mispronounces Ukrainian as Russian
How can I resolve the issue of multilingual voices pronouncing Ukrainian as Russian when using Text to Speech with the Microsoft.CognitiveServices.Speech package in C#? Explicitly specifying the language in the code through the SpeechSynthesisLanguage…
Phonemes are not available for pronunciation recognition in french
On the result of the pronunciation recognition, if we set to "en-US", we have all the results for the phonemes spoken/matches. As below. "Phonemes": [ { "Phoneme":…
Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail
I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…
Markdown to SSML ?
Does anyone know of a basic "preparer-converter" that takes a markdown (.md) file and converts it into an SSML file?
Issue with speech-to-text service
While converting the given wave file from Speech-to-Text using Microsoft's Speech-to-Text service, it is not detecting "No" at 57th second in this file but detecting at 1:12 min and in other places. Speech recognised is as follow RECOGNIZED:…
Cognitive services pronunciation assessment always gives 100% score, even with badly pronounced words
I built a svelte (javascript) application that uses the microsoft speech sdk (v1.36), and i am using it to evaluate pronunciation in 3 languages: english, german and french. Initially i was using RecognizeOnceAsync() which waits for silence at the end of…
Inquiry Regarding Azure AI Speech Error
Dear Azure Support Team I recently encountered an issue while using Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…
Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS
Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…
Batch TTS with REST: YourSynthesisId and other intro questions
I got the REST API to work on macos. Yeah!!! I could hear the output from the sample code. Alas, now I would like to submit a longer document I wrote to batch TTS and post it as my podcast. I am taking the example right off the webpage, and just…
azure prononciation assessment time limit
i am using azure prononciation assessment to assess an audio , but the problem the assessment happens only for the 1 min of the speech and it doesnt assess the rest of the audio this is my code const sdk =…
Can you add a phrase list to the CallMediaRecognizeSpeechOptions class when using speech-to- text cognitive services from azure communications service
I am using ACS to access a multi-service Cognitive Services endpoint and doing recognition of speech input in real time via acs/telephone. I am using the default model provided by Microsoft. This is sufficient in most case but I have some place names…
Is it possible to specify in Speech SDK to always use "lbs" instead of "£" when "pounds" is recognized?
Hi, is it possible somehow to configure speech sdk in a way when word "pound" is detected that it is always meant to be lbs, not £, for example when I say, "99 pounds" it is detected as "99 lbs", but if I said, "100…
here i cannot find To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model.
i cannot find custom avatar key after sign in to the speech studio .
How to use an Microsoft Entra ID to authenticate with the Speech to text REST API (for batch transcription
I looks like you can only authenticate to the "Speech to text REST API" with a api key (Ocp-Apim-Subscription-Key). What we would like is to authenticate with a Microsoft Entra ID. Why? Our application is running a AKS and all our containers…
How to output transcription on a word-level
With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…
Set sound threshold for microsoft speech-to-text
Hi, It is possible setting a volume-threshold for the speech that gets transcribed? Such that if the speech is below a certain threshold then it would not get transcribed. I am using the speechSDK Br, Daniel
macos cli starter guide
I am trying to play around with azure text to speech on macos. the instructions are woefully incomplete. I start with…