1,438 questions with Azure AI Speech tags

Sort by: Updated
0 answers

import user_config_helper

not able to find library to import user_config_helper import user_config_helper

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-22T08:42:34.7766667+00:00
AT13519148 0 Reputation points
commented 2024-05-22T09:42:28.5833333+00:00
dupammi 7,225 Reputation points Microsoft Vendor
0 answers

Regarding usage cost calculation using Azure Retail Price API

We are using Azure subscription with the Standard Tier. We have a requirement to calculate the monthly usage cost in JPY (Japanese Yen) of the Azure Speech to Text service and Azure Blob Storage in our application. we analyzed the Azure Retail Price API…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
Azure Cost Management
Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
2,114 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,481 questions
asked 2024-05-21T12:27:41.82+00:00
Test Admin 171 Reputation points
edited a comment 2024-05-22T07:02:03.99+00:00
Test Admin 171 Reputation points
1 answer

Azure TTS batch synthesis activity logs

Hi there, we're using Azure speech synthesis (batch, since we have content over 10mins). In the Azure Portal, I can see metrics for my speech resource but I can't see any records of past jobs. Is there any way to see these? Thanks, Tim

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-21T18:11:34.5866667+00:00
Tim Schmidt 0 Reputation points
edited the question 2024-05-22T05:44:09.7+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
0 answers

Multilingual voice mispronounces Ukrainian as Russian

How can I resolve the issue of multilingual voices pronouncing Ukrainian as Russian when using Text to Speech with the Microsoft.CognitiveServices.Speech package in C#? Explicitly specifying the language in the code through the SpeechSynthesisLanguage…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-19T06:44:50.0966667+00:00
Serhii Kapin 0 Reputation points
commented 2024-05-22T05:01:42.1233333+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
1 answer

Phonemes are not available for pronunciation recognition in french

On the result of the pronunciation recognition, if we set to "en-US", we have all the results for the phonemes spoken/matches. As below. "Phonemes": [ { "Phoneme":…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-18T21:38:28.0266667+00:00
GOMES-ALVES-DOS-SANTOS Bruna 0 Reputation points
commented 2024-05-22T04:33:55.92+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
1 answer

Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail

I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-18T19:20:53.55+00:00
ivo welch 20 Reputation points
commented 2024-05-21T07:40:17.86+00:00
dupammi 7,225 Reputation points Microsoft Vendor
1 answer

Markdown to SSML ?

Does anyone know of a basic "preparer-converter" that takes a markdown (.md) file and converts it into an SSML file?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-18T19:23:52.8433333+00:00
ivo welch 20 Reputation points
commented 2024-05-21T07:39:27.8+00:00
dupammi 7,225 Reputation points Microsoft Vendor
1 answer

Issue with speech-to-text service

While converting the given wave file from Speech-to-Text using Microsoft's Speech-to-Text service, it is not detecting "No" at 57th second in this file but detecting at 1:12 min and in other places. Speech recognised is as follow RECOGNIZED:…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-16T06:18:15.6466667+00:00
Vidyadhar Busam 0 Reputation points
commented 2024-05-21T05:52:29.9766667+00:00
Vidyadhar Busam 0 Reputation points
0 answers

Cognitive services pronunciation assessment always gives 100% score, even with badly pronounced words

I built a svelte (javascript) application that uses the microsoft speech sdk (v1.36), and i am using it to evaluate pronunciation in 3 languages: english, german and french. Initially i was using RecognizeOnceAsync() which waits for silence at the end of…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-17T10:49:24.3766667+00:00
Schoolblocks 0 Reputation points
commented 2024-05-20T09:06:04.1166667+00:00
Schoolblocks 0 Reputation points
0 answers

Inquiry Regarding Azure AI Speech Error

Dear Azure Support Team I recently encountered an issue while using Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-15T12:18:25.3266667+00:00
y.ashibe 0 Reputation points
commented 2024-05-20T07:34:10.74+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
0 answers

Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS

Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-16T14:22:18.8166667+00:00
Verbari LLC 0 Reputation points
commented 2024-05-20T05:46:20.76+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Batch TTS with REST: YourSynthesisId and other intro questions

I got the REST API to work on macos. Yeah!!! I could hear the output from the sample code. Alas, now I would like to submit a longer document I wrote to batch TTS and post it as my podcast. I am taking the example right off the webpage, and just…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-16T21:32:09.3333333+00:00
ivo welch 20 Reputation points
accepted 2024-05-18T19:09:24.1933333+00:00
ivo welch 20 Reputation points
0 answers

azure prononciation assessment time limit

i am using azure prononciation assessment to assess an audio , but the problem the assessment happens only for the 1 min of the speech and it doesnt assess the rest of the audio this is my code const sdk =…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-17T11:36:28.55+00:00
Iheb Jandoubi 5 Reputation points
commented 2024-05-17T18:28:20.5166667+00:00
romungi-MSFT 42,761 Reputation points Microsoft Employee
1 answer

Can you add a phrase list to the CallMediaRecognizeSpeechOptions class when using speech-to- text cognitive services from azure communications service

I am using ACS to access a multi-service Cognitive Services endpoint and doing recognition of speech input in real time via acs/telephone. I am using the default model provided by Microsoft. This is sufficient in most case but I have some place names…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
Azure Communication Services
Azure Communication Services
An Azure communication platform for deploying applications across devices and platforms.
708 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,444 questions
asked 2024-05-17T08:49:02.57+00:00
John 0 Reputation points
answered 2024-05-17T14:33:27.65+00:00
romungi-MSFT 42,761 Reputation points Microsoft Employee
0 answers

Is it possible to specify in Speech SDK to always use "lbs" instead of "£" when "pounds" is recognized?

Hi, is it possible somehow to configure speech sdk in a way when word "pound" is detected that it is always meant to be lbs, not £, for example when I say, "99 pounds" it is detected as "99 lbs", but if I said, "100…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-04-23T08:38:47.7566667+00:00
Faris Lemes 20 Reputation points
commented 2024-05-17T12:58:40.3666667+00:00
Faris Lemes 20 Reputation points
1 answer

here i cannot find To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model.

i cannot find custom avatar key after sign in to the speech studio .

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-16T11:23:22.97+00:00
Praveen Jaganivasan 0 Reputation points
commented 2024-05-17T12:01:04.6633333+00:00
santoshkc 5,080 Reputation points Microsoft Vendor
1 answer

How to use an Microsoft Entra ID to authenticate with the Speech to text REST API (for batch transcription

I looks like you can only authenticate to the "Speech to text REST API" with a api key (Ocp-Apim-Subscription-Key). What we would like is to authenticate with a Microsoft Entra ID. Why? Our application is running a AKS and all our containers…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-10T13:45:07.2666667+00:00
Johan Klijn 41 Reputation points
commented 2024-05-17T12:00:35.7433333+00:00
navba-MSFT 17,405 Reputation points Microsoft Employee
1 answer

How to output transcription on a word-level

With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-17T08:41:50.08+00:00
Sophie 0 Reputation points
answered 2024-05-17T09:11:07.3166667+00:00
Gowtham CP 1,090 Reputation points
1 answer One of the answers was accepted by the question author.

Set sound threshold for microsoft speech-to-text

Hi, It is possible setting a volume-threshold for the speech that gets transcribed? Such that if the speech is below a certain threshold then it would not get transcribed. I am using the speechSDK Br, Daniel

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2023-05-19T11:26:14.9833333+00:00
Daniel Beck Hansen 21 Reputation points
commented 2024-05-17T07:51:10.4066667+00:00
Amila Hapuarachchi 0 Reputation points
1 answer

macos cli starter guide

I am trying to play around with azure text to speech on macos. the instructions are woefully incomplete. I start with…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,438 questions
asked 2024-05-15T22:32:07.6566667+00:00
Welch, Ivo 0 Reputation points
answered 2024-05-16T20:16:17.4966667+00:00
ivo welch 20 Reputation points