Speech service supported regions

The Speech service allows your application to convert audio to text, perform speech translation, and convert text to speech. The service is available in multiple regions with unique endpoints for the Speech SDK and REST APIs. You can perform custom configurations to your speech experience, for all regions, at the Speech Studio.

Keep in mind the following points:

  • If your application uses a Speech SDK, you provide the region identifier, such as westus, when you create a speech configuration. Make sure the region matches the region of your subscription.
  • If your application uses one of the Speech service REST APIs, the region is part of the endpoint URI you use when making requests.
  • Keys created for a region are valid only in that region. If you attempt to use them with other regions, you get authentication errors.

Note

Speech service doesn't store or process customer data outside the region the customer deploys the service instance in.

Speech SDK

In the Speech SDK, you specify the region as a parameter (for example, in the Speech SDK for C#, you specify the region as a parameter to SpeechConfig.FromSubscription).

Speech-to-text, pronunciation assessment, text-to-speech, and translation

The Speech service is available in these regions for speech-to-text, pronunciation assessment, text-to-speech, and translation:

Geography Region Region identifier
Africa South Africa North southafricanorth
Asia Pacific East Asia eastasia
Asia Pacific Southeast Asia southeastasia 1
Asia Pacific Australia East australiaeast 1
Asia Pacific Central India centralindia 1
Asia Pacific Japan East japaneast
Asia Pacific Japan West japanwest
Asia Pacific Korea Central koreacentral
Canada Canada Central canadacentral 1
Europe North Europe northeurope 1
Europe West Europe westeurope 1
Europe France Central francecentral
Europe Germany West Central germanywestcentral
Europe Norway East norwayeast
Europe Switzerland North switzerlandnorth
Europe Switzerland West switzerlandwest
Europe UK South uksouth 1
Middle East UAE North uaenorth
South America Brazil South brazilsouth
US Central US centralus
US East US eastus 1
US East US 2 eastus2 1
US North Central US northcentralus 1
US South Central US southcentralus 1
US US Gov Arizona usgovarizona 1
US US Gov Virginia usgovvirginia 1
US West Central US westcentralus
US West US westus
US West US 2 westus2 1
US West US 3 westus3

1 The region has dedicated hardware for Custom Speech training. In regions with dedicated hardware for Custom Speech training, the Speech service will use up to 20 hours of your audio training data, and can process about 10 hours of data per day. In other regions, the Speech service uses up to 8 hours of your audio data, and can process about 1 hour of data per day.

If you plan to train a custom model with audio data, use one of the regions with dedicated hardware for faster training. Then you can use the Speech-to-text REST API v3.0 to copy the trained model to another region.

Tip

For pronunciation assessment, en-US and en-GB are available in all regions listed above, zh-CN is available in East Asia and Southeast Asia regions, de-DE, es-ES, and fr-FR are available in West Europe region, and en-AU is available in Australia East region.

Intent recognition

Available regions for intent recognition via the Speech SDK are in the following table.

Global region Region Region identifier
Asia East Asia eastasia
Asia Southeast Asia southeastasia
Australia Australia East australiaeast
Europe North Europe northeurope
Europe West Europe westeurope
North America East US eastus
North America East US 2 eastus2
North America South Central US southcentralus
North America West Central US westcentralus
North America West US westus
North America West US 2 westus2
South America Brazil South brazilsouth

This is a subset of the publishing regions supported by the Language Understanding service (LUIS).

Voice assistants

The Speech SDK supports voice assistant capabilities through Direct Line Speech for regions in the following table.

Global region Region Region identifier
North America West US westus
North America West US 2 westus2
North America East US eastus
North America East US 2 eastus2
North America West Central US westcentralus
North America South Central US southcentralus
Europe West Europe westeurope
Europe North Europe northeurope
Asia East Asia eastasia
Asia Southeast Asia southeastasia
India Central India centralindia

Speaker recognition

Available regions for speaker recognition are in the following table.

Geography Region Region identifier
Americas Central US centralus
Americas East US eastus
Americas East US 2 eastus2
Americas West Central US westcentralus
Americas West US westus
Americas West US 2 westus2
Americas West US 3 westus3
Asia Pacific East Asia eastasia
Asia Pacific Southeast Asia southeastasia
Asia Pacific Central India centralindia
Asia Pacific Japan East japaneast
Asia Pacific Japan West japanwest
Asia Pacific Korea Central koreacentral
Australia Australia East australiaeast
Canada Canada Central canadacentral
Europe North Europe northeurope
Europe West Europe westeurope
Europe France Central francecentral
Europe Germany West Central germanywestcentral
Europe Norway East norwayeast
Europe Switzerland West switzerlandwest
Europe UK South uksouth

Keyword recognition

Available regions for keyword recognition are in the following table.

Region Custom keyword (basic models) Custom keyword (advanced models) Keyword verification
West US Yes No Yes
West US 2 Yes Yes Yes
East US Yes Yes Yes
East US 2 Yes Yes Yes
West Central US Yes No Yes
South Central US Yes Yes Yes
West Europe Yes Yes Yes
North Europe Yes Yes Yes
UK South Yes Yes No
East Asia Yes No Yes
Southeast Asia Yes Yes Yes
Central India Yes Yes Yes
Japan East Yes No Yes
Japan West Yes No No
Australia East Yes Yes No
Brazil South Yes No No
Canada Central Yes No No
Korea Central Yes No No
France Central Yes No No
North Central US Yes Yes No
Central US Yes No No
South Africa North Yes No No

REST APIs

The Speech service also exposes REST endpoints for speech-to-text, text-to-speech, and speaker recognition requests.

Speech-to-text

The endpoint for the REST API has this format:

https://<REGION_IDENTIFIER>.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1

Replace <REGION_IDENTIFIER> with the identifier matching the region of your subscription from this table:

Geography Region Region identifier
Africa South Africa North southafricanorth
Asia Pacific East Asia eastasia
Asia Pacific Southeast Asia southeastasia 1
Asia Pacific Australia East australiaeast 1
Asia Pacific Central India centralindia 1
Asia Pacific Japan East japaneast
Asia Pacific Japan West japanwest
Asia Pacific Korea Central koreacentral
Canada Canada Central canadacentral 1
Europe North Europe northeurope 1
Europe West Europe westeurope 1
Europe France Central francecentral
Europe Germany West Central germanywestcentral
Europe Norway East norwayeast
Europe Switzerland North switzerlandnorth
Europe Switzerland West switzerlandwest
Europe UK South uksouth 1
Middle East UAE North uaenorth
South America Brazil South brazilsouth
US Central US centralus
US East US eastus 1
US East US 2 eastus2 1
US North Central US northcentralus 1
US South Central US southcentralus 1
US US Gov Arizona usgovarizona 1
US US Gov Virginia usgovvirginia 1
US West Central US westcentralus
US West US westus
US West US 2 westus2 1
US West US 3 westus3

1 The region has dedicated hardware for Custom Speech training. In regions with dedicated hardware for Custom Speech training, the Speech service will use up to 20 hours of your audio training data, and can process about 10 hours of data per day. In other regions, the Speech service uses up to 8 hours of your audio data, and can process about 1 hour of data per day.

Note

The language parameter must be appended to the URL to avoid receiving an HTTP error. For example, the language set to US English by using the West US endpoint is: https://westus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=en-US.

For more information, see the speech-to-text REST API.

Text-to-speech

For more information, see the text-to-speech REST API.

Prebuilt neural voices

Use this table to determine availability of neural voices by region or endpoint:

Region Endpoint
Australia East https://australiaeast.tts.speech.microsoft.com/cognitiveservices/v1
Brazil South https://brazilsouth.tts.speech.microsoft.com/cognitiveservices/v1
Canada Central https://canadacentral.tts.speech.microsoft.com/cognitiveservices/v1
Central US https://centralus.tts.speech.microsoft.com/cognitiveservices/v1
China East 2 https://chinaeast2.tts.speech.azure.cn/cognitiveservices/v1
China North 2 https://chinanorth2.tts.speech.azure.cn/cognitiveservices/v1
East Asia https://eastasia.tts.speech.microsoft.com/cognitiveservices/v1
East US https://eastus.tts.speech.microsoft.com/cognitiveservices/v1
East US 2 https://eastus2.tts.speech.microsoft.com/cognitiveservices/v1
France Central https://francecentral.tts.speech.microsoft.com/cognitiveservices/v1
Germany West Central https://germanywestcentral.tts.speech.microsoft.com/cognitiveservices/v1
India Central https://centralindia.tts.speech.microsoft.com/cognitiveservices/v1
Japan East https://japaneast.tts.speech.microsoft.com/cognitiveservices/v1
Japan West https://japanwest.tts.speech.microsoft.com/cognitiveservices/v1
Jio India West https://jioindiawest.tts.speech.microsoft.com/cognitiveservices/v1
Korea Central https://koreacentral.tts.speech.microsoft.com/cognitiveservices/v1
North Central US https://northcentralus.tts.speech.microsoft.com/cognitiveservices/v1
North Europe https://northeurope.tts.speech.microsoft.com/cognitiveservices/v1
Norway East https://norwayeast.tts.speech.microsoft.com/cognitiveservices/v1
South Central US https://southcentralus.tts.speech.microsoft.com/cognitiveservices/v1
Southeast Asia https://southeastasia.tts.speech.microsoft.com/cognitiveservices/v1
Sweden Central https://swedencentral.tts.speech.microsoft.com/cognitiveservices/v1
Switzerland North https://switzerlandnorth.tts.speech.microsoft.com/cognitiveservices/v1
Switzerland West https://switzerlandwest.tts.speech.microsoft.com/cognitiveservices/v1
UAE North https://uaenorth.tts.speech.microsoft.com/cognitiveservices/v1
US Gov Arizona https://usgovarizona.tts.speech.azure.us/cognitiveservices/v1
US Gov Virginia https://usgovvirginia.tts.speech.azure.us/cognitiveservices/v1
UK South https://uksouth.tts.speech.microsoft.com/cognitiveservices/v1
West Central US https://westcentralus.tts.speech.microsoft.com/cognitiveservices/v1
West Europe https://westeurope.tts.speech.microsoft.com/cognitiveservices/v1
West US https://westus.tts.speech.microsoft.com/cognitiveservices/v1
West US 2 https://westus2.tts.speech.microsoft.com/cognitiveservices/v1
West US 3 https://westus3.tts.speech.microsoft.com/cognitiveservices/v1

Tip

Voices in preview are available in only these three regions: East US, West Europe, and Southeast Asia.

Custom neural voices

If you've created a custom neural voice font, use the endpoint that you've created. You can also use the following endpoints. Replace {deploymentId} with the deployment ID for your neural voice model.

Region Training Deployment Endpoint
Australia East Yes Yes https://australiaeast.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Brazil South No Yes https://brazilsouth.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Canada Central No Yes https://canadacentral.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Central US No Yes https://centralus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
East Asia No Yes https://eastasia.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
East US Yes Yes https://eastus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
East US 2 Yes Yes https://eastus2.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
France Central No Yes https://francecentral.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Germany West Central No Yes https://germanywestcentral.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
India Central Yes Yes https://centralindia.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Japan East Yes Yes https://japaneast.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Japan West No Yes https://japanwest.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Jio India West No Yes https://jioindiawest.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Korea Central Yes Yes https://koreacentral.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
North Central US No Yes https://northcentralus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
North Europe Yes Yes https://northeurope.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Norway East No Yes https://norwayeast.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
South Africa North No Yes https://southafricanorth.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
South Central US Yes Yes https://southcentralus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Southeast Asia Yes Yes https://southeastasia.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Switzerland North No Yes https://switzerlandnorth.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
Switzerland West No Yes https://switzerlandwest.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
UAE North No Yes https://uaenorth.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
UK South Yes Yes https://uksouth.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
West Central US No Yes https://westcentralus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
West Europe Yes Yes https://westeurope.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
West US Yes Yes https://westus.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
West US 2 Yes Yes https://westus2.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}
West US 3 No Yes https://westus3.voice.speech.microsoft.com/cognitiveservices/v1?deploymentId={deploymentId}

Note

The preceding regions are available for neural voice model hosting and real-time synthesis. Custom neural voice training is only available in some regions. But users can easily copy a neural voice model from these regions to other regions in the preceding list.

Long Audio API

The Long Audio API is available in multiple regions with unique endpoints:

Region Endpoint
Australia East https://australiaeast.customvoice.api.speech.microsoft.com
East US https://eastus.customvoice.api.speech.microsoft.com
India Central https://centralindia.customvoice.api.speech.microsoft.com
South Central US https://southcentralus.customvoice.api.speech.microsoft.com
Southeast Asia https://southeastasia.customvoice.api.speech.microsoft.com
UK South https://uksouth.customvoice.api.speech.microsoft.com
West Europe https://westeurope.customvoice.api.speech.microsoft.com

Speaker recognition

For more information, see the speaker recognition REST API. The regions available are the same as those for the speaker recognition SDK.