Bing text to speech API

Note

The new Speech Service and SDK are replacing Bing Speech, which will no longer work starting January 14, 2020. For information on switching to the Speech Service, see Migrating from Bing Speech to the Speech Service.

Introduction

With the Bing text to speech API, your application can send HTTP requests to a cloud server, where text is instantly synthesized into human-sounding speech and returned as an audio file. This API can be used in many different contexts to provide real-time text-to-speech conversion in a variety of voices and languages.

Voice synthesis request

Authorization token

Every voice synthesis request requires a JSON Web Token (JWT) access token. The JWT access token is passed in the speech request header. The token expires after 10 minutes. For information about subscribing and obtaining the API keys that are used to retrieve valid JWT access tokens, see Cognitive Services Subscription.

The API key is passed to the token service. For example:

POST https://api.cognitive.microsoft.com/sts/v1.0/issueToken
Content-Length: 0

The required header information for token access is as follows:

Name | Format | Description
Ocp-Apim-Subscription-Key | ASCII | Your subscription key
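
As a minimal sketch of the token request described above, the following Python builds the POST with an empty body and the subscription key header. The function names `build_token_request` and `fetch_token` are illustrative and not part of the API; the URL and header name are the ones given in this section.

```python
import urllib.request

TOKEN_URL = "https://api.cognitive.microsoft.com/sts/v1.0/issueToken"


def build_token_request(subscription_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for the token service."""
    return urllib.request.Request(
        TOKEN_URL,
        data=b"",  # empty body, so urllib sends Content-Length: 0
        method="POST",
        headers={"Ocp-Apim-Subscription-Key": subscription_key},
    )


def fetch_token(subscription_key: str) -> str:
    """Send the request and return the response body (the access token) as text."""
    with urllib.request.urlopen(build_token_request(subscription_key)) as resp:
        return resp.read().decode("ascii")
```

Keeping request construction separate from sending makes it possible to inspect the headers without calling the service.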

The token service returns the JWT access token as text/plain. The JWT is then passed as a Base64 access_token to the speech endpoint in an Authorization header, prefixed with the string Bearer. For example:

Authorization: Bearer [Base64 access_token]
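
Because the token expires after 10 minutes, a client would typically cache it and refresh it shortly before expiry. The sketch below shows one way to do that; the `TokenCache` class, the injected `fetch` callable, and the one-minute refresh margin are assumptions for illustration, not part of the API.

```python
import time
from typing import Callable, Optional

TOKEN_LIFETIME_SECONDS = 10 * 60  # expiry time stated in this section
REFRESH_MARGIN_SECONDS = 60       # assumption: refresh one minute early


class TokenCache:
    """Cache an access token and refetch it shortly before it expires."""

    def __init__(self, fetch: Callable[[], str],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._fetch = fetch   # hypothetical callable that returns a fresh token
        self._clock = clock   # injectable so the cache can be tested without waiting
        self._token: Optional[str] = None
        self._fetched_at = 0.0

    def authorization_header(self) -> str:
        """Return the Authorization header value, refreshing the token if needed."""
        now = self._clock()
        age = now - self._fetched_at
        if self._token is None or age >= TOKEN_LIFETIME_SECONDS - REFRESH_MARGIN_SECONDS:
            self._token = self._fetch()
            self._fetched_at = now
        return "Bearer " + self._token
```

The value returned by `authorization_header()` is what goes into the Authorization header of each synthesis request.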

Clients must use the following endpoint to access the text-to-speech service:

https://speech.platform.bing.com/synthesize

Note

Until you have acquired an access token with your subscription key as described earlier, this link generates a 403 Forbidden response error.

HTTP headers

The following table shows the HTTP headers that are used for voice synthesis requests.

Header | Value | Comments
Content-Type | application/ssml+xml | The input content type.
X-Microsoft-OutputFormat | One of: ssml-16khz-16bit-mono-tts, raw-16khz-16bit-mono-pcm, audio-16khz-16kbps-mono-siren, riff-16khz-16kbps-mono-siren, riff-16khz-16bit-mono-pcm, audio-16khz-128kbitrate-mono-mp3, audio-16khz-64kbitrate-mono-mp3, audio-16khz-32kbitrate-mono-mp3 | The output audio format.
X-Search-AppId | A GUID (hex only, no dashes) | An ID that uniquely identifies the client application. This can be the store ID for apps. If one is not available, the ID can be user generated for an application.
X-Search-ClientID | A GUID (hex only, no dashes) | An ID that uniquely identifies an application instance for each installation.
User-Agent | Application name | The application name is required and must be fewer than 255 characters.
Authorization | Authorization token | See the Authorization token section.
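
The constraints on the client identification headers can be checked before a request is sent. The following sketch assumes a GUID is written as 32 hexadecimal digits; `new_guid_hex` and `client_headers` are hypothetical helper names.

```python
import re
import uuid

# A GUID is 128 bits, i.e. 32 hex digits when written without dashes.
GUID_HEX = re.compile(r"[0-9a-fA-F]{32}")


def new_guid_hex() -> str:
    """Generate a random GUID as 32 hex digits with no dashes."""
    return uuid.uuid4().hex


def client_headers(app_id: str, client_id: str, user_agent: str) -> dict:
    """Validate and return the client identification headers from the table."""
    for name, value in (("X-Search-AppId", app_id), ("X-Search-ClientID", client_id)):
        if not GUID_HEX.fullmatch(value):
            raise ValueError(f"{name} must be 32 hex digits with no dashes")
    if not user_agent or len(user_agent) >= 255:
        raise ValueError("User-Agent is required and must be fewer than 255 characters")
    return {
        "X-Search-AppId": app_id,
        "X-Search-ClientID": client_id,
        "User-Agent": user_agent,
    }
```

A client would generate the X-Search-ClientID value once per installation and store it, rather than creating a new one for every request.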

Input parameters

Requests to the Bing text to speech API are made using HTTP POST calls. The headers are specified in the previous section. The body contains Speech Synthesis Markup Language (SSML) input that represents the text to be synthesized. For a description of the markup used to control aspects of speech such as the language and gender of the speaker, see the SSML W3C Specification.

Note

The maximum supported size of the SSML input is 1,024 characters, including all tags.
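
One way to respect the size limit is to build the SSML programmatically and check its length before sending. This is a sketch only: `build_ssml` is a hypothetical helper, and it counts Python string characters, which is an assumption about how the service measures the limit.

```python
from xml.sax.saxutils import escape, quoteattr

MAX_SSML_CHARS = 1024  # limit stated in the note above, tags included


def build_ssml(text: str, voice_name: str,
               lang: str = "en-US", gender: str = "Female") -> str:
    """Wrap text in speak/voice elements and enforce the size limit."""
    ssml = (
        f"<speak version='1.0' xml:lang={quoteattr(lang)}>"
        f"<voice xml:lang={quoteattr(lang)} xml:gender={quoteattr(gender)} "
        f"name={quoteattr(voice_name)}>"
        f"{escape(text)}"  # escape &, < and > so the body stays valid XML
        "</voice></speak>"
    )
    if len(ssml) > MAX_SSML_CHARS:
        raise ValueError(f"SSML is {len(ssml)} characters; the maximum is {MAX_SSML_CHARS}")
    return ssml
```

Longer texts would have to be split into several requests, each with its own SSML document.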

Example: voice output request

An example of a voice output request is as follows:

POST /synthesize HTTP/1.1
Host: speech.platform.bing.com
X-Microsoft-OutputFormat: riff-8khz-8bit-mono-mulaw
Content-Type: application/ssml+xml
Content-Length: 197
Authorization: Bearer [Base64 access_token]

<speak version='1.0' xml:lang='en-US'><voice xml:lang='en-US' xml:gender='Female' name='Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)'>Microsoft Bing Voice Output API</voice></speak>
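
The same request can be assembled in code. The sketch below uses Python's standard `urllib.request` and only constructs the request; `build_synthesis_request`, the default output format, and the example user agent are illustrative choices, not requirements of the API.

```python
import urllib.request

SYNTHESIZE_URL = "https://speech.platform.bing.com/synthesize"


def build_synthesis_request(
    ssml: str,
    token: str,
    output_format: str = "riff-16khz-16bit-mono-pcm",
    user_agent: str = "ExampleTtsClient",  # hypothetical application name
) -> urllib.request.Request:
    """Build (but do not send) a voice synthesis POST request."""
    return urllib.request.Request(
        SYNTHESIZE_URL,
        data=ssml.encode("utf-8"),  # urllib derives Content-Length from the body
        method="POST",
        headers={
            "Content-Type": "application/ssml+xml",
            "X-Microsoft-OutputFormat": output_format,
            "Authorization": "Bearer " + token,
            "User-Agent": user_agent,
        },
    )
```

Passing the result to `urllib.request.urlopen` would send it; on success the response body is the audio payload.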

Voice output response

The Bing text to speech API returns the audio to the client in the HTTP response to the POST request. The API response contains the audio stream and the codec, and it matches the requested output format. The audio returned for a given request must not exceed 15 seconds.

Example: successful synthesis response

The following code is an example of an HTTP response to a successful voice synthesis request. The comments and formatting of the code are for purposes of this example only and are omitted from the actual response.

HTTP/1.1 200 OK
Content-Length: XXX
Content-Type: audio/x-wav

Response audio payload

Example: synthesis failure

The following example code shows an HTTP response to a voice-synthesis query failure:

HTTP/1.1 400 XML parser error
Content-Type: text/xml
Content-Length: 0

Error responses

Error | Description
HTTP/400 Bad Request | A required parameter is missing, empty, or null, or the value passed to either a required or optional parameter is invalid. One reason for getting the "invalid" response is passing a string value that is longer than the allowed length. A brief description of the problematic parameter is included.
HTTP/401 Unauthorized | The request is not authorized.
HTTP/413 RequestEntityTooLarge | The SSML input is larger than what is supported.
HTTP/502 BadGateway | There is a network-related problem or a server-side issue.

An example of an error response is as follows:

HTTP/1.0 400 Bad Request
Content-Length: XXX
Content-Type: text/plain; charset=UTF-8

Voice name not supported
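
A client can turn these status codes into a readable error. The sketch below is one possible approach; `SynthesisError` and `check_response` are hypothetical names, and the short descriptions are condensed from the table above.

```python
ERROR_DESCRIPTIONS = {
    400: "Bad Request: a parameter is missing, empty, null, or invalid",
    401: "Unauthorized: the request is not authorized",
    413: "RequestEntityTooLarge: the SSML input is larger than what is supported",
    502: "BadGateway: network-related problem or server-side issue",
}


class SynthesisError(Exception):
    """Raised when the service answers with a non-200 status."""

    def __init__(self, status: int, message: str) -> None:
        super().__init__(f"HTTP {status} {message}")
        self.status = status


def check_response(status: int, body: bytes) -> bytes:
    """Return the audio payload on success, otherwise raise SynthesisError."""
    if status == 200:
        return body
    reason = ERROR_DESCRIPTIONS.get(status, "Unexpected status")
    # Error bodies are plain text in the example above, e.g. "Voice name not supported".
    detail = body.decode("utf-8", errors="replace").strip()
    raise SynthesisError(status, f"{reason} ({detail})" if detail else reason)
```

For the error response shown above, the raised message would carry both the 400 description and the text "Voice name not supported".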

Changing voice output via SSML

The Microsoft Text-to-Speech API supports SSML 1.0 as defined in W3C Speech Synthesis Markup Language (SSML) Version 1.0. This section shows examples of changing certain characteristics of the generated voice output, such as speaking rate and pronunciation, by using SSML tags.

  1. Add a break

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, BenjaminRUS)'> Welcome to use Microsoft Cognitive Services <break time="100ms" /> Text-to-Speech API.</voice> </speak>
    
  2. Change speaking rate

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody rate="+30.00%">Welcome to use Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  3. Change pronunciation

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'> <phoneme alphabet="ipa" ph="t&#x259;mei&#x325;&#x27E;ou&#x325;"> tomato </phoneme></voice> </speak>
    
  4. Change volume

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody volume="+20.00%">Welcome to use Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  5. Change pitch

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'>Welcome to use <prosody pitch="high">Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  6. Change prosody contour

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody contour="(80%,+20%) (90%,+30%)" >Good morning.</prosody></voice> </speak>
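
The prosody examples above differ only in the attribute set on the prosody element, so they can be generated by one helper. This is a sketch under assumptions: `prosody_ssml` is a hypothetical function, it accepts only the four attributes shown in the examples, and it uses the W3C SSML 1.0 namespace URI `http://www.w3.org/2001/10/synthesis`.

```python
from xml.sax.saxutils import escape, quoteattr

SSML_NS = "http://www.w3.org/2001/10/synthesis"  # W3C SSML 1.0 namespace
PROSODY_ATTRS = {"rate", "volume", "pitch", "contour"}


def prosody_ssml(text: str, voice_name: str, lang: str = "en-US", **prosody: str) -> str:
    """Wrap text in a prosody element with the given attributes."""
    unknown = set(prosody) - PROSODY_ATTRS
    if unknown:
        raise ValueError(f"unsupported prosody attributes: {sorted(unknown)}")
    # Sort the attributes so the output is deterministic.
    attrs = "".join(f" {name}={quoteattr(value)}" for name, value in sorted(prosody.items()))
    return (
        f"<speak version='1.0' xmlns={quoteattr(SSML_NS)} xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>"
        f"<prosody{attrs}>{escape(text)}</prosody>"
        "</voice></speak>"
    )
```

For instance, `prosody_ssml("Good morning.", voice, rate="+30.00%", pitch="high")` combines a faster speaking rate with a higher pitch in a single document.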
    

Note

The audio data must be 8k or 16k WAV files in the following format:

  • CRC code (CRC-32): 4 bytes (DWORD) with valid range 0x00000000 ~ 0xFFFFFFFF
  • Audio format flag: 4 bytes (DWORD) with valid range 0x00000000 ~ 0xFFFFFFFF
  • Sample count: 4 bytes (DWORD) with valid range 0x00000000 ~ 0x7FFFFFFF
  • Size of binary body: 4 bytes (DWORD) with valid range 0x00000000 ~ 0x7FFFFFFF
  • Binary body: n bytes
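
The layout in this note can be read with Python's `struct` module. The sketch below rests on assumptions the note does not state: the four DWORDs are taken to be little-endian, and the binary body is taken to follow the 16-byte header directly. The function name and dictionary keys are illustrative, and the CRC is returned without being verified because the note does not say which bytes it covers.

```python
import struct

# Four unsigned 32-bit fields; little-endian byte order is an assumption.
HEADER = struct.Struct("<IIII")
MAX_SIGNED_DWORD = 0x7FFFFFFF


def parse_audio_block(data: bytes) -> dict:
    """Split a block into CRC, format flag, sample count, and binary body."""
    if len(data) < HEADER.size:
        raise ValueError("block is shorter than the 16-byte header")
    crc, format_flag, sample_count, body_size = HEADER.unpack_from(data)
    if sample_count > MAX_SIGNED_DWORD or body_size > MAX_SIGNED_DWORD:
        raise ValueError("sample count or body size is outside the valid range")
    body = data[HEADER.size:HEADER.size + body_size]
    if len(body) != body_size:
        raise ValueError("binary body is shorter than the declared size")
    return {
        "crc32": crc,
        "format_flag": format_flag,
        "sample_count": sample_count,
        "body": body,
    }
```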

Sample application

For implementation details, see the Visual C#.NET text-to-speech sample application.

Supported locales and voice fonts

The following table identifies some of the supported locales and related voice fonts.

Locale | Gender | Service name mapping
ar-EG* | Female | "Microsoft Server Speech Text to Speech Voice (ar-EG, Hoda)"
ar-SA | Male | "Microsoft Server Speech Text to Speech Voice (ar-SA, Naayf)"
bg-BG | Male | "Microsoft Server Speech Text to Speech Voice (bg-BG, Ivan)"
ca-ES | Female | "Microsoft Server Speech Text to Speech Voice (ca-ES, HerenaRUS)"
cs-CZ | Male | "Microsoft Server Speech Text to Speech Voice (cs-CZ, Jakub)"
da-DK | Female | "Microsoft Server Speech Text to Speech Voice (da-DK, HelleRUS)"
de-AT | Male | "Microsoft Server Speech Text to Speech Voice (de-AT, Michael)"
de-CH | Male | "Microsoft Server Speech Text to Speech Voice (de-CH, Karsten)"
de-DE | Female | "Microsoft Server Speech Text to Speech Voice (de-DE, Hedda)"
de-DE | Female | "Microsoft Server Speech Text to Speech Voice (de-DE, HeddaRUS)"
de-DE | Male | "Microsoft Server Speech Text to Speech Voice (de-DE, Stefan, Apollo)"
el-GR | Male | "Microsoft Server Speech Text to Speech Voice (el-GR, Stefanos)"
en-AU | Female | "Microsoft Server Speech Text to Speech Voice (en-AU, Catherine)"
en-AU | Female | "Microsoft Server Speech Text to Speech Voice (en-AU, HayleyRUS)"
en-CA | Female | "Microsoft Server Speech Text to Speech Voice (en-CA, Linda)"
en-CA | Female | "Microsoft Server Speech Text to Speech Voice (en-CA, HeatherRUS)"
en-GB | Female | "Microsoft Server Speech Text to Speech Voice (en-GB, Susan, Apollo)"
en-GB | Female | "Microsoft Server Speech Text to Speech Voice (en-GB, HazelRUS)"
en-GB | Male | "Microsoft Server Speech Text to Speech Voice (en-GB, George, Apollo)"
en-IE | Male | "Microsoft Server Speech Text to Speech Voice (en-IE, Sean)"
en-IN | Female | "Microsoft Server Speech Text to Speech Voice (en-IN, Heera, Apollo)"
en-IN | Female | "Microsoft Server Speech Text to Speech Voice (en-IN, PriyaRUS)"
en-IN | Male | "Microsoft Server Speech Text to Speech Voice (en-IN, Ravi, Apollo)"
en-US | Female | "Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)"
en-US | Female | "Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)"
en-US | Male | "Microsoft Server Speech Text to Speech Voice (en-US, BenjaminRUS)"
es-ES | Female | "Microsoft Server Speech Text to Speech Voice (es-ES, Laura, Apollo)"
es-ES | Female | "Microsoft Server Speech Text to Speech Voice (es-ES, HelenaRUS)"
es-ES | Male | "Microsoft Server Speech Text to Speech Voice (es-ES, Pablo, Apollo)"
es-MX | Female | "Microsoft Server Speech Text to Speech Voice (es-MX, HildaRUS)"
es-MX | Male | "Microsoft Server Speech Text to Speech Voice (es-MX, Raul, Apollo)"
fi-FI | Female | "Microsoft Server Speech Text to Speech Voice (fi-FI, HeidiRUS)"
fr-CA | Female | "Microsoft Server Speech Text to Speech Voice (fr-CA, Caroline)"
fr-CA | Female | "Microsoft Server Speech Text to Speech Voice (fr-CA, HarmonieRUS)"
fr-CH | Male | "Microsoft Server Speech Text to Speech Voice (fr-CH, Guillaume)"
fr-FR | Female | "Microsoft Server Speech Text to Speech Voice (fr-FR, Julie, Apollo)"
fr-FR | Female | "Microsoft Server Speech Text to Speech Voice (fr-FR, HortenseRUS)"
fr-FR | Male | "Microsoft Server Speech Text to Speech Voice (fr-FR, Paul, Apollo)"
he-IL | Male | "Microsoft Server Speech Text to Speech Voice (he-IL, Asaf)"
hi-IN | Female | "Microsoft Server Speech Text to Speech Voice (hi-IN, Kalpana, Apollo)"
hi-IN | Female | "Microsoft Server Speech Text to Speech Voice (hi-IN, Kalpana)"
hi-IN | Male | "Microsoft Server Speech Text to Speech Voice (hi-IN, Hemant)"
hr-HR | Male | "Microsoft Server Speech Text to Speech Voice (hr-HR, Matej)"
hu-HU | Male | "Microsoft Server Speech Text to Speech Voice (hu-HU, Szabolcs)"
id-ID | Male | "Microsoft Server Speech Text to Speech Voice (id-ID, Andika)"
it-IT | Male | "Microsoft Server Speech Text to Speech Voice (it-IT, Cosimo, Apollo)"
it-IT | Female | "Microsoft Server Speech Text to Speech Voice (it-IT, LuciaRUS)"
ja-JP | Female | "Microsoft Server Speech Text to Speech Voice (ja-JP, Ayumi, Apollo)"
ja-JP | Male | "Microsoft Server Speech Text to Speech Voice (ja-JP, Ichiro, Apollo)"
ja-JP | Female | "Microsoft Server Speech Text to Speech Voice (ja-JP, HarukaRUS)"
ko-KR | Female | "Microsoft Server Speech Text to Speech Voice (ko-KR, HeamiRUS)"
ms-MY | Male | "Microsoft Server Speech Text to Speech Voice (ms-MY, Rizwan)"
nb-NO | Female | "Microsoft Server Speech Text to Speech Voice (nb-NO, HuldaRUS)"
nl-NL | Female | "Microsoft Server Speech Text to Speech Voice (nl-NL, HannaRUS)"
pl-PL | Female | "Microsoft Server Speech Text to Speech Voice (pl-PL, PaulinaRUS)"
pt-BR | Female | "Microsoft Server Speech Text to Speech Voice (pt-BR, HeloisaRUS)"
pt-BR | Male | "Microsoft Server Speech Text to Speech Voice (pt-BR, Daniel, Apollo)"
pt-PT | Female | "Microsoft Server Speech Text to Speech Voice (pt-PT, HeliaRUS)"
ro-RO | Male | "Microsoft Server Speech Text to Speech Voice (ro-RO, Andrei)"
ru-RU | Female | "Microsoft Server Speech Text to Speech Voice (ru-RU, Irina, Apollo)"
ru-RU | Male | "Microsoft Server Speech Text to Speech Voice (ru-RU, Pavel, Apollo)"
ru-RU | Female | "Microsoft Server Speech Text to Speech Voice (ru-RU, EkaterinaRUS)"
sk-SK | Male | "Microsoft Server Speech Text to Speech Voice (sk-SK, Filip)"
sl-SI | Male | "Microsoft Server Speech Text to Speech Voice (sl-SI, Lado)"
sv-SE | Female | "Microsoft Server Speech Text to Speech Voice (sv-SE, HedvigRUS)"
ta-IN | Male | "Microsoft Server Speech Text to Speech Voice (ta-IN, Valluvar)"
th-TH | Male | "Microsoft Server Speech Text to Speech Voice (th-TH, Pattara)"
tr-TR | Female | "Microsoft Server Speech Text to Speech Voice (tr-TR, SedaRUS)"
vi-VN | Male | "Microsoft Server Speech Text to Speech Voice (vi-VN, An)"
zh-CN | Female | "Microsoft Server Speech Text to Speech Voice (zh-CN, HuihuiRUS)"
zh-CN | Female | "Microsoft Server Speech Text to Speech Voice (zh-CN, Yaoyao, Apollo)"
zh-CN | Male | "Microsoft Server Speech Text to Speech Voice (zh-CN, Kangkang, Apollo)"
zh-HK | Female | "Microsoft Server Speech Text to Speech Voice (zh-HK, Tracy, Apollo)"
zh-HK | Female | "Microsoft Server Speech Text to Speech Voice (zh-HK, TracyRUS)"
zh-HK | Male | "Microsoft Server Speech Text to Speech Voice (zh-HK, Danny, Apollo)"
zh-TW | Female | "Microsoft Server Speech Text to Speech Voice (zh-TW, Yating, Apollo)"
zh-TW | Female | "Microsoft Server Speech Text to Speech Voice (zh-TW, HanHanRUS)"
zh-TW | Male | "Microsoft Server Speech Text to Speech Voice (zh-TW, Zhiwei, Apollo)"

*ar-EG supports Modern Standard Arabic (MSA).
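
Since every service name follows the same pattern, a client can keep only the short voice names and format the full string when building SSML. The sketch below copies a few rows from the table; the `VOICES` mapping and `service_name` function are illustrative, and a real client would include every voice it needs.

```python
# A few (locale, gender) -> short voice name entries copied from the table.
VOICES = {
    ("en-US", "Female"): "ZiraRUS",
    ("en-US", "Male"): "BenjaminRUS",
    ("de-DE", "Female"): "HeddaRUS",
    ("fr-FR", "Male"): "Paul, Apollo",
}


def service_name(locale: str, gender: str) -> str:
    """Return the full service name for the voice attribute in SSML."""
    try:
        voice = VOICES[(locale, gender)]
    except KeyError:
        raise ValueError(f"no voice configured for {locale} / {gender}") from None
    return f"Microsoft Server Speech Text to Speech Voice ({locale}, {voice})"
```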

Note

The previous service names Microsoft Server Speech Text to Speech Voice (cs-CZ, Vit) and Microsoft Server Speech Text to Speech Voice (en-IE, Shaun) have been deprecated since March 31, 2018, in order to optimize the Bing Speech API's capabilities. Please update your code with the updated names.

Troubleshooting and support

Post all questions and issues to the Bing Speech Service MSDN forum. Include complete details, such as:

  • An example of the full request string.
  • If applicable, the full output of a failed request, which includes log IDs.
  • The percentage of requests that are failing.