Bing text to speech API

Note

The new Speech Service and SDK is replacing Bing Speech, which will no longer work starting October 15, 2019. For information on switching to the Speech Service, see Migrating from Bing Speech to the Speech Service.

Introduction

With the Bing text to speech API, your application can send HTTP requests to a cloud server, where text is synthesized into human-sounding speech and returned as an audio file. The API can be used in many contexts to provide real-time text-to-speech conversion in a variety of voices and languages.

Voice synthesis request

Authorization token

Every voice synthesis request requires a JSON Web Token (JWT) access token. The JWT access token is passed in the speech request header. The token expires after 10 minutes. For information about subscribing and obtaining the API keys that are used to retrieve valid JWT access tokens, see Cognitive Services Subscription.

The API key is passed to the token service. For example:

POST https://api.cognitive.microsoft.com/sts/v1.0/issueToken
Content-Length: 0

The required header information for token access is as follows.

| Name | Format | Description |
| --- | --- | --- |
| Ocp-Apim-Subscription-Key | ASCII | Your subscription key |
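As a minimal sketch of the token request described above, the following Python builds the POST request with the standard library but does not send it. The function name `build_token_request` and the placeholder key are assumptions made for illustration; only the URL and the `Ocp-Apim-Subscription-Key` header come from this document.

```python
import urllib.request

# Token endpoint as given in this document.
TOKEN_URL = "https://api.cognitive.microsoft.com/sts/v1.0/issueToken"


def build_token_request(subscription_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request that asks for a JWT access token."""
    return urllib.request.Request(
        TOKEN_URL,
        data=b"",  # empty body, matching Content-Length: 0
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": subscription_key,
            "Content-Length": "0",
        },
    )


# "YOUR_SUBSCRIPTION_KEY" is a placeholder, not a real key.
req = build_token_request("YOUR_SUBSCRIPTION_KEY")
print(req.get_method(), req.full_url)
```

Sending the request would be a call such as `urllib.request.urlopen(req)`; the sketch stops short of that so it can be run without a subscription.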

The token service returns the JWT access token as text/plain. The JWT is then passed to the speech endpoint as a Base64 access_token in an authorization header, prefixed with the string Bearer. For example:

Authorization: Bearer [Base64 access_token]
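Because the token expires after 10 minutes, a client will usually want to reuse it and refresh it shortly before expiry. The sketch below is one way to do that, not the service's prescribed approach: the `TokenCache` class, the injected `fetch_token` callable, and the one-minute refresh margin are all assumptions for illustration.

```python
import time
from typing import Callable, Optional

TOKEN_LIFETIME_SECONDS = 10 * 60  # expiry time stated in this document
REFRESH_MARGIN_SECONDS = 60       # assumption: refresh one minute early


class TokenCache:
    """Caches an access token and renews it before the 10-minute expiry."""

    def __init__(self, fetch_token: Callable[[], str],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._fetch_token = fetch_token  # hypothetical callable that calls the token service
        self._clock = clock
        self._token: Optional[str] = None
        self._fetched_at = 0.0

    def authorization_header(self) -> str:
        """Return the value for the Authorization header, refreshing if needed."""
        now = self._clock()
        too_old = now - self._fetched_at >= TOKEN_LIFETIME_SECONDS - REFRESH_MARGIN_SECONDS
        if self._token is None or too_old:
            self._token = self._fetch_token().strip()
            self._fetched_at = now
        return "Bearer " + self._token
```

Passing the fetch function in keeps the cache independent of any HTTP library and makes the expiry logic easy to exercise with a fake clock.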

Clients must use the following endpoint to access the text-to-speech service:

https://speech.platform.bing.com/synthesize

Note

Until you have acquired an access token with your subscription key as described earlier, this link generates a 403 Forbidden response error.

HTTP headers

The following table shows the HTTP headers that are used for voice synthesis requests.

| Header | Value | Comments |
| --- | --- | --- |
| Content-Type | application/ssml+xml | The input content type. |
| X-Microsoft-OutputFormat | One of: (1) ssml-16khz-16bit-mono-tts; (2) raw-16khz-16bit-mono-pcm; (3) audio-16khz-16kbps-mono-siren; (4) riff-16khz-16kbps-mono-siren; (5) riff-16khz-16bit-mono-pcm; (6) audio-16khz-128kbitrate-mono-mp3; (7) audio-16khz-64kbitrate-mono-mp3; (8) audio-16khz-32kbitrate-mono-mp3 | The output audio format. |
| X-Search-AppId | A GUID (hex only, no dashes) | An ID that uniquely identifies the client application. This can be the store ID for apps. If one is not available, the ID can be user generated for an application. |
| X-Search-ClientID | A GUID (hex only, no dashes) | An ID that uniquely identifies an application instance for each installation. |
| User-Agent | Application name | The application name is required and must be fewer than 255 characters. |
| Authorization | Authorization token | See the Authorization token section. |
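The header rules in the table can be checked on the client before a request is sent. The following is a small sketch under that assumption; the function names and the sample application name `MyTtsClient` are hypothetical, while the header names and constraints are taken from the table.

```python
import string
import uuid


def _is_plain_guid(value: str) -> bool:
    """True for a GUID written as 32 hex digits with no dashes."""
    return len(value) == 32 and all(c in string.hexdigits for c in value)


def build_synthesis_headers(access_token: str, output_format: str,
                            app_id: str, client_id: str, user_agent: str) -> dict:
    """Assemble the synthesis request headers, validating the documented constraints."""
    if not user_agent or len(user_agent) >= 255:
        raise ValueError("User-Agent is required and must be fewer than 255 characters")
    for header, value in (("X-Search-AppId", app_id), ("X-Search-ClientID", client_id)):
        if not _is_plain_guid(value):
            raise ValueError(f"{header} must be a GUID (hex only, no dashes)")
    return {
        "Content-Type": "application/ssml+xml",
        "X-Microsoft-OutputFormat": output_format,
        "X-Search-AppId": app_id,
        "X-Search-ClientID": client_id,
        "User-Agent": user_agent,
        "Authorization": "Bearer " + access_token,
    }


headers = build_synthesis_headers(
    access_token="[Base64 access_token]",  # placeholder, as in the examples above
    output_format="riff-16khz-16bit-mono-pcm",
    app_id=uuid.uuid4().hex,      # uuid4().hex is 32 hex digits, no dashes
    client_id=uuid.uuid4().hex,
    user_agent="MyTtsClient",     # hypothetical application name
)
```

In a real client the application and client IDs would be generated once and stored, since they are meant to identify the application and the installation rather than a single request.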

Input parameters

Requests to the Bing text to speech API are made using HTTP POST calls. The headers are specified in the previous section. The body contains Speech Synthesis Markup Language (SSML) input that represents the text to be synthesized. For a description of the markup used to control aspects of speech such as the language and gender of the speaker, see the SSML W3C Specification.

Note

The maximum size of the SSML input that is supported is 1,024 characters, including all tags.
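Since the 1,024-character limit counts the tags as well as the text, it is convenient to measure the final SSML string rather than the raw text. The sketch below assumes that interpretation; the helper name `build_ssml` and its default arguments are illustrative only.

```python
from xml.sax.saxutils import escape, quoteattr

MAX_SSML_CHARS = 1024  # limit stated in this document, tags included


def build_ssml(text: str, voice_name: str,
               locale: str = "en-US", gender: str = "Female") -> str:
    """Wrap text in a speak/voice envelope and enforce the SSML size limit."""
    ssml = (
        f"<speak version='1.0' xml:lang={quoteattr(locale)}>"
        f"<voice xml:lang={quoteattr(locale)} xml:gender={quoteattr(gender)} "
        f"name={quoteattr(voice_name)}>{escape(text)}</voice></speak>"
    )
    if len(ssml) > MAX_SSML_CHARS:
        raise ValueError(
            f"SSML is {len(ssml)} characters; the maximum is {MAX_SSML_CHARS}"
        )
    return ssml


ssml = build_ssml(
    "Hello & welcome",
    "Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)",
)
```

Escaping the text with `escape` keeps characters such as `&` and `<` from breaking the XML, at the cost of a few extra characters that also count toward the limit.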

Example: voice output request

An example of a voice output request is as follows:

POST /synthesize HTTP/1.1
Host: speech.platform.bing.com
X-Microsoft-OutputFormat: riff-8khz-8bit-mono-mulaw
Content-Type: application/ssml+xml
Content-Length: 197
Authorization: Bearer [Base64 access_token]

<speak version='1.0' xml:lang='en-US'><voice xml:lang='en-US' xml:gender='Female' name='Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)'>Microsoft Bing Voice Output API</voice></speak>

Voice output response

The Bing text to speech API returns the audio to the client in the HTTP response. The response contains the audio stream and the codec, and it matches the requested output format. The audio returned for a given request must not exceed 15 seconds.

Example: successful synthesis response

The following is an example of the HTTP response to a successful voice synthesis request. The placeholder values are for purposes of this example only; an actual response carries the real content length and the binary audio payload.

HTTP/1.1 200 OK
Content-Length: XXX
Content-Type: audio/x-wav

Response audio payload
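To see how a returned payload relates to the 15-second limit, a client can measure its duration. The sketch below assumes a RIFF PCM payload (for example riff-16khz-16bit-mono-pcm), because Python's `wave` module reads only PCM WAV data; the helper `make_silence` exists only to produce test input locally and is not part of the API.

```python
import io
import wave

MAX_AUDIO_SECONDS = 15.0  # limit stated in this document


def wav_duration_seconds(payload: bytes) -> float:
    """Return the duration of a RIFF PCM payload in seconds."""
    with wave.open(io.BytesIO(payload), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def make_silence(seconds: float, rate: int = 16000) -> bytes:
    """Create a 16-bit mono WAV of silence, standing in for a service response."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 16-bit samples
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    return buf.getvalue()


payload = make_silence(2)
duration = wav_duration_seconds(payload)
print(f"{duration:.1f} s, within limit: {duration <= MAX_AUDIO_SECONDS}")
```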

Example: synthesis failure

The following example shows the HTTP response to a failed voice synthesis request:

HTTP/1.1 400 XML parser error
Content-Type: text/xml
Content-Length: 0

Error responses

| Error | Description |
| --- | --- |
| HTTP/400 Bad Request | A required parameter is missing, empty, or null, or the value passed to either a required or optional parameter is invalid. One reason for getting the "invalid" response is passing a string value that is longer than the allowed length. A brief description of the problematic parameter is included. |
| HTTP/401 Unauthorized | The request is not authorized. |
| HTTP/413 RequestEntityTooLarge | The SSML input is larger than what is supported. |
| HTTP/502 BadGateway | There is a network-related problem or a server-side issue. |

An example of an error response is as follows:

HTTP/1.0 400 Bad Request
Content-Length: XXX
Content-Type: text/plain; charset=UTF-8

Voice name not supported
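One way a client might surface these errors is to map the status code to the description from the table and keep the plain-text body as detail. The following sketch assumes exactly that; the `SynthesisError` class and `check_response` function are hypothetical names, not part of any SDK.

```python
# Short summaries of the error table in this document.
ERROR_DESCRIPTIONS = {
    400: "Bad Request: a parameter is missing, empty, null, or invalid",
    401: "Unauthorized: the request is not authorized",
    413: "RequestEntityTooLarge: the SSML input is larger than what is supported",
    502: "BadGateway: network-related problem or server-side issue",
}


class SynthesisError(Exception):
    """Raised when the service answers with a non-200 status."""

    def __init__(self, status: int, detail: str = "") -> None:
        self.status = status
        self.detail = detail
        summary = ERROR_DESCRIPTIONS.get(status, "Unexpected status")
        message = f"HTTP {status} {summary}"
        if detail:
            message += f" ({detail})"
        super().__init__(message)


def check_response(status: int, body: bytes = b"") -> bytes:
    """Return the audio body on success, otherwise raise SynthesisError."""
    if status == 200:
        return body
    raise SynthesisError(status, body.decode("utf-8", errors="replace").strip())
```

For the error response shown above, `check_response(400, b"Voice name not supported")` raises a `SynthesisError` whose `detail` is the body text.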

Changing voice output via SSML

The Microsoft Text-to-Speech API supports SSML 1.0 as defined in W3C Speech Synthesis Markup Language (SSML) Version 1.0. This section shows examples of using SSML tags to change certain characteristics of the generated voice output, such as speaking rate and pronunciation.

  1. Add a break

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, BenjaminRUS)'> Welcome to use Microsoft Cognitive Services <break time="100ms" /> Text-to-Speech API.</voice> </speak>
    
  2. Change speaking rate

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody rate="+30.00%">Welcome to use Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  3. Pronunciation

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'> <phoneme alphabet="ipa" ph="t&#x259;mei&#x325;&#x27E;ou&#x325;"> tomato </phoneme></voice> </speak>
    
  4. Change volume

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody volume="+20.00%">Welcome to use Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  5. Change pitch

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'>Welcome to use <prosody pitch="high">Microsoft Cognitive Services Text-to-Speech API.</prosody></voice> </speak>
    
  6. Change prosody contour

    <speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xml:lang='en-US'><voice  name='Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)'><prosody contour="(80%,+20%) (90%,+30%)" >Good morning.</prosody></voice> </speak>
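The prosody examples above share one shape, so they can be generated from a single helper. This is a sketch only: the function name `prosody_ssml` and the keyword-argument interface are assumptions, and the allowed attribute set is limited to the four prosody attributes shown in the examples.

```python
from xml.sax.saxutils import escape, quoteattr

SSML_NS = "http://www.w3.org/2001/10/synthesis"
PROSODY_ATTRIBUTES = ("rate", "volume", "pitch", "contour")  # those used above


def prosody_ssml(text: str, voice_name: str, locale: str = "en-US",
                 **prosody: str) -> str:
    """Build SSML whose text is wrapped in a prosody element, if any attribute is given."""
    for key in prosody:
        if key not in PROSODY_ATTRIBUTES:
            raise ValueError(f"unsupported prosody attribute: {key}")
    attrs = "".join(f" {key}={quoteattr(value)}" for key, value in prosody.items())
    body = escape(text)
    if attrs:
        body = f"<prosody{attrs}>{body}</prosody>"
    return (
        f"<speak version='1.0' xmlns={quoteattr(SSML_NS)} xml:lang={quoteattr(locale)}>"
        f"<voice name={quoteattr(voice_name)}>{body}</voice></speak>"
    )


faster = prosody_ssml(
    "Welcome to use Microsoft Cognitive Services Text-to-Speech API.",
    "Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS)",
    rate="+30.00%",
)
```

Calling the helper with `volume="+20.00%"` or `pitch="high"` instead of `rate` yields the counterparts of examples 4 and 5, with the prosody element wrapped around the whole text.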
    

Note

The audio data must be 8 kHz or 16 kHz WAV, laid out in the following format:

  • CRC code (CRC-32): 4 bytes (DWORD) with valid range 0x00000000 ~ 0xFFFFFFFF.
  • Audio format flag: 4 bytes (DWORD) with valid range 0x00000000 ~ 0xFFFFFFFF.
  • Sample count: 4 bytes (DWORD) with valid range 0x00000000 ~ 0x7FFFFFFF.
  • Size of binary body: 4 bytes (DWORD) with valid range 0x00000000 ~ 0x7FFFFFFF.
  • Binary body: n bytes.
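The layout in this note is four DWORD fields followed by the body, which can be read with Python's `struct` module. The sketch below rests on an assumption the note does not state: that the DWORDs are little-endian. It also does not verify the CRC, because the note does not say which bytes the CRC covers. The names `AudioRecord` and `parse_audio_record` are illustrative.

```python
import struct
from typing import NamedTuple

# Assumption: little-endian DWORDs. The byte order is not given in this document.
HEADER = struct.Struct("<IIII")
MAX_SIGNED_DWORD = 0x7FFFFFFF


class AudioRecord(NamedTuple):
    crc32: int
    format_flag: int
    sample_count: int
    body: bytes


def parse_audio_record(data: bytes) -> AudioRecord:
    """Split a record into its four header fields and the binary body."""
    if len(data) < HEADER.size:
        raise ValueError("record is shorter than the 16-byte header")
    crc32, format_flag, sample_count, body_size = HEADER.unpack_from(data)
    if sample_count > MAX_SIGNED_DWORD or body_size > MAX_SIGNED_DWORD:
        raise ValueError("sample count or body size is outside the valid range")
    body = data[HEADER.size:HEADER.size + body_size]
    if len(body) != body_size:
        raise ValueError("binary body is shorter than the declared size")
    # The CRC is returned as-is; its coverage is not specified here.
    return AudioRecord(crc32, format_flag, sample_count, body)
```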

Sample application

For implementation details, see the Visual C#.NET text-to-speech sample application.

Supported locales and voice fonts

The following table identifies some of the supported locales and related voice fonts.

| Locale | Gender | Service name mapping |
| --- | --- | --- |
| ar-EG* | Female | Microsoft Server Speech Text to Speech Voice (ar-EG, Hoda) |
| ar-SA | Male | Microsoft Server Speech Text to Speech Voice (ar-SA, Naayf) |
| bg-BG | Male | Microsoft Server Speech Text to Speech Voice (bg-BG, Ivan) |
| ca-ES | Female | Microsoft Server Speech Text to Speech Voice (ca-ES, HerenaRUS) |
| cs-CZ | Male | Microsoft Server Speech Text to Speech Voice (cs-CZ, Jakub) |
| da-DK | Female | Microsoft Server Speech Text to Speech Voice (da-DK, HelleRUS) |
| de-AT | Male | Microsoft Server Speech Text to Speech Voice (de-AT, Michael) |
| de-CH | Male | Microsoft Server Speech Text to Speech Voice (de-CH, Karsten) |
| de-DE | Female | Microsoft Server Speech Text to Speech Voice (de-DE, Hedda) |
| de-DE | Female | Microsoft Server Speech Text to Speech Voice (de-DE, HeddaRUS) |
| de-DE | Male | Microsoft Server Speech Text to Speech Voice (de-DE, Stefan, Apollo) |
| el-GR | Male | Microsoft Server Speech Text to Speech Voice (el-GR, Stefanos) |
| en-AU | Female | Microsoft Server Speech Text to Speech Voice (en-AU, Catherine) |
| en-AU | Female | Microsoft Server Speech Text to Speech Voice (en-AU, HayleyRUS) |
| en-CA | Female | Microsoft Server Speech Text to Speech Voice (en-CA, Linda) |
| en-CA | Female | Microsoft Server Speech Text to Speech Voice (en-CA, HeatherRUS) |
| en-GB | Female | Microsoft Server Speech Text to Speech Voice (en-GB, Susan, Apollo) |
| en-GB | Female | Microsoft Server Speech Text to Speech Voice (en-GB, HazelRUS) |
| en-GB | Male | Microsoft Server Speech Text to Speech Voice (en-GB, George, Apollo) |
| en-IE | Male | Microsoft Server Speech Text to Speech Voice (en-IE, Sean) |
| en-IN | Female | Microsoft Server Speech Text to Speech Voice (en-IN, Heera, Apollo) |
| en-IN | Female | Microsoft Server Speech Text to Speech Voice (en-IN, PriyaRUS) |
| en-IN | Male | Microsoft Server Speech Text to Speech Voice (en-IN, Ravi, Apollo) |
| en-US | Female | Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS) |
| en-US | Female | Microsoft Server Speech Text to Speech Voice (en-US, JessaRUS) |
| en-US | Male | Microsoft Server Speech Text to Speech Voice (en-US, BenjaminRUS) |
| es-ES | Female | Microsoft Server Speech Text to Speech Voice (es-ES, Laura, Apollo) |
| es-ES | Female | Microsoft Server Speech Text to Speech Voice (es-ES, HelenaRUS) |
| es-ES | Male | Microsoft Server Speech Text to Speech Voice (es-ES, Pablo, Apollo) |
| es-MX | Female | Microsoft Server Speech Text to Speech Voice (es-MX, HildaRUS) |
| es-MX | Male | Microsoft Server Speech Text to Speech Voice (es-MX, Raul, Apollo) |
| fi-FI | Female | Microsoft Server Speech Text to Speech Voice (fi-FI, HeidiRUS) |
| fr-CA | Female | Microsoft Server Speech Text to Speech Voice (fr-CA, Caroline) |
| fr-CA | Female | Microsoft Server Speech Text to Speech Voice (fr-CA, HarmonieRUS) |
| fr-CH | Male | Microsoft Server Speech Text to Speech Voice (fr-CH, Guillaume) |
| fr-FR | Female | Microsoft Server Speech Text to Speech Voice (fr-FR, Julie, Apollo) |
| fr-FR | Female | Microsoft Server Speech Text to Speech Voice (fr-FR, HortenseRUS) |
| fr-FR | Male | Microsoft Server Speech Text to Speech Voice (fr-FR, Paul, Apollo) |
| he-IL | Male | Microsoft Server Speech Text to Speech Voice (he-IL, Asaf) |
| hi-IN | Female | Microsoft Server Speech Text to Speech Voice (hi-IN, Kalpana, Apollo) |
| hi-IN | Female | Microsoft Server Speech Text to Speech Voice (hi-IN, Kalpana) |
| hi-IN | Male | Microsoft Server Speech Text to Speech Voice (hi-IN, Hemant) |
| hr-HR | Male | Microsoft Server Speech Text to Speech Voice (hr-HR, Matej) |
| hu-HU | Male | Microsoft Server Speech Text to Speech Voice (hu-HU, Szabolcs) |
| id-ID | Male | Microsoft Server Speech Text to Speech Voice (id-ID, Andika) |
| it-IT | Male | Microsoft Server Speech Text to Speech Voice (it-IT, Cosimo, Apollo) |
| it-IT | Female | Microsoft Server Speech Text to Speech Voice (it-IT, LuciaRUS) |
| ja-JP | Female | Microsoft Server Speech Text to Speech Voice (ja-JP, Ayumi, Apollo) |
| ja-JP | Male | Microsoft Server Speech Text to Speech Voice (ja-JP, Ichiro, Apollo) |
| ja-JP | Female | Microsoft Server Speech Text to Speech Voice (ja-JP, HarukaRUS) |
| ko-KR | Female | Microsoft Server Speech Text to Speech Voice (ko-KR, HeamiRUS) |
| ms-MY | Male | Microsoft Server Speech Text to Speech Voice (ms-MY, Rizwan) |
| nb-NO | Female | Microsoft Server Speech Text to Speech Voice (nb-NO, HuldaRUS) |
| nl-NL | Female | Microsoft Server Speech Text to Speech Voice (nl-NL, HannaRUS) |
| pl-PL | Female | Microsoft Server Speech Text to Speech Voice (pl-PL, PaulinaRUS) |
| pt-BR | Female | Microsoft Server Speech Text to Speech Voice (pt-BR, HeloisaRUS) |
| pt-BR | Male | Microsoft Server Speech Text to Speech Voice (pt-BR, Daniel, Apollo) |
| pt-PT | Female | Microsoft Server Speech Text to Speech Voice (pt-PT, HeliaRUS) |
| ro-RO | Male | Microsoft Server Speech Text to Speech Voice (ro-RO, Andrei) |
| ru-RU | Female | Microsoft Server Speech Text to Speech Voice (ru-RU, Irina, Apollo) |
| ru-RU | Male | Microsoft Server Speech Text to Speech Voice (ru-RU, Pavel, Apollo) |
| ru-RU | Female | Microsoft Server Speech Text to Speech Voice (ru-RU, EkaterinaRUS) |
| sk-SK | Male | Microsoft Server Speech Text to Speech Voice (sk-SK, Filip) |
| sl-SI | Male | Microsoft Server Speech Text to Speech Voice (sl-SI, Lado) |
| sv-SE | Female | Microsoft Server Speech Text to Speech Voice (sv-SE, HedvigRUS) |
| ta-IN | Male | Microsoft Server Speech Text to Speech Voice (ta-IN, Valluvar) |
| th-TH | Male | Microsoft Server Speech Text to Speech Voice (th-TH, Pattara) |
| tr-TR | Female | Microsoft Server Speech Text to Speech Voice (tr-TR, SedaRUS) |
| vi-VN | Male | Microsoft Server Speech Text to Speech Voice (vi-VN, An) |
| zh-CN | Female | Microsoft Server Speech Text to Speech Voice (zh-CN, HuihuiRUS) |
| zh-CN | Female | Microsoft Server Speech Text to Speech Voice (zh-CN, Yaoyao, Apollo) |
| zh-CN | Male | Microsoft Server Speech Text to Speech Voice (zh-CN, Kangkang, Apollo) |
| zh-HK | Female | Microsoft Server Speech Text to Speech Voice (zh-HK, Tracy, Apollo) |
| zh-HK | Female | Microsoft Server Speech Text to Speech Voice (zh-HK, TracyRUS) |
| zh-HK | Male | Microsoft Server Speech Text to Speech Voice (zh-HK, Danny, Apollo) |
| zh-TW | Female | Microsoft Server Speech Text to Speech Voice (zh-TW, Yating, Apollo) |
| zh-TW | Female | Microsoft Server Speech Text to Speech Voice (zh-TW, HanHanRUS) |
| zh-TW | Male | Microsoft Server Speech Text to Speech Voice (zh-TW, Zhiwei, Apollo) |

*ar-EG supports Modern Standard Arabic (MSA).
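Every service name in the table follows the same pattern: a fixed prefix followed by the locale and the voice in parentheses. The sketch below builds names from that pattern and looks up voices by locale and gender. The `VOICES` dictionary is a small excerpt copied from the table, and the function names are assumptions for illustration.

```python
VOICE_PREFIX = "Microsoft Server Speech Text to Speech Voice"

# Excerpt of the table above: (locale, gender) -> voice names.
VOICES = {
    ("en-US", "Female"): ["ZiraRUS", "JessaRUS"],
    ("en-US", "Male"): ["BenjaminRUS"],
    ("de-DE", "Male"): ["Stefan, Apollo"],
    ("ko-KR", "Female"): ["HeamiRUS"],
}


def service_name(locale: str, voice: str) -> str:
    """Format the full service name used in the SSML voice element."""
    return f"{VOICE_PREFIX} ({locale}, {voice})"


def voices_for(locale: str, gender: str) -> list:
    """List the full service names known for a locale and gender."""
    return [service_name(locale, voice) for voice in VOICES.get((locale, gender), [])]


print(voices_for("en-US", "Female"))
```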

Note

To optimize the Bing Speech API's capabilities, the previous service names Microsoft Server Speech Text to Speech Voice (cs-CZ, Vit) and Microsoft Server Speech Text to Speech Voice (en-IE, Shaun) are deprecated after March 31, 2018. Update your code to use the current names.

Troubleshooting and support

Post all questions and issues to the Bing Speech Service MSDN forum. Include complete details, such as:

  • An example of the full request string.
  • If applicable, the full output of a failed request, which includes log IDs.
  • The percentage of requests that are failing.