What is speech translation?

Important

TLS 1.2 is now enforced for all HTTP requests to this service. For more information, see Azure Cognitive Services security.

This overview covers the benefits and capabilities of the speech translation service, which enables real-time, multi-language speech-to-speech and speech-to-text translation of audio streams. With the Speech SDK, your applications, tools, and devices have access to source transcriptions and translation outputs for the audio you provide. Interim transcription and translation results are returned as speech is detected, and final results can be converted into synthesized speech.

Microsoft's translation engine is powered by two different approaches: statistical machine translation (SMT) and neural machine translation (NMT). SMT uses advanced statistical analysis to estimate the best possible translation given the context of a few words. NMT uses neural networks and the full context of each sentence to translate words, which yields more accurate, natural-sounding translations.

Today, Microsoft uses NMT for translation to most popular languages. All languages available for speech-to-speech translation are powered by NMT. Speech-to-text translation may use SMT or NMT, depending on the language pair. When the target language is supported by NMT, the full translation is NMT-powered. When the target language isn't supported by NMT, the translation is a hybrid of NMT and SMT, using English as a "pivot" between the two languages.
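The selection rule for speech-to-text translation described above can be sketched as a small decision function. This is only an illustration of the rule as stated here, not how the service is implemented: the function name `describe_engine` and the set of NMT-supported target languages passed to it are hypothetical, and the real list of supported languages comes from the service documentation.

```python
def describe_engine(target_language, nmt_target_languages):
    """Describe which approach would translate into target_language.

    nmt_target_languages is a hypothetical set of language codes assumed
    to be supported by NMT; it is supplied by the caller for illustration.
    """
    if target_language in nmt_target_languages:
        # Target supported by NMT: the full translation is NMT-powered.
        return "NMT"
    # Target not supported by NMT: hybrid of NMT and SMT,
    # with English as the pivot between the two languages.
    return "hybrid NMT+SMT via English pivot"
```

For example, with an assumed NMT set of `{"de", "fr"}`, a request for `"de"` falls in the first branch, while any code outside the set falls in the hybrid branch.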

Core features

  • Speech-to-text translation with recognition results.
  • Speech-to-speech translation.
  • Support for translation to multiple target languages.
  • Interim recognition and translation results.

Get started

See the quickstart to get started with speech translation. The speech translation service is available via the Speech SDK and the Speech CLI.
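As a rough idea of what a first call might look like, here is a minimal sketch using the Speech SDK for Python (the `azure-cognitiveservices-speech` package). It is not taken from the quickstart. The function names `translate_once` and `format_translations`, the default languages, and the use of the default microphone are assumptions made for illustration; you supply your own subscription key and region.

```python
def format_translations(translations):
    """Render a {language: text} mapping as sorted 'lang: text' lines."""
    return [f"{lang}: {text}" for lang, text in sorted(translations.items())]


def translate_once(key, region, source_language="en-US",
                   target_languages=("de", "fr")):
    """Recognize one utterance from the default microphone and translate it.

    Returns (recognized_text, {target_language: translated_text}).
    """
    # Imported lazily so this file still loads where the SDK is not installed.
    import azure.cognitiveservices.speech as speechsdk

    config = speechsdk.translation.SpeechTranslationConfig(
        subscription=key, region=region)
    config.speech_recognition_language = source_language
    for language in target_languages:
        # One call per target language; several targets are allowed.
        config.add_target_language(language)

    # With no audio_config given, the SDK reads from the default microphone.
    recognizer = speechsdk.translation.TranslationRecognizer(
        translation_config=config)

    # Interim results arrive through the 'recognizing' event while
    # speech is still being detected.
    recognizer.recognizing.connect(
        lambda evt: print("interim:", evt.result.text))

    result = recognizer.recognize_once()
    if result.reason == speechsdk.ResultReason.TranslatedSpeech:
        return result.text, dict(result.translations)
    # No translation available (for example, no speech was matched).
    return result.text, {}
```

A call such as `text, translations = translate_once(my_key, my_region)` followed by `print("\n".join(format_translations(translations)))` would print one line per target language, where `my_key` and `my_region` stand in for your own Speech resource credentials.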

Sample code

Sample code for the Speech SDK is available on GitHub. These samples cover common scenarios such as reading audio from a file or stream, continuous and single-shot recognition/translation, and working with custom models.

Migration guides

If your applications, tools, or products are using the Translator Speech API, we've created guides to help you migrate to the Speech service.

Reference docs

Next steps