您现在访问的是微软AZURE全球版技术文档网站,若需要访问由世纪互联运营的MICROSOFT AZURE中国区技术文档网站,请访问 https://docs.azure.cn.

什么是 Web 语言模型 API?What is the Web Language Model API? (预览版)(Preview)

重要

Web 语言模型预览版于 2018 年 8 月 9 日停止使用。The Web Language Model preview was decommissioned on August 9, 2018. 我们建议使用 Azure 机器学习文本分析模块进行文本处理和分析。We recommend using Azure Machine Learning text analytics modules for text processing and analysis.

Microsoft Web 语言模型 API 是一种基于 REST 的云服务,它提供了用于自然语言处理的最先进工具。The Microsoft Web Language Model API is a REST-based cloud service providing state-of-the-art tools for natural language processing. 使用此 API,通过基于必应在美国市场收集的网络级语料库训练的语言模型,你的应用程序可以利用大数据的力量。Using this API, your application can leverage the power of big data through language models trained on web-scale corpora collected by Bing in the en-US market.

这些平滑回退 N-gram 语言模型最多支持 5 阶马尔可夫链,并且是基于以下语料库训练的:These smoothed backoff N-gram language models, supporting up to fifth-order Markov chains, are trained on the following corpora:

  • 网页正文文本Web page body text
  • 网页标题文本Web page title text
  • 网页定位标记文本Web page anchor text
  • Web 搜索查询文本Web search query text

Web 语言模型 API 支持四个查找操作:The Web Language Model API supports four lookup operations:

  1. 单词序列的联合 (log10) 概率。Joint (log10) probability of a sequence of words.
  2. 根据给定的先前单词的序列,确定某个单词的联合 (log10) 概率。Conditional (log10) probability of one word given a sequence of preceding words.
  3. 最可能接在给定词语序列后的词语列表(结束词)。List of words (completions) most likely to follow a given sequence of words.
  4. 未含空格的字符串的分词。Word breaking of strings that contain no spaces.

入门Getting Started

  1. 订阅服务。Subscribe to the service.
  2. 下载 SDKDownload the SDK.
  3. 运行 SDK 示例代码。Run the SDK sample code.
  4. 有关终结点的完整详细信息,包括采用各种语言的代码片段,请参阅 API 参考Refer to the API Reference for full details of the endpoints, including code snippets in a variety of languages.

基础技术Underlying Technology

以下文章提供了有关这些语言模型的开发的详细信息,并且应当在使用此服务的研究出版物中引用:The following paper provides details on the development of these language models, and should be cited in research publications that use this service:

有关引用此作品的文章的当前列表,请单击此处Click here for a current list of papers citing this work.