`ai_summarize` 函数

项目
03/06/2024

适用于： check marked yes Databricks SQL

重要

此功能目前以公共预览版提供。

在预览版中，

基础语言模型可以处理多种语言，但这些函数针对英语进行了优化。
基础模型 API 存在速率限制，请参阅基础模型 API 限制，了解有关这些限制的最新信息。

ai_summarize() 函数允许调用最先进的生成式 AI 模型，使用 SQL 生成给定文本的摘要。此函数使用通过 Databricks 基础模型 API 提供的聊天模型服务终结点。

要求

重要

目前可使用的基础模型已获得 Apache 2.0 许可证或 Llama 2 社区许可证的许可。 Databricks 建议查看这些许可证，以确保遵守任何适用的条款。如果未来出现的模型按 Databricks 的内部基准而言性能更佳，Databricks 可能会更改模型（以及本页提供的适用许可证列表）。

目前，Mixtral-8x7B Instruct 是支持这些 AI 函数的基础模型。

此函数仅适用于基础模型 API 按令牌付费支持的区域中的工作区。
此函数在 Azure Databricks SQL Classic 上不可用。
查看 Databricks SQL 定价页。

语法

ai_summarize(content[, max_words])

参数

content：一个 STRING 表达式，即要汇总的文本。
max_words：一个可选的非负整数数值表达式，表示返回的摘要文本中的最大努力目标单词数。默认值为 50。如果设置为 0，则没有字数限制。

一个 STRING。

如果 content 为 NULL，则结果为 NULL。

示例

> SELECT ai_summarize(
    'Apache Spark is a unified analytics engine for large-scale data processing. ' ||
    'It provides high-level APIs in Java, Scala, Python and R, and an optimized ' ||
    'engine that supports general execution graphs. It also supports a rich set ' ||
    'of higher-level tools including Spark SQL for SQL and structured data ' ||
    'processing, pandas API on Spark for pandas workloads, MLlib for machine ' ||
    'learning, GraphX for graph processing, and Structured Streaming for incremental ' ||
    'computation and stream processing.',
    20
  )
 "Apache Spark is a unified, multi-language analytics engine for large-scale data processing
 with additional tools for SQL, machine learning, graph processing, and stream computing."

ai_summarize 函数

要求

语法

参数

返回

示例

相关函数

其他资源

`ai_summarize` 函数