Koalas

Koalas allows you to use the pandas DataFrame API to access data in Apache Spark.
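As a rough illustration of what sharing the pandas DataFrame API means in practice, the sketch below writes one aggregation function and applies it to a pandas DataFrame. The column names `key` and `value` and the helper names `total_by_key` and `total_by_key_on_spark` are hypothetical, chosen only for this example. The Koalas path assumes the `koalas` package and a working Spark session are available, so it is wrapped in a function and not called here.

```python
import pandas as pd


def total_by_key(df):
    """Sum `value` per `key` using calls that exist in both pandas and Koalas."""
    return df.groupby("key")["value"].sum().sort_index()


def total_by_key_on_spark(pdf):
    """Run the same aggregation through Koalas (assumes Koalas and Spark are installed)."""
    import databricks.koalas as ks  # imported lazily so the pandas path runs anywhere

    kdf = ks.from_pandas(pdf)               # distribute the pandas data as a Koalas DataFrame
    return total_by_key(kdf).to_pandas()    # collect the small result back to pandas


# Hypothetical sample data for illustration only.
pdf = pd.DataFrame({"key": ["a", "b", "a"], "value": [1, 2, 3]})
pandas_result = total_by_key(pdf)
print(pandas_result)
```

On a cluster where Koalas is installed, calling `total_by_key_on_spark(pdf)` should give the same totals, with the aggregation executed by Spark instead of by pandas on a single machine.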

Requirements

On Databricks Runtime 7.0 or below, install Koalas as an Azure Databricks PyPI library.

Notebook

The following notebook shows how to migrate from pandas to Koalas.

pandas to Koalas notebook


Resources