Koalas

Koalas implements the pandas DataFrame API on top of Apache Spark, so you can work with data in Spark using familiar pandas syntax.

Requirements

On Databricks Runtime 7.0 or below, install Koalas as an Azure Databricks PyPI library. The PyPI package name is koalas.

Notebook

The following notebook shows how to migrate from pandas to Koalas.

pandas to Koalas notebook
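
As a rough illustration of what such a migration can look like (a minimal sketch, not the contents of the notebook itself), the example below writes a small aggregation against a library handle instead of against pandas directly. The helper name build_summary and the sample data are hypothetical. The sketch assumes that the calls it uses (DataFrame, groupby, sum, sort_index) behave the same way in both libraries, which holds for these basic operations.

```python
import pandas as pd


def build_summary(lib):
    """Total quantity per fruit, using only calls shared by pandas and Koalas.

    `lib` is the DataFrame library to use: the pandas module here, or the
    Koalas module on a cluster where Koalas is installed (assumption).
    """
    df = lib.DataFrame(
        {"fruit": ["apple", "pear", "apple"], "qty": [3, 5, 4]}
    )
    # Sort the index explicitly: a distributed backend does not promise
    # any particular row order after a groupby.
    return df.groupby("fruit")["qty"].sum().sort_index()


# Runs locally with plain pandas.
pandas_result = build_summary(pd)
print(pandas_result)
```

On a cluster where Koalas is available, the same helper should work after `import databricks.koalas as ks` by calling `build_summary(ks)`. The result is then a Koalas Series backed by Spark, and `.to_pandas()` collects it to the driver if a local pandas object is needed.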


Resources