Scikit-learn 入门-了解 Azure DatabricksGet started with scikit-learn in Azure Databricks

本10分钟的教程旨在作为 Databricks 中的机器学习简介。This 10-minute tutorial is designed as an introduction to machine learning in Databricks. 它使用来自热门机器学习包 scikit-learn 的算法-与MLflow一起学习,以跟踪模型开发流程,并使用Hyperopt自动执行超参数优化。It uses algorithms from the popular machine learning package scikit-learn along with MLflow for tracking the model development process and Hyperopt to automate hyperparameter tuning.

要求Requirements

Databricks Runtime 7.0 ML 或更高版本。Databricks Runtime 7.0 ML or above.

示例笔记本Example notebooks

如果使用 Databricks Runtime 7.3 LTS ML 或更高版本,Databricks 建议使用 MLflow autologging,如此笔记本中所示。If you are using Databricks Runtime 7.3 LTS ML or above, Databricks recommends using MLflow autologging, illustrated in this notebook.

Scikit-learn 入门-学习和 MLflow autologging 笔记本Get started with scikit-learn and MLflow autologging notebook

获取笔记本Get notebook

可以将以下笔记本与 Databricks Runtime 7.0 ML 或更高版本配合使用。You can use the following notebook with Databricks Runtime 7.0 ML or above. 此笔记本使用手动 MLflow 日志记录来跟踪模型开发。This notebook uses manual MLflow logging to track model development.

Scikit-learn 入门-了解笔记本Get started with scikit-learn notebook

获取笔记本Get notebook