Team Data Science Process Documentation

The Team Data Science Process is an agile, iterative data science methodology to deliver predictive analytics solutions and intelligent applications efficiently. TDSP helps improve team collaboration and learning. It contains a distillation of the best practices and structures from Microsoft and others in the industry that facilitate the successful implementation of data science initiatives. The goal is to help companies fully realize the benefits of their analytics program.

5-Minute Quickstarts

Learn about the Team Data Science Process:

Step-by-Step Tutorials

Learn how to use the Team Data Science Process in various scenarios:

  1. Spark with PySpark and Scala
  2. Hive with HDInsight Hadoop
  3. U-SQL with Azure Data Lake
  4. R, Python and T-SQL with SQL Server
  5. T-SQL and Python with SQL DW