Microsoft Azure Databricks – Collaborative Apache Spark Analytics Platform

I was having a conversation with some colleagues about an institution that wanted to understand ways of integrating Azure’s data science services into its curriculum for the new semester.

One of the suggestions we came up with was to use Microsoft Azure DSVMs, HDInsight clusters, and Databricks notebooks.

I have talked a lot in past blogs about the Microsoft Data Science Virtual Machine (DSVM) and HDInsight, but one of the new services we have in Azure is Azure Databricks.

A quick introduction to Azure Databricks

Azure Databricks is a fast, easy, and collaborative Apache Spark-based analytics platform optimized for Azure. Designed in collaboration with the founders of Apache Spark, Azure Databricks combines the best of Databricks and Azure to help customers accelerate innovation with one-click set up, streamlined workflows, and an interactive workspace that enables collaboration between data scientists, data engineers, and business analysts. As an Azure service, customers automatically benefit from the native integration with other Azure services such as Power BI, SQL Data Warehouse, and Cosmos DB, as well as from enterprise-grade Azure security, including Active Directory integration, compliance, and enterprise-grade SLAs.

See what's possible with Azure Databricks

Watch how to get started with Apache Spark on Azure Databricks

View critical use cases to solve your big data and AI challenges

Read the three practical use cases for data scientists

Educational Materials and Tutorials for Databricks

This is the core source for Azure Databricks and Spark training material. The notebooks in here are available to help you learn more about best practices in Spark. Educators around the world, including Azure Databricks trainers, created this material to help users learn how to use Apache Spark.

Additional Resources and Material for Educators

We have a great set of Azure Labs for Students and Educators to get you started at https://github.com/Microsoft/computerscience

Specific Big Data and HDInsight labs are at https://github.com/Microsoft/computerscience/tree/master/Labs/Big%20Data%20and%20Analytics

Azure HPC labs are at https://github.com/Microsoft/computerscience/tree/master/Labs/Azure%20Services/Azure%20HPC

We also have notebook labs at https://github.com/Microsoft/computerscience/tree/master/Labs/AI%20and%20Machine%20Learning