Use Apache Spark in Azure Databricks

Module
9 Units

Intermediate

Data Engineer

Azure Databricks

Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze and visualize data at scale.

Learning objectives

In this module, you'll learn how to:

Describe key elements of the Apache Spark architecture.
Create and configure a Spark cluster.
Describe use cases for Spark.
Use Spark to process and analyze data stored in files.
Use Spark to visualize data.

Prerequisites

Before starting this module, you should have a basic knowledge of Azure Databricks. Consider completing the Explore Azure Databricks module before this one.

Introduction min
Get to know Spark min
Create a Spark cluster min
Use Spark in notebooks min
Use Spark to work with data files min
Visualize data min
Exercise - Use Spark in Azure Databricks min
Knowledge check min
Summary min