Creating a Synapse workspace

In this tutorial, you'll learn how to create a Synapse workspace, a dedicated SQL pool, and a serverless Apache Spark pool.

Prerequisites

To complete this tutorial, you need access to a resource group in which you are assigned the Owner role. Create the Synapse workspace in this resource group.

Create a Synapse workspace in the Azure portal

  1. Open the Azure portal, and in the search bar at the top, search for Synapse.
  2. In the search results, under Services, select Azure Synapse Analytics.
  3. Select Add to create a workspace.
  4. In the Basics tab, under Project Details, select your preferred Subscription, Resource group, and Region, and then choose a workspace name. This tutorial uses myworkspace.
  5. Next to Select Data Lake Storage Gen2, click the button for From subscription.
  6. Next to Account name, click Create New and name the new storage account contosolake or similar, because this name must be unique.
  7. Next to File system name, click Create New and name it users. This creates a storage container called users.
  8. The workspace will use this storage account as the "primary" storage account for Spark tables and Spark application logs.
  9. Check the "Assign myself the Storage Blob Data Contributor role on the Data Lake Storage Gen2 account" box.
  10. Select Review + create > Create. Your workspace will be ready in a few minutes.
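
The storage account name in step 6 has to follow Azure's naming rules as well as being unique. As a minimal sketch, the Python check below tests only the format rule (3 to 24 characters, lowercase letters and digits). It can't tell whether a name is already taken. The function name is_valid_storage_account_name is a hypothetical helper, not part of any Azure SDK.

```python
import re

# Format rule for Azure storage account names:
# 3 to 24 characters, lowercase letters and digits only.
_STORAGE_NAME = re.compile(r"[a-z0-9]{3,24}")


def is_valid_storage_account_name(name: str) -> bool:
    """Return True if the name satisfies the format rule (not uniqueness)."""
    return _STORAGE_NAME.fullmatch(name) is not None


# Try the tutorial's example name and two names that break the rule.
for candidate in ("contosolake", "Contoso-Lake", "cl"):
    print(candidate, is_valid_storage_account_name(candidate))
```

If the portal reports that a correctly formatted name is unavailable, append a few digits or letters, as the "or similar" in step 6 suggests.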

Note

To enable workspace features from an existing dedicated SQL pool (formerly SQL DW), refer to How to enable a workspace for your dedicated SQL pool (formerly SQL DW).

Open Synapse Studio

After your Azure Synapse workspace is created, you have two ways to open Synapse Studio:

  • Open your Synapse workspace in the Azure portal. In the Overview section of the Synapse workspace, select Open in the Open Synapse Studio box.
  • Go to https://web.azuresynapse.net and sign in to your workspace.

The Built-in serverless SQL pool

Every workspace comes with a pre-built serverless SQL pool called Built-in. This pool can't be deleted. Serverless SQL pools let you use SQL without having to reserve capacity with dedicated SQL pools. Unlike dedicated SQL pools, billing for a serverless SQL pool is based on the amount of data scanned to run the query, not the amount of capacity allocated to the pool.
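
To make the difference between the two billing models concrete, here is a minimal sketch. The function names and the prices are hypothetical placeholders chosen for illustration, not actual Azure rates. Check the Azure pricing page for real figures.

```python
def serverless_cost(tb_scanned: float, price_per_tb: float) -> float:
    """Serverless model: cost grows with the data each query scans."""
    return tb_scanned * price_per_tb


def dedicated_cost(hours_active: float, price_per_hour: float) -> float:
    """Dedicated model: cost grows with the time the reserved capacity is active."""
    return hours_active * price_per_hour


# Hypothetical prices, for illustration only.
EXAMPLE_PRICE_PER_TB = 5.0
EXAMPLE_PRICE_PER_HOUR = 1.5

# A query workload that scans 2 TB in total on the serverless pool.
print(serverless_cost(2.0, EXAMPLE_PRICE_PER_TB))

# A dedicated pool left active for 3 hours, regardless of data scanned.
print(dedicated_cost(3.0, EXAMPLE_PRICE_PER_HOUR))
```

The point of the sketch is the input to each function: the serverless estimate depends only on data scanned, while the dedicated estimate depends only on how long the pool stays active.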

Create a dedicated SQL pool

  1. In Synapse Studio, on the left-side pane, select Manage > SQL pools.
  2. Select New.
  3. For SQL pool name, enter SQLPOOL1.
  4. For Performance level, choose DW100C.
  5. Select Review + create > Create. Your dedicated SQL pool will be ready in a few minutes. Your dedicated SQL pool is associated with a dedicated SQL pool database that's also called SQLPOOL1.

A dedicated SQL pool consumes billable resources as long as it's active. You can pause the pool later to reduce costs.
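
The same pool can likely also be created and paused from the command line. The sketch below only assembles the Azure CLI commands as argument lists and prints them; it does not run them. It assumes the az synapse sql pool create and az synapse sql pool pause commands with the flags shown, so confirm them with az synapse sql pool create --help before use. The resource group name my-resource-group and the helper sql_pool_commands are hypothetical, and the performance level is written as DW100c on the assumption that the CLI expects that casing.

```python
import shlex


def sql_pool_commands(resource_group, workspace, pool="SQLPOOL1", level="DW100c"):
    """Build (create, pause) Azure CLI argument lists for a dedicated SQL pool."""
    # Flags shared by both commands: which pool, in which workspace and group.
    common = [
        "--name", pool,
        "--workspace-name", workspace,
        "--resource-group", resource_group,
    ]
    create = ["az", "synapse", "sql", "pool", "create", *common,
              "--performance-level", level]
    pause = ["az", "synapse", "sql", "pool", "pause", *common]
    return create, pause


# "my-resource-group" is a placeholder; "myworkspace" is the tutorial's name.
create_cmd, pause_cmd = sql_pool_commands("my-resource-group", "myworkspace")
print(shlex.join(create_cmd))
print(shlex.join(pause_cmd))
```

Building the commands as lists keeps each argument separate, so they can be passed to subprocess.run unchanged once you have verified them.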

Note

When you create a new dedicated SQL pool (formerly SQL DW) in your workspace, the dedicated SQL pool provisioning page opens. Provisioning takes place on the logical SQL server.

Create a serverless Apache Spark pool

  1. In Synapse Studio, on the left-side pane, select Manage > Apache Spark pools.
  2. Select New.
  3. For Apache Spark pool name, enter Spark1.
  4. For Node size, select Small.
  5. For Number of nodes, set the minimum to 3 and the maximum to 3.
  6. Select Review + create > Create. Your Apache Spark pool will be ready in a few seconds.

The Spark pool tells Azure Synapse how many Spark resources to use. You only pay for the resources that you use. When you stop using the pool, its resources automatically time out and are recycled.
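
As a rough command-line counterpart to the steps above, the sketch below assembles an az synapse spark pool create command without running it. It assumes that command and the flags shown exist in your Azure CLI version, and that a fixed node count of 3 corresponds to setting both the minimum and the maximum to 3. The Spark version is not stated in this tutorial, so it is a required argument here, and the value 3.4 in the usage line is only an assumption. The helper name and my-resource-group are hypothetical.

```python
import shlex


def spark_pool_create_command(resource_group, workspace, spark_version,
                              pool="Spark1", node_size="Small", node_count=3):
    """Build the Azure CLI argument list for creating an Apache Spark pool."""
    return [
        "az", "synapse", "spark", "pool", "create",
        "--name", pool,
        "--workspace-name", workspace,
        "--resource-group", resource_group,
        "--spark-version", spark_version,  # not given in the tutorial
        "--node-size", node_size,
        "--node-count", str(node_count),   # assumed equivalent to min = max = 3
    ]


# Placeholder resource group and an assumed Spark version, for illustration.
spark_cmd = spark_pool_create_command("my-resource-group", "myworkspace",
                                      spark_version="3.4")
print(shlex.join(spark_cmd))
```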

Next steps