Get started with SQL Server Big Data Clusters

Applies to: SQL Server. Does not apply to: Azure SQL Database, Azure Synapse Analytics (SQL DW), Parallel Data Warehouse.

This article provides an overview of how to deploy SQL Server 2019 Big Data Clusters.

For other deployment scenarios, see:

The article orients you to the concepts and provides a framework for understanding the other deployment articles in this section. Your specific deployment steps vary based on your platform choices for the client and server.

Tip

To quickly stand up an environment with Kubernetes and a big data cluster so that you can explore its capabilities, use one of the sample scripts listed in the Deployment scripts section. After deployment, manage the cluster with the client tools described in the following section.


Client tools

Big data clusters require a specific set of client tools. Before you deploy a big data cluster to Kubernetes, you should install the following tools:

Tool                        Description
azdata                      Deploys and manages big data clusters.
kubectl                     Creates and manages the underlying Kubernetes cluster.
Azure Data Studio           Graphical interface for using the big data cluster.
SQL Server 2019 extension   Azure Data Studio extension that enables big data cluster features.

Other scenarios require additional tools. Each article lists the prerequisite tools for its specific task. For a full list of tools and installation links, see Install SQL Server 2019 big data tools.
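Before deploying, it can help to confirm that the two command-line tools from the table, azdata and kubectl, are actually reachable from your shell. The following is a minimal sketch, not an official check: the helper name missing_tools is hypothetical, and it only looks for the executables on PATH. It does not verify versions, and it does not cover Azure Data Studio or its extension, which are graphical tools.

```python
import shutil

# Command-line tools named in the client tools table. Azure Data Studio and
# its extension are graphical, so this sketch does not check for them.
REQUIRED_CLI_TOOLS = ("azdata", "kubectl")


def missing_tools(tools=REQUIRED_CLI_TOOLS, which=shutil.which):
    """Return the tools that cannot be found on PATH, in the order given.

    `which` is injectable so the function can be exercised without the
    real tools installed (hypothetical helper, for illustration only).
    """
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Install before deploying:", ", ".join(missing))
    else:
        print("azdata and kubectl are both on PATH.")
```

Finding a tool on PATH says nothing about whether it is a supported version, so treat this as a first sanity check only.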

Kubernetes

Big data clusters are deployed as a series of interrelated containers that are managed in Kubernetes. You can host Kubernetes in a variety of ways. Even if you already have an existing Kubernetes environment, you should review the related requirements for big data clusters.
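When you reuse an existing Kubernetes environment, a quick look at node health is a reasonable first step before checking it against the big data cluster requirements. The sketch below assumes you have saved or piped the output of kubectl get nodes -o json. The function name ready_node_names is hypothetical, and the code relies only on the standard Kubernetes node list layout (items, metadata.name, status.conditions). It does not encode any big data cluster sizing requirement.

```python
import json


def ready_node_names(nodes_json: str) -> list:
    """Return the names of nodes whose "Ready" condition has status "True".

    `nodes_json` is expected to be the text produced by
    `kubectl get nodes -o json` (hypothetical helper, for illustration).
    """
    ready = []
    for node in json.loads(nodes_json).get("items", []):
        conditions = node.get("status", {}).get("conditions", [])
        if any(
            c.get("type") == "Ready" and c.get("status") == "True"
            for c in conditions
        ):
            ready.append(node["metadata"]["name"])
    return ready


# Made-up sample input in the shape kubectl emits, for demonstration only.
SAMPLE = json.dumps({
    "items": [
        {"metadata": {"name": "node-a"},
         "status": {"conditions": [{"type": "Ready", "status": "True"}]}},
        {"metadata": {"name": "node-b"},
         "status": {"conditions": [{"type": "Ready", "status": "False"}]}},
    ]
})

if __name__ == "__main__":
    print(ready_node_names(SAMPLE))
```

A node count is only one input. The related requirements for big data clusters remain the authoritative reference.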

Deploy a big data cluster

After configuring Kubernetes, you deploy a big data cluster with the azdata bdc create command. When deploying, you can take several different approaches.
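As one illustration of the azdata bdc create approach, the sketch below assembles the command line and passes the administrator credentials through environment variables. Several details here are assumptions rather than statements from this article: the --config-profile and --accept-eula options, the AZDATA_USERNAME and AZDATA_PASSWORD variable names, and the example profile name aks-dev-test all reflect one recollection of the azdata CLI and can differ between releases, so confirm them with azdata bdc create --help. The helper names build_create_command and deploy are hypothetical.

```python
import os
import subprocess


def build_create_command(config_profile: str) -> list:
    """Build the argv for `azdata bdc create` (options are assumptions;
    verify them with `azdata bdc create --help` for your azdata version)."""
    return [
        "azdata", "bdc", "create",
        "--config-profile", config_profile,
        "--accept-eula", "yes",
    ]


def deploy(config_profile: str, username: str, password: str,
           run=subprocess.run):
    """Run the deployment with credentials supplied via the environment.

    The variable names AZDATA_USERNAME / AZDATA_PASSWORD are assumed.
    `run` is injectable so the call can be exercised without azdata.
    """
    env = dict(os.environ, AZDATA_USERNAME=username, AZDATA_PASSWORD=password)
    return run(build_create_command(config_profile), env=env, check=True)


if __name__ == "__main__":
    # Only print the command; calling deploy() would start a real deployment.
    print(" ".join(build_create_command("aks-dev-test")))
```

Keeping the command construction separate from execution makes it easy to review exactly what will run before a deployment starts.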

Deployment scripts

Deployment scripts can help deploy both Kubernetes and big data clusters in a single step. They also often provide default values for big data cluster settings. You can customize any deployment script by creating your own version that configures the big data cluster deployment differently.

The following deployment scripts are currently available:

Deployment notebooks

You can also deploy a big data cluster by running an Azure Data Studio notebook. For more information on how to use a notebook to deploy on AKS, see the following article:

Next steps

After you successfully deploy a big data cluster, connect to the cluster and consider loading sample data for use with several walkthroughs.