Install the Speech SDK - Azure AI services

Reference documentation | Package (NuGet) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for C#.

Code samples in the documentation are written in C# 8 and run on .NET standard 2.0.

Platform requirements

The Speech SDK for C# is compatible with Windows, Linux, and macOS.

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

On Windows, you must use the 64-bit target architecture. Windows 10 or later is required.

Install the Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022 for your platform. Installing this package for the first time might require a restart.

The Speech SDK for C# only supports the following distributions on the x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) architectures:

Ubuntu 18.04/20.04
Debian 10/11
Red Hat Enterprise Linux (RHEL) 7/8
CentOS 7

Important

Use the most recent LTS release of the Linux distribution. For example, if you are using Ubuntu 20.04 LTS, use the latest release of Ubuntu 20.04.X.

The Speech SDK depends on the following Linux system libraries:

The shared libraries of the GNU C library, including the POSIX Threads Programming library, libpthreads.
The OpenSSL library (libssl) version 1.x and certificates (ca-certificates).
The shared library for ALSA applications (libasound).

You should also install ca-certificates to establish a secure websocket and avoid the WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED error.

Important

The Speech SDK does not yet support OpenSSL 3.0, which is the default in Ubuntu 22.04 and Debian 12.

Run these commands:

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

To use the Speech SDK in Alpine Linux, create a Debian chroot environment as documented in the Alpine Linux Wiki on running glibc programs. Then follow the Debian instructions here.

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

Install the development tools and libraries:

sudo yum update
sudo yum groupinstall "Development tools"
sudo yum install alsa-lib openssl wget

Important

On RHEL/CentOS 7, follow the instructions on how to configure RHEL/CentOS 7 for Speech SDK.
On RHEL, follow the instructions on how to configure OpenSSL for Linux.

Install the Speech SDK for C#

The Speech SDK for C# is available as a NuGet package and implements .NET Standard 2.0. For more information, see Microsoft.CognitiveServices.Speech.

Terminal
PowerShell

The Speech SDK for C# can be installed from the .NET CLI by using the following dotnet add command:

dotnet add package Microsoft.CognitiveServices.Speech

The Speech SDK for C# can be installed by using the following Install-Package command:

Install-Package Microsoft.CognitiveServices.Speech

You can follow these guides for more options.

This guide shows how to install the Speech SDK for a .NET Framework (Windows) console app.

This guide requires:

Microsoft Visual C++ Redistributable for Visual Studio 2019 for the Windows platform. Installing it for the first time might require a restart.
Visual Studio.

Create a Visual Studio project and install the Speech SDK

You need to install the Speech SDK NuGet package so you can reference it in your code. To do that, you might first need to create a helloworld project. If you already have a project with the .NET desktop development workload available, you can use that project and skip to Use NuGet Package Manager to install the Speech SDK.

Create a helloworld project

Open Visual Studio.
Under Get started, select Create a new project.
In Create a new project, choose Console App (.NET Framework), and then select Next.
In Configure your new project, for Project name enter helloworld, choose or create the directory path in Location, and then select Create.
From the Visual Studio menu bar, select Tools > Get Tools and Features. This step opens Visual Studio Installer and displays the Modifying dialog box.
Check whether the .NET desktop development workload is available. If the workload isn't installed, select it, and then select Modify to start the installation. It might take a few minutes to download and install.

If .NET desktop development is already selected, select Close to close the dialog box.
Close Visual Studio Installer.

Use NuGet Package Manager to install the Speech SDK

In Solution Explorer, right-click the helloworld project, and then select Manage NuGet Packages to show NuGet Package Manager.
In the upper-right corner, find the Package Source dropdown box, and make sure that nuget.org is selected.
In the upper-left corner, select Browse.
In the search box, enter Microsoft.CognitiveServices.Speech and select Enter.
From the search results, select the Microsoft.CognitiveServices.Speech package, and then select Install to install the latest stable version.
Accept all agreements and licenses to start the installation.

After the package is installed, a confirmation appears in the Package Manager Console window.

Choose target architecture

To build and run the console application, create a platform configuration that matches your computer's architecture.

From the menu, select Build > Configuration Manager. The Configuration Manager dialog box appears.
In the Active solution platform dropdown box, select New. The New Solution Platform dialog box appears.
In the Type or select the new platform dropdown box:
- If you're running 64-bit Windows, select x64.
- If you're running 32-bit Windows, select x86.
Select OK and then Close.

This guide shows how to create a Universal Windows Platform (UWP) project and install the Speech SDK for C#. The Universal Windows Platform lets you develop apps that run on any device that supports Windows 10, including PCs, Xbox, Surface Hub, and other devices.

This guide requires:

Microsoft Visual C++ Redistributable for Visual Studio 2019 for the Windows platform. Installing this file for the first time might require a restart.
Visual Studio.

Create a Visual Studio project and install the Speech SDK

To create a Visual Studio project for UWP development, you need to:

Set up Visual Studio development options.
Create the project and select the target architecture.
Set up audio capture.
Install the Speech SDK.

Set up Visual Studio development options

Make sure you're set up correctly in Visual Studio for UWP development:

Open Visual Studio to display the start window.
Select Continue without code to go to the Visual Studio IDE.
From the Visual Studio menu bar, select Tools > Get Tools and Features to open Visual Studio Installer and view the Modifying dialog box.
On the Workloads tab, find the Universal Windows Platform development workload. If that workload is already selected, close the Modifying dialog box and close Visual Studio Installer. Skip the rest of this procedure.
Select Universal Windows Platform development, and then select Modify.
In the Before we get started dialog box, select Continue to install the UWP development workload. Installation of the new feature might take a while.
Close Visual Studio Installer.

Create the project

Next, create your project and select the target architecture:

On the Visual Studio menu bar, select File > New > Project to display the Create a new project window.
Find and select Blank App (Universal Windows). Make sure that you select the C# version of this project type, as opposed to Visual Basic.
Select Next.
In the Configure your new project dialog box, in Project name, enter helloworld.
In Location, go to and select or create the folder where you want to save your project.
Select Create.
In the New Universal Windows Platform Project window, in Minimum version (the second dropdown box), select Windows 10 Fall Creators Update (10.0; Build 16299). That requirement is the minimum for the Speech SDK.
In Target version (the first dropdown box), choose a value identical to or later than the value in Minimum version.
Select OK. You return to the Visual Studio IDE, with the new project created and visible on the Solution Explorer pane.
Select your target platform architecture. On the Visual Studio toolbar, find the Solution Platforms dropdown box. If you don't see it, select View > Toolbars > Standard to display the toolbar that contains Solution Platforms.

If you're running 64-bit Windows, select x64 in the drop-down box. 64-bit Windows can also run 32-bit applications, so you can choose x86 if you prefer.

Note

The Speech SDK supports all Intel-compatible processors, but only x64 versions of ARM processors.

Set up audio capture

Allow the project to capture audio input:

In Solution Explorer, select Package.appxmanifest to open the package application manifest.
Select the Capabilities tab, then select the Microphone capability.
From the menu bar, select File > Save Package.appxmanifest to save your changes.

Install the Speech SDK for UWP

Finally, install the Speech SDK NuGet package, and reference the Speech SDK in your project:

In Solution Explorer, right-click your solution, and select Manage NuGet Packages for Solution to go to the NuGet - Solution window.
Select Browse. In Package source, select nuget.org.
In the Search box, enter Microsoft.CognitiveServices.Speech. Choose that package after it appears in the search results.
In the package status pane next to the search results, select your helloworld project.
Select Install.
In the Preview Changes dialog box, select Apply.
In the License Acceptance dialog box, view the license, and then select I Accept. The package installation begins.

When installation is complete, the Output pane displays a message that's similar to the following text: Successfully installed 'Microsoft.CognitiveServices.Speech 1.15.0' to helloworld.

This guide shows how to create a Xamarin forms project and install the Speech SDK. Xamarin is an open-source platform for building modern and performant applications for iOS, Android, and Windows by using .NET.

For Xamarin development, the Speech SDK supports:

Windows Desktop x86 and x64
Universal Windows Platform x86, x64, ARM/ARM64
Android x86, ARM32/64
iOS x64 simulator and ARM64

This guide requires:

Microsoft Visual C++ Redistributable for Visual Studio 2019 for the Windows platform. Installing it for the first time might require a restart.
Visual Studio 2019.

Create a Visual Studio project and install the Speech SDK

To create a Visual Studio project for cross-platform mobile app development with .NET and Xamarin, you need to:

Set up Visual Studio development options.
Create the project and select the target architecture.
Install the Speech SDK.

Set up Visual Studio development options

Make sure you're set up correctly in Visual Studio for cross-platform mobile development with .NET:

Open Visual Studio 2019. Then select Continue without code.
From the Visual Studio menu, select Tools > Get Tools and Features to open Visual Studio Installer and view the Modifying dialog box.
On the Workloads tab, find the Mobile development with .NET workload. If that workload is already selected, close the Modifying dialog box and close Visual Studio Installer. Skip the rest of this procedure.
Select Mobile development with .NET, and then select Modify.
In the Before we get started dialog box, select Continue to install the workload for mobile development with .NET. Installation of the new feature might take a while.
Close Visual Studio Installer.

Create the project

Next, create your project and select the target architecture:

On the Visual Studio menu bar, select File > New > Project to display the Create a new project window.
Find and select Mobile App (Xamarin.Forms).
Select Next.
In the Configure your new project dialog box, in Project name, enter helloworld.
In Location, go to and select or create the folder where you want to save your project.
Select Create.
In the New Cross Platform App window, select the Blank template, and then select Android, iOS, and Windows (UWP). Select Create.
Select OK. You return to the Visual Studio IDE, with the new project created and visible in the Solution Explorer pane.
Select your target platform architecture and startup project. On the Visual Studio toolbar, find the Solution Platforms dropdown box. If you don't see it, select View > Toolbars > Standard to display the toolbar that contains Solution Platforms.

If you're running 64-bit Windows, select x64 in the drop-down box. You can select x86 if you want because 64-bit Windows also can run 32-bit applications.
In the Start-up Projects dropdown box, select helloworld.UWP (Universal Windows).

Install the Speech SDK for Xamarin

Install the Speech SDK NuGet package, and reference the Speech SDK in your project:

In Solution Explorer, right-click your solution. Select Manage NuGet Packages for Solution to go to the NuGet - Solution window.
Select Browse.
In Package source, select nuget.org.
In the Search box, enter Microsoft.CognitiveServices.Speech. Then select that package after it appears in the search results.

Note

The iOS library inside Microsoft.CognitiveServices.Speech NuGet doesn't have bitcode enabled. If you need the bitcode library enabled for your application, use Microsoft.CognitiveServices.Speech.Xamarin.iOS NuGet for the iOS project specifically.
In the package status pane next to the search results, select all projects.
Select Install.
In the Preview Changes dialog box, select OK.
In the License Acceptance dialog box, view the license, and then select I Accept. Install the Speech SDK package reference to all projects.

After installation finishes successfully, you might see the following warning for helloworld.iOS. This warning is a known issue and shouldn't affect your app's functionality.
```
Could not resolve reference "C:\Users\Default\.nuget\packages\microsoft.cognitiveservices.speech\1.7.0\build\Xamarin.iOS\libMicrosoft.CognitiveServices.Speech.core.a". If this reference is required by your code, you may get compilation errors.
```

The Speech SDK is now installed. You can now delete or reuse the helloworld project that you created in the previous steps.

Reference documentation | Package (NuGet) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for C++.

Platform requirements

The Speech SDK for C++ is compatible with Windows, Linux, and macOS.

On Windows, you must use the 64-bit target architecture. Windows 10 or later is required.

Install the Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022 for your platform. Installing this package for the first time might require a restart.

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

The Speech SDK for C++ only supports the following distributions on the x86 (Debian/Ubuntu), x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) architectures:

Ubuntu 18.04/20.04
Debian 10/11
Red Hat Enterprise Linux (RHEL) 7/8
CentOS 7

Important

Use the most recent LTS release of the Linux distribution. For example, if you are using Ubuntu 20.04 LTS, use the latest release of Ubuntu 20.04.X.

The Speech SDK depends on the following Linux system libraries:

The shared libraries of the GNU C library, including the POSIX Threads Programming library, libpthreads.
The OpenSSL library (libssl) version 1.x and certificates (ca-certificates).
The shared library for ALSA applications (libasound).

You should also install ca-certificates to establish a secure websocket and avoid the WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED error.

Important

The Speech SDK does not yet support OpenSSL 3.0, which is the default in Ubuntu 22.04 and Debian 12.

Run these commands:

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

To use the Speech SDK in Alpine Linux, create a Debian chroot environment as documented in the Alpine Linux Wiki on running glibc programs. Then follow the Debian instructions here.

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

Install the development tools and libraries:

sudo yum update
sudo yum groupinstall "Development tools"
sudo yum install alsa-lib openssl wget

Important

On RHEL/CentOS 7, follow the instructions on how to configure RHEL/CentOS 7 for Speech SDK.
On RHEL, follow the instructions on how to configure OpenSSL for Linux.

Install the Speech SDK for C++

The Speech SDK for C++ is available as a NuGet package. For more information, see Microsoft.CognitiveServices.Speech.

Terminal
PowerShell

The Speech SDK for C++ can be installed from the .NET CLI by using the following dotnet add command:

dotnet add package Microsoft.CognitiveServices.Speech

The Speech SDK for C++ can be installed by using the following Install-Package command:

Install-Package Microsoft.CognitiveServices.Speech

You can follow these guides for more options.

This guide shows how to install the Speech SDK for Linux.

Use the following procedure to download and install the SDK. The steps include downloading the required libraries and header files as a .tar file.

Choose a directory for the Speech SDK files. Set the SPEECHSDK_ROOT environment variable to point to that directory. This variable makes it easy to refer to the directory in future commands.

To use the directory speechsdk in your home directory, run the following command:
```
export SPEECHSDK_ROOT="$HOME/speechsdk"
```
Create the directory if it doesn't exist:
```
mkdir -p "$SPEECHSDK_ROOT"
```

Download and extract the .tar.gz archive that contains the Speech SDK binaries:

wget -O SpeechSDK-Linux.tar.gz https://aka.ms/csspeech/linuxbinary
tar --strip 1 -xzf SpeechSDK-Linux.tar.gz -C "$SPEECHSDK_ROOT"

Validate the contents of the top-level directory of the extracted package:

ls -l "$SPEECHSDK_ROOT"

The directory listing should contain the partner notices and license files. The listing should also contain an include directory that holds header (.h) files and a lib directory that holds libraries for arm32, arm64, x64, and x86.

Path	Description
license.md	License
ThirdPartyNotices.md	Partner notices
REDIST.txt	Redistribution notice
include	Required header files for C++
lib/arm32	Native library for ARM32 required to link your application
lib/arm64	Native library for ARM64 required to link your application
lib/x64	Native library for x64 required to link your application
lib/x86	Native library for x86 required to link your application

This guide shows how to install the Speech SDK for C++ on macOS 10.14 or later. The steps include downloading the required libraries and header files as a .zip file.

Choose a directory for the Speech SDK files. Set the SPEECHSDK_ROOT environment variable to point to that directory. This variable makes it easy to refer to the directory in future commands.

To use the directory speechsdk in your home directory, run the following command:
```
export SPEECHSDK_ROOT="$HOME/speechsdk"
```
Create the directory if it doesn't exist:
```
mkdir -p "$SPEECHSDK_ROOT"
```

Download and extract the .zip archive that contains the Speech SDK XCFramework:

wget -O SpeechSDK-macOS.zip https://aka.ms/csspeech/macosbinary
unzip SpeechSDK-macOS.zip -d "$SPEECHSDK_ROOT"

Validate the contents of the top-level directory of the extracted package:
```
ls -l "$SPEECHSDK_ROOT"
```
The directory listing should contain the partner notice, license files, and a MicrosoftCognitiveServicesSpeech.xcframework directory.

This guide shows how to install the Speech SDK for C++ on Windows desktop operating systems.

This setup guide requires:

Microsoft Visual C++ Redistributable for Visual Studio for the Windows platform. Installing it for the first time might require a restart.
Visual Studio.

Create a project in Visual Studio and install the Speech SDK

To create a Visual Studio project for C++ desktop development, you need to:

Set up Visual Studio development options.
Create the project.
Select the target architecture.
Install the Speech SDK.

Set up Visual Studio development options

To start, make sure you're set up correctly in Visual Studio for C++ desktop development:

Open Visual Studio 2019 to display the start window.
Select Continue without code to go to the Visual Studio IDE.
From the Visual Studio menu bar, select Tools > Get Tools and Features to open Visual Studio Installer and view the Modifying dialog box.
On the Workloads tab, under Windows, find the Desktop development with C++ workload. If that workload isn't already selected, select it.
On the Individual components tab, find NuGet package manager. If it isn't already selected, select it.
Select either Close or Modify. The button name varies depending on whether you selected any features for installation.

If you select Modify, installation begins. The process might take a while.
Close Visual Studio Installer.

Create the project

Next, create your project and select the target architecture:

From the Visual Studio menu, select File > New > Project to display the Create a new project window.
Find and select Console App. Make sure that you select the C++ version of this project type, as opposed to C# or Visual Basic.
Select Next.
In the Configure your new project dialog box, in Project name, enter helloworld.
In Location, go to and select or create the folder where you want to save your project, and then select Create.
Select your target platform architecture. On the Visual Studio toolbar, find the Solution Platforms dropdown box. If you don't see it, select View > Toolbars > Standard to display the toolbar that contains Solution Platforms.

If you're running 64-bit Windows, select x64 in the dropdown box. 64-bit Windows can also run 32-bit applications, so you can choose x86 if you prefer.

Install the Speech SDK by using Visual Studio

Finally, install the Speech SDK NuGet package and reference the Speech SDK in your project:

In Solution Explorer, right-click your solution, and then select Manage NuGet Packages for Solution to go to the NuGet - Solution window.
Select Browse.
In Package source, select nuget.org.
In the Search box, enter Microsoft.CognitiveServices.Speech. Choose that package after it appears in the search results.
In the package status pane next to the search results, select your helloworld project.
Select Install.
In the Preview Changes dialog box, select OK.
In the License Acceptance dialog box, view the license, and then select I Accept. The package installation begins. When installation is complete, the Output pane displays a message that's similar to the following text: Successfully installed 'Microsoft.CognitiveServices.Speech 1.15.0' to helloworld.

Reference documentation | Package (Go) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for Go.

Platform requirements

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

The Speech SDK for Go supports the following distributions on the x64 architecture:

Ubuntu 18.04/20.04
Debian 9/10/11
Red Hat Enterprise Linux (RHEL) 8
CentOS 7

Important

Use the most recent LTS release of the Linux distribution. For example, if you are using Ubuntu 20.04 LTS, use the latest release of Ubuntu 20.04.X.

The Speech SDK depends on the following Linux system libraries:

The shared libraries of the GNU C library, including the POSIX Threads Programming library, libpthreads.
The OpenSSL library (libssl) version 1.x and certificates (ca-certificates).
The shared library for ALSA applications (libasound).

You should also install ca-certificates to establish a secure websocket and avoid the WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED error.

Important

The Speech SDK does not yet support OpenSSL 3.0, which is the default in Ubuntu 22.04 and Debian 12.

Run these commands:

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

To use the Speech SDK in Alpine Linux, create a Debian chroot environment as documented in the Alpine Linux Wiki on running glibc programs. Then follow the Debian instructions here.

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

Install the development tools and libraries:

sudo yum update
sudo yum groupinstall "Development tools"
sudo yum install alsa-lib openssl wget

Important

On RHEL/CentOS 7, follow the instructions on how to configure RHEL/CentOS 7 for Speech SDK.
On RHEL, follow the instructions on how to configure OpenSSL for Linux.

Install the Go binary version 1.13 or later.

Install the Speech SDK for Go

Use the following procedure to download and install the SDK. The steps include downloading the required libraries and header files as a .tar file.

Choose a directory for the Speech SDK files. Set the SPEECHSDK_ROOT environment variable to point to that directory. This variable makes it easy to refer to the directory in future commands.

To use the directory speechsdk in your home directory, run the following command:
```
export SPEECHSDK_ROOT="$HOME/speechsdk"
```
Create the directory if it doesn't exist:
```
mkdir -p "$SPEECHSDK_ROOT"
```

Download and extract the .tar.gz archive that contains the Speech SDK binaries:

wget -O SpeechSDK-Linux.tar.gz https://aka.ms/csspeech/linuxbinary
tar --strip 1 -xzf SpeechSDK-Linux.tar.gz -C "$SPEECHSDK_ROOT"

Validate the contents of the top-level directory of the extracted package:

ls -l "$SPEECHSDK_ROOT"

The directory listing should contain the partner notices and license files. The listing should also contain an include directory that holds header (.h) files and a lib directory that holds libraries for arm32, arm64, x64, and x86.

Path	Description
license.md	License
ThirdPartyNotices.md	Partner notices
REDIST.txt	Redistribution notice
include	Required header files for C++
lib/arm32	Native library for ARM32 required to link your application
lib/arm64	Native library for ARM64 required to link your application
lib/x64	Native library for x64 required to link your application
lib/x86	Native library for x86 required to link your application

Configure the Go environment

The following steps enable your Go environment to find the Speech SDK.

Because the bindings rely on cgo, you need to set the environment variables so Go can find the SDK.
```
export CGO_CFLAGS="-I$SPEECHSDK_ROOT/include/c_api"
export CGO_LDFLAGS="-L$SPEECHSDK_ROOT/lib/<architecture> -lMicrosoft.CognitiveServices.Speech.core"
```
Important

Replace <architecture> with the processor architecture of your CPU: x86, x64, arm32, or arm64.
To run applications and the SDK, you need to tell the operating system where to find the libraries.
```
export LD_LIBRARY_PATH="$SPEECHSDK_ROOT/lib/<architecture>:$LD_LIBRARY_PATH"
```
Important

Replace <architecture> with the processor architecture of your CPU: x86, x64, arm32, or arm64.

Reference documentation | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for Java.

Platform requirements

Choose your target environment:

Java Runtime
Android

The Speech SDK for Java is compatible with Windows, Linux, and macOS.

On Windows, you must use the 64-bit target architecture. Windows 10 or later is required.

Install the Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022 for your platform. Installing this package for the first time might require a restart.

The Speech SDK for Java doesn't support Windows on ARM64.

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

The Speech SDK for Java supports the following distributions on the x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) architectures:

Ubuntu 18.04/20.04
Debian 10/11
Red Hat Enterprise Linux (RHEL) 7/8
CentOS 7

Important

Use the most recent LTS release of the Linux distribution. For example, if you are using Ubuntu 20.04 LTS, use the latest release of Ubuntu 20.04.X.

The Speech SDK depends on the following Linux system libraries:

The shared libraries of the GNU C library, including the POSIX Threads Programming library, libpthreads.
The OpenSSL library (libssl) version 1.x and certificates (ca-certificates).
The shared library for ALSA applications (libasound).

You should also install ca-certificates to establish a secure websocket and avoid the WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED error.

Important

The Speech SDK does not yet support OpenSSL 3.0, which is the default in Ubuntu 22.04 and Debian 12.

Run these commands:

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

To use the Speech SDK in Alpine Linux, create a Debian chroot environment as documented in the Alpine Linux Wiki on running glibc programs. Then follow the Debian instructions here.

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

Install the development tools and libraries:

sudo yum update
sudo yum groupinstall "Development tools"
sudo yum install alsa-lib openssl wget

Important

On RHEL/CentOS 7, follow the instructions on how to configure RHEL/CentOS 7 for Speech SDK.
On RHEL, follow the instructions on how to configure OpenSSL for Linux.

Install a Java Development Kit such as Azul Zulu OpenJDK. The Microsoft Build of OpenJDK or your preferred JDK should also work.

Install the Speech SDK for Java

Some of the instructions use a specific SDK version such as 1.24.2. To check the latest version, search our GitHub repository.

Choose your target environment:

Java Runtime
Android

This guide shows how to install the Speech SDK for Java on the Java Runtime.

Supported operating systems

The Speech SDK for Java package is available for these operating systems:

Windows: 64-bit only.
Mac: macOS X version 10.14 or later.
Linux: See the supported Linux distributions and target architectures.

Follow these steps to install the Speech SDK for Java using Apache Maven:

Install Apache Maven.
Open a command prompt where you want the new project, and create a new pom.xml file.

Copy the following XML content into pom.xml:

<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>
    <groupId>com.microsoft.cognitiveservices.speech.samples</groupId>
    <artifactId>quickstart-eclipse</artifactId>
    <version>1.0.0-SNAPSHOT</version>
    <build>
        <sourceDirectory>src</sourceDirectory>
        <plugins>
        <plugin>
            <artifactId>maven-compiler-plugin</artifactId>
            <version>3.7.0</version>
            <configuration>
            <source>1.8</source>
            <target>1.8</target>
            </configuration>
        </plugin>
        </plugins>
    </build>
    <dependencies>
        <dependency>
        <groupId>com.microsoft.cognitiveservices.speech</groupId>
        <artifactId>client-sdk</artifactId>
        <version>1.37.0</version>
        </dependency>
    </dependencies>
</project>

Run the following Maven command to install the Speech SDK and dependencies.
```
mvn clean dependency:copy-dependencies
```

Create an Eclipse project and install the Speech SDK

Install the Eclipse Java IDE. This IDE requires Java to already be installed.
Start Eclipse.
In Eclipse Launcher, in the Workspace box, enter the name of a new workspace directory. Then select Launch.
In a moment, the main window of the Eclipse IDE appears. Close the Welcome screen if one is present.
From the Eclipse menu, select File > New > Project.
The New Project dialog box appears. Select Java Project, and then select Next.
The New Java Project wizard starts. In the Project name field, enter quickstart. Choose JavaSE-1.8 as the execution environment. Select Finish.
If the Open Associated Perspective? window appears, select Open Perspective.
In Package Explorer, right-click the quickstart project. Select Configure > Convert to Maven Project from the context menu.
The Create new POM window appears. In the Group Id field, enter com.microsoft.cognitiveservices.speech.samples. In the Artifact Id field, enter quickstart. Then select Finish.

Open the pom.xml file and edit it:

Add a dependencies element at the end of the file, before the closing tag </project>, with the Speech SDK as a dependency:

<dependencies>
  <dependency>
    <groupId>com.microsoft.cognitiveservices.speech</groupId>
    <artifactId>client-sdk</artifactId>
    <version>1.37.0</version>
  </dependency>
</dependencies>

Save the changes.

Gradle configurations

Gradle configurations require an explicit reference to the .jar dependency extension:

// build.gradle

dependencies {
    implementation group: 'com.microsoft.cognitiveservices.speech', name: 'client-sdk', version: "1.37.0", ext: "jar"
}

Reference documentation | Package (npm) | Additional Samples on GitHub | Library source code

In this quickstart, you install the Speech SDK for JavaScript.

The Speech SDK for JavaScript is available as an npm package. See microsoft-cognitiveservices-speech-sdk and its companion GitHub repository cognitive-services-speech-sdk-js.

Platform requirements

Understand the architectural implications between Node.js and client web browsers. For example, the document object model (DOM) isn't available for server-side applications. The Node.js file system isn't available to client-side applications.

Install the Speech SDK for JavaScript

Depending on the target environment, use one of the following guides:

Node.js
Browser-based

This guide shows how to install the Speech SDK for JavaScript for use with Node.js.

Install Node.js.
Create a new directory, run npm init, and walk through the prompts.
To install the Speech SDK for JavaScript, run the following npm install command:
```
npm install microsoft-cognitiveservices-speech-sdk
```

For more information, see the Node.js samples.

This guide shows how to install the Speech SDK for JavaScript for use with a webpage.

Unpack to a folder

Create a new, empty folder. If you want to host the sample on a web server, make sure that the web server can access the folder.
Download the Speech SDK as a .zip package and unpack it into the newly created folder. These files are unpacked:
- microsoft.cognitiveservices.speech.sdk.bundle.js: A human-readable version of the Speech SDK.
- microsoft.cognitiveservices.speech.sdk.bundle.js.map: A map file to use for debugging SDK code.
- microsoft.cognitiveservices.speech.sdk.bundle.d.ts: Object definitions for use with TypeScript.
- microsoft.cognitiveservices.speech.sdk.bundle-min.js: A minified version of the Speech SDK.
- speech-processor.js: Code to improve performance on some browsers.
Create a new file named index.html in the folder, and open this file with a text editor.

HTML script tag

Download and extract the microsoft.cognitiveservices.speech.sdk.bundle.js file from the Speech SDK for JavaScript. Place it in a folder that your HTML file can access.

<script src="microsoft.cognitiveservices.speech.sdk.bundle.js"></script>;

Tip

If you're targeting a web browser and using the <script> tag, the sdk prefix is not needed. The sdk prefix is an alias that's used to name the require module.

Alternatively, you could directly include a <script> tag in the HTML <head> element, relying on the JSDelivr.

<script src="https://cdn.jsdelivr.net/npm/microsoft-cognitiveservices-speech-sdk@latest/distrib/browser/microsoft.cognitiveservices.speech.sdk.bundle-min.js">
</script>

For more information, see the browser-based samples.

Use the Speech SDK

Add the following import statement to use the Speech SDK in your JavaScript project:
```
import * as sdk from "microsoft-cognitiveservices-speech-sdk";
```

For more information on import, see Export and Import on the JavaScript website.

Alternatively, you could use a require statement:

const sdk = require("microsoft-cognitiveservices-speech-sdk");

Reference documentation | Package (Download) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for Objective-C.

Tip

For more information about using the Speech SDK for Swift, see Importing Objective-C into Swift.

The Speech SDK for Objective-C is available natively as a CocoaPod package for Mac x64 and ARM-based systems.

System requirements for Mac:

A macOS version 10.14 or later

The macOS CocoaPod package is available for download and use with the Xcode 9.4.1 or later integrated development environment (IDE).

Go to the Xcode directory where your .xcodeproj project file is located.
Run pod init to create a pod file named Podfile.
Replace the contents of Podfile with the following content. Update the target name from AppName to the name of your app. Update the platform or pod version as needed.
```
platform :osx, 10.14
use_frameworks!

target 'AppName' do
  pod 'MicrosoftCognitiveServicesSpeech-macOS', '~> 1.37.0'
end
```
Run pod install to install the Speech SDK.

Alternatively, download the binary CocoaPod and extract its contents. In your Xcode project, add a reference to the extracted MicrosoftCognitiveServicesSpeech.xcframework folder and its contents.

Note

.NET developers can build native macOS applications by using the Xamarin.Mac application framework. For more information, see Xamarin.Mac.

The Speech SDK for Objective-C is available natively as a CocoaPod package.

System requirements for iOS:

A macOS version 10.14 or later
Target iOS 9.3 or later

The macOS CocoaPod package is available for download and use with the Xcode 9.4.1 or later integrated development environment (IDE).

Go to the Xcode directory where your .xcodeproj project file is located.
Run pod init to create a pod file named Podfile.
Replace the contents of Podfile with the following content. Update the target name from AppName to the name of your app. Update the platform or pod version as needed.
```
platform :ios, '9.3'
use_frameworks!

target 'AppName' do
  pod 'MicrosoftCognitiveServicesSpeech-iOS', '~> 1.37.0'
end
```
Run pod install to install the Speech SDK.

Alternatively, download the binary CocoaPod and extract its contents. In your Xcode project, add a reference to the extracted MicrosoftCognitiveServicesSpeech.xcframework folder and its contents.

Note

.NET developers can build native iOS applications by using the Xamarin.iOS application framework. For more information, see Xamarin.iOS.

Reference documentation | Package (Download) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for Swift.

Tip

For more information about using the Speech SDK for Swift, see Importing Objective-C into Swift.

Install the Speech SDK for Swift

Mac
iOS

The Speech SDK for Swift is available natively as a CocoaPod package for Mac x64 and ARM-based systems.

System requirements for Mac:

A macOS version 10.14 or later

The macOS CocoaPod package is available for download and use with the Xcode 9.4.1 or later integrated development environment (IDE).

Go to the Xcode directory where your .xcodeproj project file is located.
Run pod init to create a pod file named Podfile.
Replace the contents of Podfile with the following content. Update the target name from AppName to the name of your app. Update the platform or pod version as needed.
```
platform :osx, 10.14
use_frameworks!

target 'AppName' do
  pod 'MicrosoftCognitiveServicesSpeech-macOS', '~> 1.37.0'
end
```
Run pod install to install the Speech SDK.

Alternatively, download the binary CocoaPod and extract its contents. In your Xcode project, add a reference to the extracted MicrosoftCognitiveServicesSpeech.xcframework folder and its contents.

Note

.NET developers can build native macOS applications by using the Xamarin.Mac application framework. For more information, see Xamarin.Mac.

The Speech SDK for Swift is available natively as a CocoaPod package.

System requirements for iOS:

A macOS version 10.14 or later
Target iOS 9.3 or later

The macOS CocoaPod package is available for download and use with the Xcode 9.4.1 or later integrated development environment (IDE).

Go to the Xcode directory where your .xcodeproj project file is located.
Run pod init to create a pod file named Podfile.
Replace the contents of Podfile with the following. Update the target name from AppName to the name of your app. Update the platform or pod version as needed.
```
platform :ios, '9.3'
use_frameworks!

target 'AppName' do
  pod 'MicrosoftCognitiveServicesSpeech-iOS', '~> 1.37.0'
end
```
Run pod install to install the Speech SDK.

Alternatively, download the binary CocoaPod and extract its contents. In your Xcode project, add a reference to the extracted MicrosoftCognitiveServicesSpeech.xcframework folder and its contents.

Note

.NET developers can build native iOS applications by using the Xamarin.iOS application framework. For more information, see Xamarin.iOS.

Reference documentation | Package (PyPi) | Additional Samples on GitHub

In this quickstart, you install the Speech SDK for Python.

Platform requirements

The Speech SDK for Python is compatible with Windows, Linux, and macOS.

On Windows, you must use the 64-bit target architecture. Windows 10 or later is required.

Install the Microsoft Visual C++ Redistributable for Visual Studio 2015, 2017, 2019, and 2022 for your platform. Installing this package for the first time might require a restart.

Important

Make sure that packages of the same target architecture are installed. For example, if you install the x64 redistributable package, install the x64 Python package.

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

The Speech SDK for Python supports the following distributions on the x64 and ARM64 architectures:

Ubuntu 18.04/20.04
Debian 10/11
Red Hat Enterprise Linux (RHEL) 8
CentOS 7

Important

Use the most recent LTS release of the Linux distribution. For example, if you are using Ubuntu 20.04 LTS, use the latest release of Ubuntu 20.04.X.

The Speech SDK depends on the following Linux system libraries:

The shared libraries of the GNU C library, including the POSIX Threads Programming library, libpthreads.
The OpenSSL library (libssl) version 1.x and certificates (ca-certificates).
The shared library for ALSA applications (libasound).

You should also install ca-certificates to establish a secure websocket and avoid the WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED error.

Important

The Speech SDK does not yet support OpenSSL 3.0, which is the default in Ubuntu 22.04 and Debian 12.

Run these commands:

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

To use the Speech SDK in Alpine Linux, create a Debian chroot environment as documented in the Alpine Linux Wiki on running glibc programs. Then follow the Debian instructions here.

sudo apt-get update
sudo apt-get install build-essential libssl-dev ca-certificates libasound2 wget

Caution

This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly. For more information, see the CentOS End Of Life guidance.

Install the development tools and libraries:

sudo yum update
sudo yum groupinstall "Development tools"
sudo yum install alsa-lib openssl wget

Important

On RHEL/CentOS 7, follow the instructions on how to configure RHEL/CentOS 7 for Speech SDK.
On RHEL, follow the instructions on how to configure OpenSSL for Linux.

Install a version of Python from 3.7 or later.

To check your installation, open a terminal and run the command python --version. If Python installed properly, you get a response like Python 3.8.10.
If you're using macOS or Linux, you might need to run the command python3 --version instead.

To enable use of python instead of python3, run alias python='python3' to set up an alias. The Speech SDK quickstart samples specify python usage.

Install the Speech SDK for Python

Before you install the Speech SDK for Python, make sure to satisfy the platform requirements.

PyPI
VS Code

Install from PyPI

To install the Speech SDK for Python, run this command in a console window:

pip install azure-cognitiveservices-speech

Upgrade to the latest Speech SDK

To upgrade to the latest Speech SDK, run this command in console window:

pip install --upgrade azure-cognitiveservices-speech

You can check which Speech SDK for Python version is currently installed by inspecting the azure.cognitiveservices.speech.__version__ variable. For example, run this command in a console window:

pip list

Install the Speech SDK by using Visual Studio Code

To install the Speech SDK for Python:

Download and install Visual Studio Code.
Run Visual Studio Code and install the Python extension:
1. Select File > Preferences > Extensions.
2. Search for Python, find the Python extension for Visual Studio Code published by Microsoft, and then select Install.
Select Terminal > New Terminal to open a terminal within Visual Studio Code.
At the terminal prompt, run the following command to install the Speech SDK for Python package.
```
python -m pip install azure-cognitiveservices-speech
```

For more information about Visual Studio Code and Python, see Visual Studio Code and Getting Started with Python in VS Code.

Use the Speech SDK

Add the following import statement to use the Speech SDK in your Python project:

import azure.cognitiveservices.speech as speechsdk

Quickstart: Install the Speech SDK

Platform requirements

Install the Speech SDK for C#

Create a Visual Studio project and install the Speech SDK

Create a helloworld project

Use NuGet Package Manager to install the Speech SDK

Choose target architecture

Platform requirements

Install the Speech SDK for C++

Platform requirements

Install the Speech SDK for Go

Configure the Go environment

Platform requirements

Install the Speech SDK for Java

Supported operating systems

Platform requirements

Install the Speech SDK for JavaScript

Use the Speech SDK

Install the Speech SDK for Objective-C

Install the Speech SDK for Swift

Platform requirements

Install the Speech SDK for Python

Install from PyPI

Upgrade to the latest Speech SDK

Use the Speech SDK

Related content

Feedback

Additional resources