Quickstart: Recognize speech from a microphone in Swift on macOS using the Speech SDK

This sample demonstrates how to create an macOS app in Swift using the Cognitive Services Speech SDK to transcribe speech recorded from a microphone to text.

Prerequisites

A subscription key for the Speech service. See Try the speech service for free.
A macOS machine with a microphone and Xcode version 9.4.1 or later and CocoaPods installed.

Get the code for the sample app

Download the sample code to your development machine.

Get the Speech SDK for macOS

By downloading the Microsoft Cognitive Services Speech SDK, you acknowledge its license, see Speech SDK license agreement.

The Cognitive Services Speech SDK for macOS is distributed as a framework bundle. It can be used in Xcode projects as a CocoaPod, or downloaded directly here and linked manually. This guide uses a CocoaPod. Note that this tutorial will not work without changes for any version earlier than 1.6.0 of the SDK.

Install the SDK as a CocoaPod

Install the CocoaPod dependency manager as described in its installation instructions.
Navigate to the directory of the downloaded sample app (helloworld) in a terminal.
Run the command pod install. This will generate a helloworld.xcworkspace Xcode workspace containing both the sample app and the Speech SDK as a dependency. This workspace will be used in the following.

Build and Run the Sample

Open the helloworld.xcworkspace workspace in Xcode.
Make the following changes in the AppDelegate.swift file:
1. Replace the string YourSubscriptionKey with your subscription key.
2. Replace the string YourServiceRegion with the region associated with your subscription (for example, westus for the free trial subscription).
Make the debug output visible (View > Debug Area > Activate Console).
Build and run the example code by selecting Product -> Run from the menu or clicking the Play button.
After you click the button in the app and say a few words, you should see the text you have spoken on the lower part of the screen. When you run the app for the first time, you should be prompted to give the app access to the used microphone.

Importing Speech SDK as module

This sample uses bridging header (MicrosoftCognitiveServicesSpeech-Bridging-Header.h) to include MicrosoftCognitiveServicesSpeech framework into the app.

Alternatively from 1.16.0 SDK and onwards, you can also import Speech SDK as follows.

import MicrosoftCognitiveServicesSpeech

References