SpeechConfig Class

Reference

Class that defines configurations for speech / intent recognition and speech synthesis.

The configuration can be initialized in different ways:

from subscription: pass a subscription key and a region
from endpoint: pass an endpoint. Subscription key or authorization token are optional.
from host: pass a host address. Subscription key or authorization token are optional.
from authorization token: pass an authorization token and a region

Inheritance: builtins.object

SpeechConfig

Constructor

SpeechConfig(subscription: str | None = None, region: str | None = None, endpoint: str | None = None, host: str | None = None, auth_token: str | None = None, speech_recognition_language: str | None = None)

Parameters

Name	Description
subscription	The subscription key. default value: None
region	The region name (see the region page). default value: None
endpoint	The service endpoint to connect to. default value: None
host	The service host to connect to. Standard resource path will be assumed. Format is "protocol://host:port" where ":port" is optional. default value: None
auth_token	The authorization token. default value: None
speech_recognition_language	The input language to the speech recognition. The language is specified in BCP-47 format. default value: None

Methods

enable_audio_logging	Enables audio logging in service. Audio and content logs are stored either in Microsoft-owned storage, or in your own storage account linked to your Cognitive Services subscription (Bring Your Own Storage (BYOS) enabled Speech resource). Note Added in version 1.5.0.
enable_dictation	Enables dictation. Only supported in speech continuous recognition. Note Added in version 1.5.0.
get_property	Get a property by id.
get_property_by_name	Get a property by name.
request_word_level_timestamps	Includes word level timestamps in response result. Note Added in version 1.5.0.
set_profanity	Set the profanity option. Note Added in version 1.5.0.
set_properties	Set multiple properties by id.
set_properties_by_name	Set multiple properties by name.
set_property	Set a property by id.
set_property_by_name	Set a property by name.
set_proxy	Set proxy information. Note Proxy functionality is not available on macOS. This function will have no effect on this platform.
set_service_property	Sets a property value that will be passed to service using the specified channel. Note Added in version 1.5.0.
set_speech_synthesis_output_format	Set speech synthesis output audio format.

enable_audio_logging

Enables audio logging in service. Audio and content logs are stored either in Microsoft-owned storage, or in your own storage account linked to your Cognitive Services subscription (Bring Your Own Storage (BYOS) enabled Speech resource).

Note

Added in version 1.5.0.

enable_audio_logging()

enable_dictation

Enables dictation. Only supported in speech continuous recognition.

Note

Added in version 1.5.0.

enable_dictation()

get_property

Get a property by id.

get_property(property_id: PropertyId) -> str

Parameters

Name	Description
property_id Required	The id of the property to be retrieved.

Returns

Type	Description
	The value of the property.

get_property_by_name

Get a property by name.

get_property_by_name(property_name: str) -> str

Parameters

Name	Description
property_name Required	The name of the property to be retrieved.

Returns

Type	Description
	The value of the property.

request_word_level_timestamps

Includes word level timestamps in response result.

Note

Added in version 1.5.0.

request_word_level_timestamps()

set_profanity

Set the profanity option.

Note

Added in version 1.5.0.

set_profanity(profanity_option: ProfanityOption) -> None

Parameters

Name	Description
profanity_option Required	The profanity level to set.

set_properties

Set multiple properties by id.

set_properties(properties: Dict[PropertyId, str])

Parameters

Name	Description
properties Required	A dict mapping property ids to the values to be set.

set_properties_by_name

Set multiple properties by name.

set_properties_by_name(properties: Dict[str, str])

Parameters

Name	Description
properties Required	A dict mapping property ids to the values to be set.

set_property

Set a property by id.

set_property(property_id: PropertyId, value: str)

Parameters

Name	Description
property_id Required	The id of the property to be set.
value Required	The value to be set for the property.

set_property_by_name

Set a property by name.

set_property_by_name(property_name: str, value: str)

Parameters

Name	Description
property_name Required	The name of the property to be set.
value Required	The value to be set for the property.

set_proxy

Set proxy information.

Note

Proxy functionality is not available on macOS. This function will have no effect on

this platform.

set_proxy(hostname: str, port: int, username: str = None, password: str = None)

Parameters

Name	Description
hostname Required	The host name of the proxy server. Do not add protocol information (http) to the hostname.
port Required	The port number of the proxy server.
username	The user name of the proxy server. default value: None
password	The password of the proxy server. default value: None

set_service_property

Sets a property value that will be passed to service using the specified channel.

Note

Added in version 1.5.0.

set_service_property(name: str, value: str, channel: ServicePropertyChannel)

Parameters

Name	Description
name Required	The property name.
value Required	The property value.
channel Required	The channel used to pass the specified property to service.

set_speech_synthesis_output_format

Set speech synthesis output audio format.

set_speech_synthesis_output_format(format_id: SpeechSynthesisOutputFormat)

Parameters

Name	Description
format_id Required	The audio format id, e.g. Riff16Khz16BitMonoPcm.

Attributes

authorization_token

The authorization token that will be used for connecting to the service.

Note

The caller needs to ensure that the authorization token is valid. Before the

authorization token expires, the caller needs to refresh it by calling this setter with a

new valid token. As configuration values are copied when creating a new recognizer, the

new token value will not apply to recognizers that have already been created. For

recognizers that have been created before, you need to set authorization token of the

corresponding recognizer to refresh the token. Otherwise, the recognizers will encounter

errors during recognition.

endpoint_id

The endpoint id.

output_format

The output format (simple or detailed) of the speech recognition result.

region

The region key that was used to create the Recognizer.

speech_recognition_language

The name of spoken language to be recognized in BCP-47 format.

speech_synthesis_language

Get speech synthesis language.

speech_synthesis_output_format_string

Get speech synthesis output audio format string.

speech_synthesis_voice_name

Get speech synthesis voice name.

subscription_key

The subscription key that was used to create the Recognizer.

SpeechConfig Class

Constructor

Parameters

Methods

enable_audio_logging

enable_dictation

get_property

Parameters

Returns

get_property_by_name

Parameters

Returns

request_word_level_timestamps

set_profanity

Parameters

set_properties

Parameters

set_properties_by_name

Parameters

set_property

Parameters

set_property_by_name

Parameters

set_proxy

Parameters

set_service_property

Parameters

set_speech_synthesis_output_format

Parameters

Attributes

authorization_token

endpoint_id

output_format

region

speech_recognition_language

speech_synthesis_language

speech_synthesis_output_format_string

speech_synthesis_voice_name

subscription_key

Feedback

Additional resources