ISpeechRecoResult Interface (SAPI 5.3)

Microsoft Speech API 5.3


The ISpeechRecoResult automation interface returns information about a recognition attempt.

A recognition context returns a recognition result in the following three cases:

  • A successful recognition
  • An intermediate recognition (also called a hypothesis)
  • An unsuccessful recognition (or a false recognition)

A successful recognition is a word or phrase that surpasses a predetermined confidence rating. It is considered accurate enough to be passed back to the user as the text that was actually spoken.

A hypothesis is an intermediate step toward recognition. The text has been parsed and is available to the user for closer examination. Any number of hypotheses may be produced during a recognition attempt. A hypothesis may not reflect the final recognition and should not be used to predict it.

A false recognition is a recognized word or phrase that does not meet or exceed a predetermined confidence rating. It still contains a valid recognition result, including text representing the speech, but that text did not meet the confidence criteria. Any of the following can contribute to a false recognition:

  • Background noise
  • Inexact pronunciation
  • Uncommon words
  • Unusual sequence of words
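
The three cases above can be told apart by which event of the recognition context delivers the result. The following Python sketch assumes the pywin32 package and its convention of naming handlers On<Event>; the class name RecoEvents, the wrap hook, and the listen function are hypothetical names chosen for illustration, and the COM wiring in listen has not been verified against a live recognizer.

```python
import time


class RecoEvents:
    """Sink for the three result-bearing events of a recognition context."""

    # Hook for wrapping the raw COM result; identity by default so the
    # class can be exercised without SAPI installed (illustrative design).
    wrap = staticmethod(lambda result: result)

    def __init__(self):
        self.log = []  # (kind, text) pairs in arrival order

    def _record(self, kind, result):
        # All three kinds carry a valid result, so the text is read the same way.
        text = self.wrap(result).PhraseInfo.GetText()
        self.log.append((kind, text))

    def OnRecognition(self, StreamNumber, StreamPosition, RecognitionType, Result):
        self._record("recognition", Result)

    def OnHypothesis(self, StreamNumber, StreamPosition, Result):
        self._record("hypothesis", Result)

    def OnFalseRecognition(self, StreamNumber, StreamPosition, Result):
        self._record("false recognition", Result)


def listen():
    """Untested wiring sketch: Windows only, assumes pywin32 is installed."""
    import pythoncom
    import win32com.client

    class Sink(RecoEvents):
        # Event arguments may arrive as raw dispatch pointers; wrap them.
        wrap = staticmethod(win32com.client.Dispatch)

    context = win32com.client.DispatchWithEvents("SAPI.SpSharedRecoContext", Sink)
    grammar = context.CreateGrammar()
    grammar.DictationSetState(1)  # 1 is assumed to be SGDSActive
    while True:
        pythoncom.PumpWaitingMessages()  # deliver pending COM events
        time.sleep(0.1)
```

Calling listen() on a Windows machine with a configured recognizer should start dictation and fill the handler's log. Because hypotheses may differ from the final text, a caller would normally act only on entries logged as "recognition".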

All three recognition types are handled the same way: SAPI returns a valid recognition result whose content may be examined. The available information includes the phrase itself, the owning recognition context, the audio format (if the audio was retained), and the other properties of this interface.
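
As a rough illustration of examining a result, the Python sketch below gathers a few fields into a dictionary. PhraseInfo and Times are the properties listed for this interface; GetText, Elements, DisplayText, Length, and OffsetFromStart are assumed to be the member names of the related phrase and timing interfaces and should be checked against their reference pages. The function name summarize_result is hypothetical.

```python
def summarize_result(result):
    """Collect readable fields from an ISpeechRecoResult-like object."""
    phrase = result.PhraseInfo
    times = result.Times
    return {
        # Full text of the phrase, valid for all three result types.
        "text": phrase.GetText(),
        # Display form of each element in the phrase (assumed member names).
        "words": [element.DisplayText for element in phrase.Elements],
        # Raw timing values as reported by SAPI; units are not converted here.
        "length": times.Length,
        "offset_from_start": times.OffsetFromStart,
    }
```

The same function can be called from any of the result-bearing event handlers, since a false recognition or hypothesis carries the same kind of result object as a successful recognition.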

Automation Interface Elements

The ISpeechRecoResult automation interface contains the following elements:

AudioFormat Property
PhraseInfo Property
RecoContext Property
Times Property
Alternates Method
Audio Method
DiscardResultInfo Method
SaveToMemory Method
SpeakAudio Method