Note

Please see Azure Cognitive Services for Speech documentation for the latest supported speech solutions.

Consonants (Microsoft.Speech)

This is a listing and cross reference of the consonant phones that are used by the three phonetic alphabets that Microsoft.Speech supports. See Phonetic Alphabet Reference (Microsoft.Speech).

UPS Consonant Labeling Conventions

Consonant symbols are one, two, or three characters in length. Single character labels are used for highly frequent, language-universal consonants. In general the first character signifies place plus voicing, and the second signifies manner. Labeling conventions carry across phonetic classes in the following ways:

Conventions for place of articulation and voicing labeling:

  • Dentals and alveloars use D for voiced and T for voiceless

  • Alveolars use N for nasal and R for approximant

  • Labiodentals use V for voiced and F for voiceless

  • Velars use G for voiced and K for voiceless, NG for nasal

  • Bilabials use B for voiced and P for voiceless stops, M for nasal

  • Laterals use L

  • Uvulars use Q

  • Pharyngeals use H

  • Palatals use X for voiceless and Y for voiced

Conventions for manner of articulation labeling:

  • Single-character labels have manner encapsulated in choice of symbol: for example, M is nasal

  • Trills double up the first symbol

  • Fricatives use H

  • Approximants use X

  • Retroflex uses R

Consonant Tables

There are two tables, each listing a different category of consonant phones:

  • Consonants

  • Affricates

The table columns contain the following information:

  • UPS. The phone label specified in Microsoft’s Universal Phone Set (UPS).

  • IPA. The phone label specified in the International Phonetic Alphabet (IPA) phone set.

  • Unicode. The Unicode value from the IPA phone set.

  • SAPI ID. The Speech API (SAPI) phone ID, which in most cases is the Unicode value.

  • IPA Description. This column describes the features of the phone, such as place of articulation, manner of articulation, phonation type, and airstream mechanism. See the Glossary of Phonetic Terms (Microsoft.Speech) for more information about the features of phones.

  • ipaASCII. The equivalent phone in ipaASCII phone set for reference.

  • X-SAMPA. The equivalent phone in X-SAMPA for reference.

  • Example. A word that uses the phone and illustrates its sound.

  • Language. The language of the example word.

Consonant Phones

Consonants are speech sounds that are articulated with complete (voiceless) or partial closure of the vocal tract.

UPS

IPA

Unicode

SAPI ID

IPA Description

ipaASCII

X-SAMPA

Example

Language

P

p

U+0070

0070

{blb,stp,vls}

P

P

put

English

B

b

U+0062

0062

{blb,stp,vcd}

B

B

big

English

M

m

U+006D

006D

{blb,nas}

M

M

mat

English

BB

ʙ

U+0299

0299

{blb,trl,vcd}

B<trl>

B\

PH

ɸ

U+0278

0278

{blb,frc,vls}

P

p\

BH

β

U+03B2

03B2

{blb,frc,vcd}

B

B

kabra

Spanish

MF

ɱ

U+0271

0271

{lbd,nas}

M

F

F

f

U+0066

0066

{lbd,frc,vls}

F

F

fork

English

V

v

U+0076

0076

{lbd,frc,vcd}

V

V

vat

English

VA

ʋ

U+028B

028B

{lbd,apr,vcd}

R<lbd>

V\

TH

θ

U+03B8

03B8

{dnt,frc,vls}

T

T

thin

English

DH

ð

U+00F0

00F0

{dnt,frc,vcd}

D

D

then

English

T

t

U+0074

0074

{alv,stp,vls}

T

T

talk

English

D

d

U+0064

0064

{alv,stp,vcd}

D

D

dig

English

N

n

U+006E

006E

{alv,nas}

N

N

no

English

RR

r

U+0072

0072

{alv,trl,vcd}

R<trl>

torre, rojo

Spanish

DX

ɾ

U+027E

027E

{alv,flp,vcd}

*

4

butter

US English

S

s

U+0073

0073

{alv,frc,vls}

S

S

sit

US English

Z

z

U+007A

007A

{alv,frc,vcd}

z

Z

zap

US English

LSH

ɬ

U+026C

026C

{alv,lat,frc,vls}

K

LH

ɮ

U+026E

026E

{alv,lat,frc,vcd}

z<lat>

K\

caballo

Spanish/Zulu

RA

ɹ

U+0279

0279

{alv,apr}

r

r\

puro

Spanish

L

l

U+006C

006C

{alv,lat,apr,vcd}

l

L

lid

US English

L vel

ɫ

U+026B

006C 02E0

{alv,lat,apr,vcd} vel

SH

ʃ

U+0283

0283

{pla,frc,vls}

S

S

she

US English

SH pal

ʆ

U+0286

0283 02B2

{pla,frc,vls} pal

ZH

ʒ

U+0292

0292

{pla,frc,vcd}

Z

Z

pleasure

US English

ZH pal

ʓ

U+0293

0292 02B2

{pla,frc,vcd} pal

TR

ʈ

U+0288

0288

{rfx,stp,vls}

t.

t`

DR

ɖ

U+0256

0256

{rfx,stp,vcd}

d.

D`

NR

ɳ

U+0273

0273

{rfx,nas,vcd}

n.

N`

DXR

ɽ

U+027D

027D

{rfx,flp,vcd}

*.

r`

SR

ʂ

U+0282

0282

{rfx,frc,vls}

s.

S`

ZR

ʐ

U+0290

0290

{rfx,frc,vcd}

z.

Z`

R

ɻ

U+027B

027B

{rfx,apr,vcd}

r.

R

red

US English

LR

ɭ

U+026D

026D

{rfx,lat,vcd}

l.

l`

RR rho

ɼ

U+027C

0072 02DE

{rfx,trl}

CT

c

U+0063

0063

{pal,stp,vls}

c

C

JD

ɟ

U+025F

025F

{pal,stp,vcd}

J

J\

NJ

ɲ

U+0272

0272

{pal,nas,vcd}

J

oignon

French

C

ç

U+00E7

00E7

{pal,frc,vls}

C

C

sicher

German

CJ

ʝ

U+029D

029D

{pal,frc,vcd}

C<vcd>

j\

J

j

U+006A

006A

{pal,apr,vcd}

j

J

yard

US English

LJ

ʎ

U+028E

028E

{pal,lat,apr,vcd}

l^

L

gli

Italian

W

w

U+0077

0077

{lbv,apr,vcd}

w

W

with

US English

K

k

U+006B

006B

{vel,stp,vls}

k

K

cut

US English

G

g

U+0067

0067

{vel,stp,vcd}

g

G

gut

US English

NG

ŋ

U+014B

014B

{vel,nas}

N

N

sing

US English

X

x

U+0078

0078

{vel,frc,vls}

x

X

mujer

Spanish

GH

ɣ

U+0263

0263

{vel,frc,vcd}

Q

7

luego

Spanish

GA

ɰ

U+0270

0270

{vel,apr,vcd}

j<vel>

M\

GL

ʟ

U+029F

029F

{vel,lat,vcd}

L

L\

QT

q

U+0071

0071

{uvl,stp,vls}

q

Q

QD

ɢ

U+0262

0262

{uvl,stp,vcd}

G

G\

QN

ɴ

U+0274

0274

{uvl,nas,vcd}

n"

N\

QQ

ʀ

U+0280

0280

{uvl,trl,vcd}

-

R\

QH

χ

U+03C7

03C7

{uvl,frc,vls}

X

X

RH

ʁ

U+0281

0281

{uvl,frc,vcd}

g"

R

rond

French

HH

ħ

U+0127

0127

{phr,frc,vls}

H

X\

HG

ʕ

U+0295

0295

{phr,frc,vcd}

H<vcd>

?\

GT

ʔ

U+0294

0294

{glt,stp,vls}

?

?

H

h

U+0068

0068

{glt,frc,vls}

h

H

help

US English

WJ

ɥ

U+0265

0265

{lbp,apr,vcd}

j<rnd>

H

huit;juin

French

Affricates

Affricates are consonants that begin as stops but release as a fricative.

The SAPI UPS includes explicit phones for the most frequent affricates in the highest priority languages. The IPA Unicode specification includes some of these phones, though not all of them. The SAPI phone IDs for affricates do not use the IPA Unicode values but instead concatenate the phone IDs of the component phones. Each SAPI phone ID is four characters long. For example JH 006A0361006A is equivalent to D + ZH 006A 0361 006A.

This approach maintains the semantic equivalence of compounds and their constituents. It has been used to generate the phone IDs of all compound phones in the UPS. This allows a speech recognition engine to easily decompose the compound phone into its constituent phones, which may be required if the engine does not model one of these compounds. The table shows the constituent phones (UPS Compound) that were combined to create the resulting explicit phone (UPS).

UPS

UPS Compound

IPA

Unicode

SAPI ID

IPA Description

Example

Language

PF

P + F

p.f

007003610066

{lbd,aff,vls}

Pfahl

German

TS

T + S

t.s / ʦ

U+02A6

007403610073

{den,aff,vls}

Zahl

German

CH

T + SH

t.ʃ / ʧ

U+02A7

007403610283

{pla,aff,vls}

chin

English

JH

D + ZH

d.ʒ / ʤ

U+02A4

006403610292

{pla,aff,vcd}

joy

English

JJ

J + J

j.j

006A0361006A

{pal,aff,vcd}

hielo

Spanish

DZ

D + Z

d.z / ʣ

U+02A3

00640361007A

{den,aff,vcd}

zona

Italian

CC

T + SC

t.ɕ / ʨ

U+02A8

007403610255

{pla,aff,vls}

Mandarin

TSR

T + SR

t.ʂ

007403610282

{rfx,aff,vls}

Mandarin

JC

D + ZC

d.ʑ / ʥ

U+02A5

006403610291

{pla,aff,vcd}

Many other affricates or compound consonants exist and may be required by other languages. For example, there are about 20 Italian geminate consonants (doubled consonants). UPS does not include dedicated phones for these, but allows them to be defined using the + symbol, or the length diacritic ‘lng’ in the case of geminates. For more details, see Parsing Guidelines for SAPI Speech Recognition Phone Converters (Microsoft.Speech).