Azure speech-to-text issue for Hindi(India) for "two two"

Mohit Kumar 1 Reputation point
2022-03-15T13:17:51.023+00:00

I was trying Azure speech to text in Hindi (India) and ran into an issue where saying “two two” returns “2” instead of “22”. The same happens for 3s and 4s as well.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.

1 answer

  1. YutongTie-MSFT 46,991 Reputation points
    2022-03-23T18:11:45.69+00:00

    Hello @Mohit Kumar

    Thanks for reporting this issue again. I just tested "22332224" in Hindi, and it returned the correct result. The video I recorded for the test is here: https://microsoft-my.sharepoint.com/:u:/p/yutie/EQldcboJ0RJDqg6z7Br7UF8Blx0XSyiJB4CuEa0c9H_t0w?e=tecCpQ

    The result JSON is below:

    [  
        {  
            "Id": "291e2e30d8424aa0a01ad4bfa4498c78",  
            "RecognitionStatus": 0,  
            "Offset": 900000,  
            "Duration": 22500000,  
            "DisplayText": "22332224।",  
            "NBest": [  
                {  
                    "Confidence": 0.96145993,  
                    "Lexical": "दो दो तीन तीन दो दो दो चार",  
                    "ITN": "22332224",  
                    "MaskedITN": "22332224।",  
                    "Display": "22332224।",  
                    "Words": [  
                        {  
                            "Word": "दो",  
                            "Offset": 900000,  
                            "Duration": 3500000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 4500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 6700000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 9900000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 13100000,  
                            "Duration": 2300000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 15500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17700000,  
                            "Duration": 1900000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 19700000,  
                            "Duration": 3700000  
                        }  
                    ]  
                },  
                {  
                    "Confidence": 0.960954,  
                    "Lexical": "दो दो तीन तीन दो दो चार",  
                    "ITN": "दो दो तीन तीन दो दो चार",  
                    "MaskedITN": "दो दो तीन तीन दो दो चार",  
                    "Display": "दो दो तीन तीन दो दो चार",  
                    "Words": [  
                        {  
                            "Word": "दो",  
                            "Offset": 900000,  
                            "Duration": 3500000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 4500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 6700000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 9900000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 13100000,  
                            "Duration": 4500000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17700000,  
                            "Duration": 1900000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 19700000,  
                            "Duration": 3700000  
                        }  
                    ]  
                },  
                {  
                    "Confidence": 0.95542437,  
                    "Lexical": "दो दो तीन तीन दो दो दो दो चार",  
                    "ITN": "दो दो तीन तीन दो दो दो दो चार",  
                    "MaskedITN": "दो दो तीन तीन दो दो दो दो चार",  
                    "Display": "दो दो तीन तीन दो दो दो दो चार",  
                    "Words": [  
                        {  
                            "Word": "दो",  
                            "Offset": 900000,  
                            "Duration": 3500000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 4500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 6700000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 9900000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 13100000,  
                            "Duration": 2300000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 15500000,  
                            "Duration": 1400000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17000000,  
                            "Duration": 600000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17700000,  
                            "Duration": 1900000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 19700000,  
                            "Duration": 3700000  
                        }  
                    ]  
                },  
                {  
                    "Confidence": 0.95261824,  
                    "Lexical": "दो दो दो तीन तीन दो दो दो चार",  
                    "ITN": "दो दो दो तीन तीन दो दो दो चार",  
                    "MaskedITN": "दो दो दो तीन तीन दो दो दो चार",  
                    "Display": "दो दो दो तीन तीन दो दो दो चार",  
                    "Words": [  
                        {  
                            "Word": "दो",  
                            "Offset": 900000,  
                            "Duration": 2800000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 3800000,  
                            "Duration": 600000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 4500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 6700000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 9900000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 13100000,  
                            "Duration": 2300000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 15500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17700000,  
                            "Duration": 1900000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 19700000,  
                            "Duration": 3700000  
                        }  
                    ]  
                },  
                {  
                    "Confidence": 0.9428844,  
                    "Lexical": "दो दो तीन तीन दो दो दो चार चार",  
                    "ITN": "दो दो तीन तीन दो दो दो चार चार",  
                    "MaskedITN": "दो दो तीन तीन दो दो दो चार चार",  
                    "Display": "दो दो तीन तीन दो दो दो चार चार",  
                    "Words": [  
                        {  
                            "Word": "दो",  
                            "Offset": 900000,  
                            "Duration": 3500000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 4500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 6700000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "तीन",  
                            "Offset": 9900000,  
                            "Duration": 3100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 13100000,  
                            "Duration": 2300000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 15500000,  
                            "Duration": 2100000  
                        },  
                        {  
                            "Word": "दो",  
                            "Offset": 17700000,  
                            "Duration": 1900000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 19700000,  
                            "Duration": 2000000  
                        },  
                        {  
                            "Word": "चार",  
                            "Offset": 21800000,  
                            "Duration": 1600000  
                        }  
                    ]  
                }  
            ]  
        }  
    ]  
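
To read a response like the one above programmatically, here is a minimal Python sketch. It assumes the detailed JSON has the shape shown; the trimmed payload and the helper names `best_candidate` and `itn_applied` are illustrative and not part of any SDK. `Offset` and `Duration` are expressed in 100-nanosecond ticks, so dividing by 10,000,000 gives seconds.

```python
import json

# Trimmed copy of the response above: top two N-best candidates, word lists omitted.
response = json.loads("""
[{"RecognitionStatus": 0, "Offset": 900000, "Duration": 22500000,
  "DisplayText": "22332224।",
  "NBest": [
    {"Confidence": 0.96145993,
     "Lexical": "दो दो तीन तीन दो दो दो चार",
     "ITN": "22332224"},
    {"Confidence": 0.960954,
     "Lexical": "दो दो तीन तीन दो दो चार",
     "ITN": "दो दो तीन तीन दो दो चार"}
  ]}]
""")

TICKS_PER_SECOND = 10_000_000  # Offset and Duration are in 100-nanosecond ticks


def best_candidate(result):
    """Return the N-best entry with the highest confidence (hypothetical helper)."""
    return max(result["NBest"], key=lambda c: c["Confidence"])


def itn_applied(candidate):
    """True when ITN differs from Lexical, i.e. the spoken words were normalized."""
    return candidate["ITN"] != candidate["Lexical"]


result = response[0]
top = best_candidate(result)
print(top["ITN"], itn_applied(top))
print(result["Offset"] / TICKS_PER_SECOND, result["Duration"] / TICKS_PER_SECOND)
```

Comparing `ITN` with `Lexical` is a quick way to tell whether a candidate was converted to digits: in the response above only the top candidate was, while the lower-ranked ones kept the spoken words.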
    

    Please let me know if you still cannot make it work. I see you are at MSFT, so you can also ping me directly to discuss this issue further.
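
If the service still returns a single digit on your side, one possible client-side workaround is to rebuild the digit string from the `Lexical` field. The sketch below is an assumption rather than anything the Speech service provides: the `HINDI_DIGITS` table and the `lexical_to_digits` helper are hypothetical, and this only helps when `Lexical` actually contains every spoken word.

```python
# Hindi digit words mapped to digits, with common spelling variants.
# Illustrative table only; extend it for the vocabulary you expect.
HINDI_DIGITS = {
    "शून्य": "0", "एक": "1", "दो": "2", "तीन": "3", "चार": "4",
    "पांच": "5", "पाँच": "5", "छह": "6", "छः": "6",
    "सात": "7", "आठ": "8", "नौ": "9",
}


def lexical_to_digits(lexical):
    """Convert a Lexical string made only of digit words to a digit string.

    Returns None if any word is not a known digit word, so the caller
    can fall back to the ITN or Display text returned by the service.
    """
    words = lexical.split()
    if not words or any(w not in HINDI_DIGITS for w in words):
        return None
    return "".join(HINDI_DIGITS[w] for w in words)


print(lexical_to_digits("दो दो"))
print(lexical_to_digits("दो दो तीन तीन दो दो दो चार"))
print(lexical_to_digits("दो सौ"))  # contains a non-digit word, so None
```

Returning `None` for anything other than a pure digit sequence keeps the fallback from overriding the service's own normalization for ordinary sentences.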

    Regards,
    Yutong

    - Please accept the answer if you find it helpful, thanks.