We want to use your solution for speech to text service.
Our use case is the following one, we want to get the transcript from an audio, but we do not know from which language the audio is.
I noted that the language detection has some limits:
Language identification currently has a limit of four languages for single-shot recognition, and 10 languages for continuous recognition.
Is the limitation about the number of languages to search into or about the number of different languages that can be detected from the audio ?
As we do not know in which language the audio is, we may need to detect the language between all the one you can detect.
The time to detect a language between more than 2 seems quite long. What about it ?
Moreover, I tested to detect the language from an English audio between German and English, the result was German detected, it is quite weird. What can be the source of the issue ?
Do you have an audio to test your feature with ?
Thanks a lot for your precious answers,