Is it possible to get word-level timing information with the REST call?
In other words, a REST version of WordBoundary.
Something like the JSON that the speech-to-text REST service returns, listing the words and, for each word, when it starts (Offset) and how long it lasts (Duration) in the audio.
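
To make the request concrete, here is a rough Python sketch of the kind of response I have in mind and how a client might consume it. Everything in it is hypothetical: the `Words` and `Word` field names, the sample values, and the assumption that `Offset` and `Duration` are expressed in 100-nanosecond ticks are all illustrative guesses, not a documented REST schema.

```python
import json

# Hypothetical response body. The "Words"/"Word" field names and the values
# are assumptions for illustration only, not an actual REST schema.
SAMPLE_RESPONSE = """
{
  "Words": [
    {"Word": "hello", "Offset": 500000, "Duration": 3500000},
    {"Word": "world", "Offset": 4500000, "Duration": 4000000}
  ]
}
"""

# Assumption: Offset and Duration are given in 100-nanosecond ticks.
TICKS_PER_SECOND = 10_000_000


def word_timings(response_text):
    """Return (word, start_seconds, duration_seconds) for every word."""
    data = json.loads(response_text)
    return [
        (w["Word"], w["Offset"] / TICKS_PER_SECOND, w["Duration"] / TICKS_PER_SECOND)
        for w in data["Words"]
    ]


for word, start, duration in word_timings(SAMPLE_RESPONSE):
    print(f"{word}: starts at {start:.2f}s, lasts {duration:.2f}s")
```

With per-word timings like these, a client could highlight each word while the synthesized audio plays, which is the use case WordBoundary covers today.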