The new baseline models perform poorly on my data.
The previous baseline model also supported Audio.
That model (20201015) worked best for our use cases (Audio, Text, Pronunciation).
Is there a way to access older baseline models for training?
Or can we expect upcoming models to support Audio?