We currently have a separate API that serves text-to-speech responses to our users. However, we are looking at alternative approaches and noticed that Direct Line has an audio option that we could use.
Looking into it, it appears to always use the Azure text-to-speech resource. In our current version we store each audio response so that it doesn't need to be regenerated every time, which also reduces cost.
Is there a way to use Direct Line Speech with a custom method that tries to pull existing audio first, and only generates a new audio file if none is found?
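
To illustrate the kind of lookup meant here, below is a minimal Python sketch of a cache-first approach. It is not tied to Direct Line Speech or to any Azure SDK: `cache_key`, `get_audio`, the `synthesize` callback, and the file-based cache directory are all hypothetical names chosen for illustration, and the real storage and synthesis calls would differ.

```python
import hashlib
from pathlib import Path
from typing import Callable


def cache_key(text: str, voice: str) -> str:
    """Derive a stable key from the voice and the text to be spoken."""
    return hashlib.sha256(f"{voice}\n{text}".encode("utf-8")).hexdigest()


def get_audio(
    text: str,
    voice: str,
    cache_dir: Path,
    synthesize: Callable[[str, str], bytes],
) -> bytes:
    """Return cached audio if present; otherwise synthesize and store it.

    `synthesize` is a placeholder for whatever actually calls the
    text-to-speech service (an assumption, not a real SDK function).
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    path = cache_dir / f"{cache_key(text, voice)}.wav"

    # Cache hit: reuse the stored audio and skip the paid synthesis call.
    if path.exists():
        return path.read_bytes()

    # Cache miss: generate once, then persist for later requests.
    audio = synthesize(text, voice)
    path.write_bytes(audio)
    return audio
```

With something like this, the synthesis callback only runs the first time a given text and voice combination is requested; the open question is whether Direct Line Speech exposes a hook where such a lookup could be plugged in.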