Some of you have probably seen the recently released Whisper model from OpenAI. Having this work in both directions would open up some neat conversational AI ideas, such as two AIs talking to each other, say backed by GPT-3, with the output being perfect audio/speech.<p>So is there something as "state of the art" as Whisper, but for text-to-speech/audio?