TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: What's the best TTS engine you've heard?

24 点作者 dsco大约 2 年前
I&#x27;m tinkering with a podcast in my spare time, Odyssey&#x27;s Place, and am generating every episode with ChatGPT4, ElevenLabs for TTS and Midjourney4 for artwork. I&#x27;m pleased with all models but am wondering if you&#x27;ve heard TTS engines better than ElevenLabs? Google&#x27;s Tacotron sounds amazing but isn&#x27;t readily available as an API. Any kind of input would be helpful. For reference here are some audio samples: https:&#x2F;&#x2F;odysseysplace.com<p>Thanks in advance!

8 条评论

knaik94大约 2 年前
I have had some fun playing with TorToiSe TTS, which is mixed when it comes to being better than ElevenLabs. In small snippets it does sound better, but overall it does not. I mention it because it&#x27;s openly available and runs locally. I didn&#x27;t spend more than a weekend on it, and it&#x27;s popular enough to have a small community collection of voices. You have to search for them, but they&#x27;re small in size and it&#x27;s zero shot generation. It&#x27;s very similar to how stable diffusion felt when it first came out, a lot of trail and error and no consensus of the &quot;right&quot; answers.<p>The main reason why I liked it, even though the bad generations are really bad, is because you have full control of the training data set. I haven&#x27;t kept up with it in a few weeks so I am sure there have been advances I&#x27;m not aware of.<p><a href="https:&#x2F;&#x2F;git.ecker.tech&#x2F;mrq&#x2F;ai-voice-cloning" rel="nofollow">https:&#x2F;&#x2F;git.ecker.tech&#x2F;mrq&#x2F;ai-voice-cloning</a>
cloudking大约 2 年前
<a href="https:&#x2F;&#x2F;play.ht&#x2F;ultra-realistic-voices&#x2F;">https:&#x2F;&#x2F;play.ht&#x2F;ultra-realistic-voices&#x2F;</a><p><a href="https:&#x2F;&#x2F;beta.elevenlabs.io&#x2F;" rel="nofollow">https:&#x2F;&#x2F;beta.elevenlabs.io&#x2F;</a>
anthonyhn大约 2 年前
For offline&#x2F;local TTS, Coqui TTS [0] is quite good. It&#x27;s essentially a continuation of Mozilla&#x27;s TTS engine that Mozilla stopped working on ~2 years ago (and IIRC it&#x27;s largely the same team that worked on Mozilla TTS).<p>[0] <a href="https:&#x2F;&#x2F;github.com&#x2F;coqui-ai&#x2F;TTS">https:&#x2F;&#x2F;github.com&#x2F;coqui-ai&#x2F;TTS</a>
kylebuildsstuff大约 2 年前
Maybe <a href="https:&#x2F;&#x2F;beepbooply.com" rel="nofollow">https:&#x2F;&#x2F;beepbooply.com</a>? I built it myself but it combines all voices from Microsoft, Google, and Amazon into a simple interface. I find it simple, fast, and cheap when doing voiceovers for my own content.
gulabjamuns大约 2 年前
Acapela Group&#x27;s Peter voice is the best British voice I&#x27;ve come across.<p>Earlier you could just buy the voice pack for a reasonable amount, now they have complicated the purchase quite a bit.
gostsamo大约 2 年前
Check the Microsoft tts voices. They have them as an api service.
评论 #35269436 未加载
qgin大约 2 年前
Google Cloud&#x27;s new Neural2 voices are pretty great. I think they may actually be the Tacotron voices but I can&#x27;t say for sure.
tornato7大约 2 年前
The Python TTS package implements Tacotron, or so they claim. I haven&#x27;t been able to get that package to work for myself recently!