TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Parler-TTS: Natural language guidance of high-fidelity TTS

70 点作者 forgingahead大约 1 年前

5 条评论

column大约 1 年前
I've tried it on my laptop and it is about as slow/fast as xtts. But as far as I can there's no way of keeping a consistent voice from generation to generation. If so, I don't really get the appeal. If there was a way to get consistent, then that's great for NPCs.
评论 #40003723 未加载
IronWolve大约 1 年前
Lots of these are nice sounding, but still far from quality of simply importing a text file ebook and getting a nice sounding audiobook.
josephh大约 1 年前
Does anyone know of a good text normalization (?) library that converts symbols and initialisms into plain English before feeding them into a TTS model? All the models that I've used so far do a horrible job at synthesizing speech for them and I'm wondering whether this is the missing piece in the pipeline.
评论 #40004847 未加载
mdrzn大约 1 年前
All the "Voice cloner" TTS I tried only work in English language, whenever tried with Italian language it doesn't mimic the original voice at all.
Y_Y大约 1 年前
There are two hard problems in computer science; naming things.<p>Unfortunate that this shares a name with a much-maligned microblogging site. Probably it&#x27;s not a good idea to take unmodified everyday words[0] from a widely spoken language as your product name, see also e.g. &quot;Triton&quot;.<p>[0] In this case &quot;parler&quot; is French for &quot;to speak&quot;
评论 #40006970 未加载