科技回声

Hola amigos - I just noticed that https://coqui.ai/ is "Shutting down".I'm building a web app (React / Django) which takes a list of affirmations & goals (in Markdown files), puts them into a database (SQlite), and uses voice synthesis to create voice audio files of the phrases. These are combined with a relaxed backing track (ffmpeg), made into playlists of 10-20 phrases (randomly sampled, or according to a theme: "mind" "body" "soul") and then play automatically in the morning & evening (cron). This allows you to persistently hear & vocalize your own goals & good vibes over time.I had been planning to use Coqui TTS as the local text-to-speech engine, but with this cancellation, I'd love to hear from the community what is a great open-source, local text-to-speech engine?Generally, I learn both the highest quality commercially available technology (example: ElevenLabs), and also the best open-source equivalent. Would love to hear suggestions & perspectives on this. What voice synth tools are you investing your time into learning & building with?

2 条评论

illuminant大约 1 年前

Mozilla's browser tts is kind of not bad, just parse and buffer one sentence at a time and it does all right.For the backend, I've experimented with piper, which has a lot of voices and accents, though it's tricky to buffer and sync long texts.<a href="https://github.com/rhasspy/piper">https://github.com/rhasspy/piper</a>

082349872349872大约 1 年前

eSpeak NG?

评论 #40288818 未加载

Ask HN: Open-source, local Text-to-Speech (TTS) generators

2 条评论

Ask HN: Open-source, local Text-to-Speech (TTS) generators

2 条评论