Speech-to-video synthesis: Real-time rendering of speech

1 point by cwbuilds over 1 year ago
Hi guys,

After some research and no luck finding anyone that seems to be working on this, I thought I'd try a Hail Mary and post on here.

I'm looking to speak to anyone who is working on speech-to-video (real-time speech rendering). We already have software which can take audio (speech) input and render a video which resembles a person or avatar speaking, but it takes a long time to render.

How long will it be before the video of the person/avatar speaking will be renderable in near real-time, with similar latency to existing speech-to-text models?

What would the prototype look like to reduce the latency? Is anyone working on anything like this?

For context, I run a language learning app where you can practice speaking orally with AI. It would be far more engaging if the user had an avatar/person to be able to speak to, rather than staring at the chat history whilst talking to the AI conversation partner.

Thanks, Chris

For context, here's the original post: https://news.ycombinator.com/item?id=36973400

1 comment

billconan over 1 year ago
this?

https://www.heygen.com/article/unleashing-the-power-of-realtime-avatars

https://docs.trypromptly.com/guides/realtime-avatar-with-rag