TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Speech-to-video synthesis: Real-time rendering of speech

1 点作者 cwbuilds超过 1 年前
Hi guys,<p>After some research and no luck finding anyone that seems to be working on this, I thought I&#x27;d try a Hail Mary and post on here.<p>I&#x27;m looking to speak to anyone who is working on speech-to-video (real-time speech rendering). We already have software which can take audio (speech) input and render a video which resembles a person or avatar speaking, but it takes a long time to render.<p>How long will it be before the video of the person&#x2F;avatar speaking will be renderable in near real-time, with similar latency to existing speech-to-text models?<p>What would the prototype look like to reduce the latency? Is anyone working on anything like this?<p>For context, I run a language learning app where you can practice speaking orally with AI. It would be far more engaging if the user had an avatar&#x2F;person to be able to speak to, rather than staring at the chat history whilst talking to the AI conversation partner.<p>Thanks, Chris<p>For context, here&#x27;s the original post: https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=36973400

1 comment

billconan超过 1 年前
this ?<p><a href="https:&#x2F;&#x2F;www.heygen.com&#x2F;article&#x2F;unleashing-the-power-of-realtime-avatars" rel="nofollow">https:&#x2F;&#x2F;www.heygen.com&#x2F;article&#x2F;unleashing-the-power-of-realt...</a><p><a href="https:&#x2F;&#x2F;docs.trypromptly.com&#x2F;guides&#x2F;realtime-avatar-with-rag" rel="nofollow">https:&#x2F;&#x2F;docs.trypromptly.com&#x2F;guides&#x2F;realtime-avatar-with-rag</a>
评论 #39197565 未加载