OmniHuman-1: Human Animation Models

186 points, posted by fofoz 3 months ago

12 comments

vessenes, 3 months ago
These look... great, by and large. Hands are super natural, coherency is really high. Showing off piano chord blocking is a huge flex.

I’d like to play with this! No code, but ByteDance often releases models, so I’m hopeful. It’s significantly better than VASA, and looks likely to be an iteration of that architecture.
iandanforth, 3 months ago
Many of these have tells, but this one fully crossed the uncanny valley for me: https://www.youtube.com/watch?v=1NU8NzvAxEg&t=16s

Good to know that I now need to assume performances are AI generated even if it's not obvious that they are!
smusamashah, 3 months ago
What are the tells in most of these videos? I can't point at any in many of them. Hands, teeth, lip sync, body and shoulder movement all look correct. Especially the TED-talk-like presentation examples near the bottom.
smusamashah, 3 months ago
This looks better than EMO (also closed source, by Alibaba Group: https://humanaigc.github.io/emote-portrait-alive/). See the rap example on their page. They apparently have EMO2 now, which doesn't look as believable to me.

EMO covers head + shoulders, while OmniHuman-1 covers the full body and looks even better. I would have easily mistaken these for real (especially while doomscrolling) if I was not looking for AI glitches.

UPDATE: Googling "animate bytedance site:github.io" returns many in the same domain (all proprietary). Found a few good ones.

- https://byteaigc.github.io/X-Portrait2/ Very expressive lifelike portrait animations
- https://byteaigc.github.io/x-portrait/ (previous version of the same, has source: https://github.com/bytedance/X-Portrait)
- https://loopyavatar.github.io/ (portrait animations, looks good)
- https://cyberhost.github.io/
- https://grisoon.github.io/INFP/
- https://grisoon.github.io/PersonaTalk/
- https://headgap.github.io/
- https://kebii.github.io/MikuDance/ anime animations
ggerules, 3 months ago
This is a very good attempt at people playing musical instruments.

But there are some subtle timing tells that this is AI generated. Take a look at the singer playing the piano: the timing of the hands with the singer is slightly off. The same goes for the singer and the guitar. I'm not a guitar player or piano player, but I do play a lot of different musical instruments at a high level, and the timing looks off, slightly ahead of or behind the actual audio of the piece of music.
latexr, 3 months ago
> Ethics Concerns

> The images and audios used in these demos are from public sources or generated by models, and are solely used to demonstrate the capabilities of this research work. If there are any concerns, please contact us (jianwen.alan@gmail.com) and we will delete it in time.

Ethical concerns with this technology have nothing to do with videos on a demo page, and everything to do with what can be generated later.

I don’t know if they have a profound lack of understanding of the ethical implications or are purposefully trying to pretend, but neither is good.
kiwiguy1, 3 months ago
I run YouTube channels with almost 2 billion views and this actually concerns me. I would love to try this in my productions!!
lamnguyenx, 3 months ago
NVIDIA's Audio2Face demo is such a joke compared to this one.
egnehots, 3 months ago
This could be used as an incredibly low-bitrate codec for some streaming use cases (video conferencing or podcasts on <3G, for example: just send some keyframes plus the audio).
mpalmer, 3 months ago
...I feel slapped by progress. Rarely does such an impressive demo leave me feeling less inspired and hopeful about the future.
emsign, 3 months ago
It looks funny.
golol, 3 months ago
Modern operating systems should include by default a very simple private/public key system to sign arbitrary files. I think it should not be very complicated? We badly need this in the age of AI.
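
For anyone wondering how little is needed for the core idea, here is a minimal sketch of signing and verifying a file's bytes using only the Python standard library. It uses a toy Lamport one-time signature built on SHA-256, chosen purely because it needs no third-party packages; it is not what any operating system ships, each key pair may safely sign only one file, and the function names (generate_keypair, sign, verify) are made up for this illustration. A real system would more likely rely on an established scheme such as Ed25519 from a vetted library, plus some way to distribute and trust public keys, which is the hard part this sketch ignores.

```python
import hashlib
import secrets

HASH_BITS = 256  # SHA-256 produces a 256-bit digest


def _sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def _digest_bits(data: bytes):
    """Return the SHA-256 digest of data as a list of 256 bits (0 or 1)."""
    digest = _sha256(data)
    return [(digest[i // 8] >> (7 - i % 8)) & 1 for i in range(HASH_BITS)]


def generate_keypair():
    """Return (private_key, public_key). Hypothetical helper; ONE signature per key pair."""
    # Private key: one pair of random 32-byte secrets per digest bit.
    private_key = [
        (secrets.token_bytes(32), secrets.token_bytes(32))
        for _ in range(HASH_BITS)
    ]
    # Public key: the hash of every secret, safe to publish.
    public_key = [(_sha256(a), _sha256(b)) for a, b in private_key]
    return private_key, public_key


def sign(data: bytes, private_key):
    """Sign data by revealing, for each digest bit, the secret matching that bit."""
    return [private_key[i][bit] for i, bit in enumerate(_digest_bits(data))]


def verify(data: bytes, signature, public_key) -> bool:
    """Check that every revealed secret hashes to the matching public value."""
    if len(signature) != HASH_BITS:
        return False
    return all(
        secrets.compare_digest(_sha256(revealed), public_key[i][bit])
        for i, (revealed, bit) in enumerate(zip(signature, _digest_bits(data)))
    )


if __name__ == "__main__":
    private_key, public_key = generate_keypair()
    video = b"raw bytes of a video file"  # stand-in for real file contents
    signature = sign(video, private_key)
    print(verify(video, signature, public_key))            # True
    print(verify(video + b"edit", signature, public_key))  # False
```

Note that a valid signature only shows that the holder of the private key signed these exact bytes. It says nothing about whether the content was AI generated in the first place.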