TechEcho
A tech news platform built with Next.js, providing global tech news and discussions.

© 2025 TechEcho. All rights reserved.

Meta AI releases CoTracker, a model for tracking any points (pixels) on a video

345 points by crakenzak over 1 year ago

15 comments

thethimble over 1 year ago
Does anyone understand the business angle for Meta here with these models? I still don’t understand why their research division exists or how it relates to their core business. I’m a huge admirer of their work, but I don’t understand the why.
tobr over 1 year ago
Not surprised that this performs so well, considering Facebook’s long experience with tracking pixels.
crakenzak over 1 year ago
Paper: https://arxiv.org/abs/2307.07635
Github: https://github.com/facebookresearch/co-tracker
Demo: https://huggingface.co/spaces/facebook/cotracker
xnx over 1 year ago
Neat. It's mentioned on Facebook's page, but here is Google's version of point tracking, which is Apache-2.0 licensed: https://deepmind-tapir.github.io
petargyurov over 1 year ago
I think Meta's goals are becoming clearer: they want to make VR *unbelievable*. Judging by this and by SAM, they want an AI system that can understand the world around itself in *real time*.
throwaw12 over 1 year ago
I wonder how research works inside product companies.

As an engineer working in a product company, my focus switches between priorities set by the PM and quarterly re-adjustments of the goals/strategies.

I can't imagine the same thing being done in research:

    * Hey, when are we releasing the model for tracking any points?
    * Can you estimate how long it takes to fix the issue you found in tracking accuracy?
runeks over 1 year ago
I wonder how this compares to the motion estimation algorithms in the x264 and x265 video codecs. If it's better, it could be used at the motion estimation stage of these codecs to improve video compression.
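For readers unfamiliar with the "motion estimation stage" mentioned above, here is a minimal sketch of the textbook idea behind it: exhaustive block matching using a sum of absolute differences (SAD). It is not how x264 or x265 are implemented (real encoders use faster search patterns and sub-pixel refinement), and it is not CoTracker. Every function name, the block size, the search range, and the synthetic frames are assumptions made purely for illustration.

```python
# Illustrative exhaustive block-matching motion search (pure Python).
# Frames are 2D lists of grayscale intensities. All names are hypothetical.

def sad(ref, cur, rx, ry, cx, cy, size):
    """Sum of absolute differences between the block of `ref` at (rx, ry)
    and the block of `cur` at (cx, cy)."""
    total = 0
    for dy in range(size):
        for dx in range(size):
            total += abs(ref[ry + dy][rx + dx] - cur[cy + dy][cx + dx])
    return total


def find_motion_vector(ref, cur, cx, cy, size=4, search=2):
    """Return (dx, dy, cost): the offset into `ref`, within +/- `search`
    pixels, whose block best matches the block of `cur` at (cx, cy)."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            rx, ry = cx + dx, cy + dy
            # Skip candidate blocks that fall outside the reference frame.
            if rx < 0 or ry < 0 or rx + size > width or ry + size > height:
                continue
            cost = sad(ref, cur, rx, ry, cx, cy, size)
            if best is None or cost < best[2]:
                best = (dx, dy, cost)
    return best


def make_frame(width, height, x0, y0, size=4):
    """Synthetic frame: black background with one textured square at (x0, y0)."""
    frame = [[0] * width for _ in range(height)]
    for dy in range(size):
        for dx in range(size):
            frame[y0 + dy][x0 + dx] = 100 + 10 * dy + dx
    return frame


previous_frame = make_frame(12, 12, 3, 4)
current_frame = make_frame(12, 12, 5, 5)  # the square moved by (+2, +1)

# The block at (5, 5) in the current frame came from (3, 4) in the previous one,
# so the best offset back into the previous frame is (-2, -1) with zero cost.
motion = find_motion_vector(previous_frame, current_frame, 5, 5)
print(motion)  # (-2, -1, 0)
```

An encoder then stores the vector plus the (ideally small) residual instead of the raw block, which is where the compression gain comes from. A learned tracker would only help if its vectors produced smaller residuals than the codec's own search, at an acceptable compute cost.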
rasz over 1 year ago
15 years ago, on a single-CPU-core laptop: https://www.robots.ox.ac.uk/~gk/PTAM/

Parallel Tracking and Mapping for Small AR Workspaces (PTAM): https://www.youtube.com/watch?v=Y9HMn6bd-v8

No ML, pure computer vision in C.
marcopicentini over 1 year ago
These open-source AI models will create hundreds of AI startups with no competitive advantage: a highly competitive market with low margins.
jeffreygoesto over 1 year ago
Nice to see that Andrew Zisserman made it into the AI age. He and Hartley were my multi-view heroes back in the day... And Faugeras, of course...
appplication over 1 year ago
One present-day challenge I’ve noticed is reverse video search. There are no good platforms for it like there are for reverse image search. I wonder if this ability to quantize videos would let you build more efficient indices of videos that you could check some input against.
xwdv over 1 year ago
Oh wow, this will be the end of complicated motion capture rigs. Indie developers can do motion capture for their own complex 3D characters in the comfort of a home. Good.
ImHereToVote over 1 year ago
Are there models that can perform this in real-time? How does this stack up?
skenderbeu over 1 year ago
I'm sure there will be some future AR applicability with this.
iFire over 1 year ago
LICENSE

Attribution-NonCommercial 4.0 International

https://github.com/facebookresearch/co-tracker/blob/main/LICENSE.md