TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Apple releases Depth Pro, an AI model that rewrites the rules of 3D vision

112 pointsby bentocorp8 months ago

8 comments

sva_8 months ago
<a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41738022">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=41738022</a>
ipsum28 months ago
Title is such clickbait, it does not rewrite the rules of 3d vision, it is a marginal improvement on existing models, and does not work for video, only images. However, Apple open sourced the model weights, which is amazing for research.
评论 #41748224 未加载
评论 #41748404 未加载
评论 #41748620 未加载
评论 #41748693 未加载
fh9738 months ago
This article has a link to the live demo.<p><a href="https:&#x2F;&#x2F;huggingface.co&#x2F;spaces&#x2F;akhaliq&#x2F;depth-pro" rel="nofollow">https:&#x2F;&#x2F;huggingface.co&#x2F;spaces&#x2F;akhaliq&#x2F;depth-pro</a><p>For some pictures it outputs something reasonable, for others it&#x27;s completely broken (black with colored noise in one area).
zimpenfish8 months ago
Just tried it on a &quot;difficult&quot; image (relatively low contrast photo of a small thin plant in front of a tree trunk with a distant fence in one corner) and it did a pretty good job, I think - <a href="https:&#x2F;&#x2F;imgur.com&#x2F;a&#x2F;Sqr6hR8" rel="nofollow">https:&#x2F;&#x2F;imgur.com&#x2F;a&#x2F;Sqr6hR8</a> including the depth maps.
评论 #41748360 未加载
LeoPanthera8 months ago
This presumably is the same model that the Vision Pro Photos app uses to convert 2D photos to 3D.
kylehotchkiss8 months ago
Was this trained on iPhone photos since there is a decent amount of depth references within iPhone cameras? It’s interesting to see how clearly it understands depth of field. With that, how does it perform on F16 and above?
skykooler8 months ago
interesting. They claim 0.3 seconds on a consumer GPU; I thought that might scale to 30 seconds or so on CPU but gave up waiting after twelve minutes.
评论 #41749095 未加载
dyauspitr8 months ago
Can I use this to generate accurate depth maps from 2-D images that I can then CNC or 3-D print?
评论 #41748608 未加载
评论 #41748810 未加载
评论 #41749019 未加载