TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Gaussian Splatting SLAM

91 点作者 shevis9 个月前

4 条评论

dwrodri9 个月前
Tangentially related to the post: I have what I think is a related computer vision problem I would like to solve and need some pointers on how you would go about doing it.<p>My desk is currently set up such that I have a large monitor in the middle. I&#x27;d like to look at the center of the screen when taking calls. I&#x27;d also like it to appear as though I am looking straight into the camera, and the camera is pointed at my face. Obviously, I cannot physically place the camera right in front of the monitor as that would be seriously inconvenient. Some laptops solve but I don&#x27;t think their methods apply here as the top of my monitor ends up being quite a bit higher than what would look &quot;good&quot; for simple eye correction.<p>I have multiple webcams that I can place around the monitor to my liking. I would like to have something similar to what is seen when you open this webpage, but for a video. hopefully at higher quality since I&#x27;m not constrained to a monocular source.<p>I&#x27;ve dabbled a bit with OpenCV in the past, but the most I&#x27;ve done is a little camera calibration for de-warping fisheye lenses. Any ideas on what work I should look into to get started with this?<p>In my head, I&#x27;m picturing two camera sources: one above and one below the monitor. The &quot;synthetic&quot; projected perspective would be in the middle of the two.<p>Is capturing a point cloud from a stereo source and then reprojecting with splats the most &quot;straightforward&quot; way to do this? Any and all papers&#x2F;advice are welcome. I&#x27;m a little rusty on the math side but I figure a healthy mix of Szeliski&#x27;s Computer Vision, Wolfram Alpha, a chatbot, and of course perseverance will get me there.
评论 #41227721 未加载
评论 #41229982 未加载
评论 #41230763 未加载
评论 #41232018 未加载
评论 #41232382 未加载
totalview9 个月前
I love the “3D Gaussian Visualisation” section that illustrates the difference between photos of the mono data and the splat data. The splats are like a giant point cloud under the hood, except unlike point clouds which have uniform size, different splats have different sizes.<p>This all is well and good when you are just using for a pretty visualization, but it appears gaussians have the same weakness as point clouds processed with structure from motion, in that you need lots of camera angles to get quality surface reconstruction accuracy.
评论 #41226086 未加载
andybak9 个月前
This claims to work with monocular or RGB+depth but the only live demo is for an Intel Realsense d455 RBGD camera. That seems a shame as it significantly raises the bar for people to try it out themselves. (Can you even still buy the d455?)
评论 #41224798 未加载
评论 #41224723 未加载
评论 #41225582 未加载
评论 #41226160 未加载
Dig1t9 个月前
I would love to use something like this to make a video game.<p>Are there any examples or algorithms that can turn this into 3D objects that could be used in a video game? Any examples of someone doing that?
评论 #41230619 未加载
评论 #41230016 未加载