科技回声

collingreen6 个月前

It's interesting how there are so many papers published in this space but they all tend to use these same few images (the webcam chessboard, the two buildings, and the diagrams). I've been looking into how to reliably stitch video frames into "orthographs" (similar problem is faced by aerial surveys; there is a lot of good work on this from the drone community) recently and have read probably two dozen recent papers spanning homography, photogrammetry, feature detection, sfm, nerf, and segmentation and most of them reuse these diagrams and at least some of these images.Maybe the world would benefit from some more well documented, open licensed training/validation data?

评论 #42314220 未加载

评论 #42315252 未加载

amstan6 个月前

> Multiple View Geometry in Computer Vision, Richard Hartley and Andrew Zisserman, [117] (some sample chapters are available here, CVPR Tutorials are available here)Heh, about 10 years ago I read that book and figured out a few things:* triangulate position in 3d space of an object given a few 2d pictures (from known camera locations, ie: camera intrinsic and extrinsic matrices)* how to instruct my friend to make a simple 3d rendering "engine"Nice to see the same stuff distilled into an article.

Neeloppher6 个月前

Intresting

Homography Explained with Code

3 条评论

Homography Explained with Code

3 条评论