科技回声

8 条评论

loxias3 个月前

I look forward to reading this in closer detail, but it looks like they solve an inverse problem to recover a ground truth set of voxels (from a large set of 2d images with known camera parameters), which is underconstrained. Neat to me that it works w/o using dense optical flow to recover the structure -- I wouldn't have thought that would converge.Love this a whole heck of a lot more than NeRF, or any other "lol lets just throw a huge network at it" approach.

评论 #43135111 未加载

markisus3 个月前

This is basically Gaussian splat using cubes instead of Gaussians. The cube centers and sizes choices are discrete and non overlapping, hence the name “sparse voxel”. The qualitative results and rendering speeds are similar to Gaussian splat, and it’s sometimes better or worse depending on the scene.

HexDecOctBin3 个月前

Why is this called rendering, when it would be more accurate to call it reverse-rendering (unless "rendering" means any kind of transformation of visual-adjacent data)?

评论 #43137435 未加载

bondarchuk3 个月前

Funny, it almost sounds like a straight efficiency improvement of Plenoxels (the direct predecessor of gaussian splatting), which would mean gaussian splatting was something of a a red herring/sidetrack. Though I'm not sure atm where the great performance gain is. Definitely interesting.

评论 #43136317 未加载

aaroninsf3 个月前

Can someone ELI5 what the input to these renders is?I'm familiar with the premise of NeRF "grab a bunch of relatively low resolution images by walking in a circle around a subject/moving through a space", and then rendering novel view points,but on the landing page here the videos are very impressive (though the volumetric fog in the classical building is entertaining as a corner case!),but I have no idea what the input is.I assume if you work in this domain it's understood,"oh these are all standard comparitive output, source from <thing>, which if you must know are a series of N still images taken... " or "...excerpted image from consumer camera video while moving through the space" and N is understood to be 1, or more likely, 10, or 100......but what I want to know is,are these video- or still-image input;and how much/many?

评论 #43134640 未加载

评论 #43134401 未加载

magicalhippo3 个月前

Reminded me of the Radian Foam article posted here[1] not long ago, though the focus there was on being differentiable.[1]: <a href="https://news.ycombinator.com/item?id=42931109">https://news.ycombinator.com/item?id=42931109</a>

atilimcetin3 个月前

I think this paper is as important as original Gaussian Splatting paper.

评论 #43134768 未加载

davikr3 个月前

What is the usecase for radiance fields?

评论 #43134987 未加载

8 条评论

loxias3 个月前

评论 #43135111 未加载

markisus3 个月前

HexDecOctBin3 个月前

Why is this called rendering, when it would be more accurate to call it reverse-rendering (unless "rendering" means any kind of transformation of visual-adjacent data)?

Sparse Voxels Rasterization: Real-Time High-Fidelity Radiance Field Rendering

8 条评论

Sparse Voxels Rasterization: Real-Time High-Fidelity Radiance Field Rendering

8 条评论