Can someone ELI5 what the <i>input</i> to these renders is?<p>I'm familiar with the premise of NeRF "grab a bunch of relatively low resolution images by walking in a circle around a subject/moving through a space", and then rendering novel view points,<p>but on the landing page here the videos are very impressive (though the volumetric fog in the classical building is entertaining as a corner case!),<p>but I have no idea what the <i>input</i> is.<p>I assume if you work in this domain it's understood,<p>"oh these are all standard comparitive output, source from <thing>, which if you must know are a series of N still images taken... " or "...excerpted image from consumer camera video while moving through the space" and N is understood to be 1, or more likely, 10, or 100...<p>...but what I want to know is,<p>are these video- or still-image input;<p>and how much/many?