The emphasis here is Single Image, but can this model generate with multiple images too?<p>We know that a single image of an object physically can't cover all the sides of it, so it's all guesswork in AI. This is totally fine for certain scenario, but in lots of other cases, it's trivial to have multiple images of the same object, and if that offers higher fidelity, it's totally worth it.<p>I'm aware there are many algorithms or AI models that already
do that. I'm asking about Stability's one specifically because if they have impressive Single Image result, surely their multi-image results would also be much better than state-of-the-art?
Just tried to run this using their sample script on my 4090 (which has 24GB of VRAM). It ran for a little over 1 minute and crashed with an out-of-memory error. I tried both SV3D_u and SV3D_p models.<p>[edit]Managed to generate by tweaking the script to generate less frames simultaneously. 19.5GB peak VRAM usage, 1 min 25 secs to generate at 225 watts.[/edit]
If the animations shown are representative, then the mesh output may very well be good enough to use in a 3d printer.<p>Looking forward to experimenting with this.
I'd like to play around with something like this, but from my understanding my machine (Macbook, 2021 M1) isn't nearly powerful enough (right?). Are there remote/cloud environments where I can run models like this?
Im sorry for dumb lazy question. But would the input require more than one image? Is there a demo url to test this? I think it might jsut be time to buy a 3d printer.<p>EDIT> Does "single image inputs" mean more than one image?
> Stable Video 3D (SV3D) is a generative model based on Stable Video Diffusion that takes in a still image of an object as a conditioning frame, and generates an orbital video of that object.<p>So can it actually output a 3d model? Or just images of what it thinks the object would look like from other angles?
All the examples resemble plastic children's toys...<p>How would it handle other objects? (People, fabrics, buildings, plants, mountains, mechanical parts, etc)
How has no one drawn attention to this was science fiction in Enemy of the State in 1998 now a trivial reality <a href="https://www.youtube.com/watch?v=4AjLXZV46eE" rel="nofollow">https://www.youtube.com/watch?v=4AjLXZV46eE</a>
They compare against Zero123-XL, but they should compare against MVDream instead. MVDream is quite good. If you fiddle with the loss you can get even better results.