科技回声

8 条评论

andybak超过 2 年前

Sigh. At this point - if I can't try it out, then I don't really care. It's just a tease.

CuriouslyC超过 2 年前

Fidelity on the output isn't great, but the coherence (assuming the examples weren't massively cherry-picked) seems very good. Given the number of parameters this should be able to run on end-user machines, and in theory this could be fine tuned to produce better looking output than stable diffusion/etc.What this model does more than anything else is demonstrate we're still in the early stages of generative models, and we can expect a lot of progress from architectural improvements over the next decade (in addition to the progress in compute and data that we're already counting on).

mikemoka超过 2 年前

Here is an available implementation:<a href="https://github.com/lucidrains/muse-maskgit-pytorch">https://github.com/lucidrains/muse-maskgit-pytorch</a>

评论 #34418345 未加载

Garlef超过 2 年前

It'd be interesting to see some results where the training set has higher artistic quality (and how this model influences the "house style"). The output does not look great when compared to what other (trained) models deliver.But the promise of a big efficieny gain will be an incentive for companies like midjourney to give it a go with their data.

seydor超过 2 年前

More amazement . I wonder where this field will end up. Cute animal and nature images are nice but have limited real-life use (i mean, we have to accept that visual media ends after everyone can be an artist). I wonder when we 'll start interfacing language models with robotics to do some real-life work

评论 #34421293 未加载

评论 #34418413 未加载

deepsquirrelnet超过 2 年前

> Compared to pixel-space diffusion models, such as Imagen and DALL-E 2, Muse is significantly more efficient due to the use of discrete tokens and requiring fewer sampling iterations;Am I wrong or is that the same architecture as DALL-E 1?

pr337h4m超过 2 年前

Would stuff like DreamBooth and textual inversion be usable with transformer models like this one?<a href="https://dreambooth.github.io/" rel="nofollow">https://dreambooth.github.io/</a> <a href="https://textual-inversion.github.io/" rel="nofollow">https://textual-inversion.github.io/</a>

kleiba超过 2 年前

Please stop teasing and post the link to your free trial web interface. Please?

评论 #34416765 未加载

8 条评论

andybak超过 2 年前

Sigh. At this point - if I can't try it out, then I don't really care. It's just a tease.

CuriouslyC超过 2 年前

mikemoka超过 2 年前

Here is an available implementation:<a href="https://github.com/lucidrains/muse-maskgit-pytorch">https://github.com/lucidrains/muse-maskgit-pytorch</a>

Muse: Text-to-Image Generation via Masked Generative Transformers

8 条评论

Muse: Text-to-Image Generation via Masked Generative Transformers

8 条评论