I recreated famous album covers with DALL-E

230 点作者 lucytalksdata将近 3 年前

28 条评论

powersnail将近 3 年前

DALL-E is still highly probabilistic in its judgement. For instance, in this article, it keeps putting "fire" in the the background on something that is likely to be on fire, rather than lighting up the person.I have a similar experience. In my own experiment, I can't get DALL-E to turn off the street lamp at a bus stop in the darkness. I've tried "no light", "broken street lamp", etc.; no use. Any mention of "street lamp", the scene will include a working street lamp.It's just more probable that a scene with a lamp in the darkness must have that lamp providing light, and this is something that DALL-E will not break out of.

评论 #32534945 未加载

评论 #32535004 未加载

评论 #32535084 未加载

评论 #32535312 未加载

评论 #32535444 未加载

评论 #32535001 未加载

randymy将近 3 年前

Worth noting that DALL-E automatically “rejects attempts to create the likeness of any public figures, including celebrities". So, you wouldn't be able to get an image that included the 4 Liverpudlians. It does allow you to create fake faces. Might be fun to try and recreate Miles Davis Tutu, Aladdin Sane, Piano Man.

评论 #32535779 未加载

评论 #32535572 未加载

评论 #32535339 未加载

Michelangelo11将近 3 年前

Man, after seeing Stable Diffusion's output, DALL-E's looks just janky. Like watching a propeller plane after seeing a jet.Crazy how fast the tech is moving.

评论 #32535590 未加载

cowmix将近 3 年前

After getting access to the beta, combined with all these HN posts -- I've determined DallE2 is neat but no where as great as the initial samples made me believe.

评论 #32535605 未加载

nprateem将近 3 年前

An upvote for whoever can give me a prompt to generate an image of someone who's been massaged so much their body has been flattened, as if they were made of dough or jelly or something.I spent ages on this earlier getting nowhere. I'm starting to think DALL-E is better if you don't really know what you want and you're just fishing for ideas.

评论 #32535183 未加载

评论 #32537563 未加载

评论 #32534651 未加载

评论 #32534909 未加载

soneca将近 3 年前

Have anyone given a prompt to Dall-e of designing a company website and included “make it pop!”?Maybe the AI will finally get what designers always complained about annoying clients.

评论 #32534379 未加载

评论 #32535927 未加载

评论 #32535333 未加载

yummybear将近 3 年前

I love seeing people experiment with this technology. You can feel we’re on the cusp of something great - whatever it is, we’re just not quite there yet.

teddyh将近 3 年前

> The question is, when the music blows up and the artwork becomes a signature, like the Rolling Stones' Tongue & Lips, who will own the copyright?That’s what trademarks are for.

评论 #32534157 未加载

w0mbat将近 3 年前

How do you know that the album covers are not part of the corpus of images that DALL-E was trained on in the first place?

phonescreen_man将近 3 年前

Interestingly related, I just used AI image generation to create my EP cover.. first I tried running luciddrains dall e 2 PyTorch implementation using the prompt “death by Tetris EP album cover 2022” unfortunately I am using a Mac Pro so the gpu was not able to work. Then I tried imagen PyTorch implementation and used same keyword. This time it was working with the CPU unfortunately 2 days in we had a power outage so I had something but nothing complete. So I fed the generated image into the google dream generator and got my album cover!<a href="https://willsimpson.hearnow.com/" rel="nofollow">https://willsimpson.hearnow.com/</a>

fimdomeio将近 3 年前

There are a lot of articles focusing on how close does DALL-E match some prompt, but I wonder if this is a suboptimal way to explore the medium.What if you can get a lot more out of it by embracing the unexpected responses. Can it be a tool for exploring lateral thinking? You provide a prompt computer responds with images that are a prompt to human. A baby swiming next to a dolar bill outputs a distorted person face inside a dolar bill with some baby features, could be the start to a rabbit hole of prompts and images where you'll end up with something completly different than your initial expectations.

tsimionescu将近 3 年前

It's interesting that the prompts that would do badly in a Google image search also seem to be the ones that make poor prompts. Basically, it seems that rather than describing a scene, you have to try to give an analogy for some image(s) that it might have in its training set - which is why, I believe, "banana in the style of Andy Warhol" produces a much higher quality result than "Outline of prism on a black background in the middle of scene splits a beam of light coming from the left side into rainbow on the right side".

评论 #32535245 未加载

xwdv将近 3 年前

Although AI artists will destroy a lot of jobs, it will also create demand for new jobs for people who specialize in “paint overs” – taking a high concept output created by AI artists and touching it up to perfection.Or perhaps even beyond just a paint over, and into the realm of recreating an entire AI artwork but with a human touch to get details just right.Looking forward to it.

评论 #32534567 未加载

评论 #32534921 未加载

评论 #32534272 未加载

评论 #32534225 未加载

评论 #32534488 未加载

waveywaves将近 3 年前

DALL E works really well if you are specific enough. When you don't get the intended result, it helps to identify the element which wasn't generated properly and then improve your description of the same."Two men, one of whom is on fire, shaking hands on an industrial lot." can be rewritten as, "Two men, shaking hands, standing on an industrial lot. Person on the right is on fire. Camera is 30 metres away."You can go into more specifics of the framing and the angle from which you want the picture to be take. By default, DALL E will give you the most realistic generations to your prompts unless you mention "digital art" or a particular art style. I have gotten the best results when generating art instead of photos.

spike021将近 3 年前

I haven't gotten to try it for myself, but I've read a few of these blogs that take you through generating examples or even look-alikes to older art pieces.It surprisingly reminds me a lot of when I traveled to Japan without knowing really any Japanese. I needed to communicate not only with friends who don't know much English either, but also other people (like restaurant wait staff, train station staff, etc.).I used Google Translate often, but many times I or my friend(s) (or the other people) would need to re-write our statements a few times until the translation result clicked well enough in each other's languages to be understandable.

google234123将近 3 年前

Is the issue with faces a deliberate choice by the devs?

评论 #32538553 未加载

pjgalbraith将近 3 年前

I've been recreating the 50 worst heavy metal album art using AI as well, currently at 30. Recently I've found Stable Diffusion plus DALL-E inpainting to be a good combination.<a href="https://twitter.com/P_Galbraith/status/1560469019605344256" rel="nofollow">https://twitter.com/P_Galbraith/status/1560469019605344256</a>

_HMCB_将近 3 年前

Very cool. But it just goes to show the impact of human creativity. The conceptual aspect.

bryanrasmussen将近 3 年前

No Smell the Glove cover, this is a black day for rock and roll!

remote_phone将近 3 年前

If you gave those same instructions to humans I’m sure the output would be just as varied. I’d be interested to see a comparison between dall-e and humans.

wodenokoto将近 3 年前

I'd love to see what it had come up with if simply prompted for "Album cover for Nevermind by Nirvana"

kaffeeringe将近 3 年前

I wonder, how much energy is being burned for these kinds of experiments.

评论 #32539902 未加载

dsign将近 3 年前

It's going to leave all those artists without a job, you just wait!!

NonNefarious将近 3 年前

Went to use my invite, and OpenAI demands your PHONE NUMBER.No excuse for it. Screw that.

sgt101将近 3 年前

Look, it's trained on these images.It's really great and cool and all - but it's retrieving things that it was trained on.Show me something original it did.

评论 #32535147 未加载

评论 #32536962 未加载

评论 #32534301 未加载

machinekob将近 3 年前

Is i do smth with DALL-E auto top hacker news post i saw like 20 post like that in past 2 weeks.

system2将近 3 年前

DALL-E still seems very useless. Reminds me of the hype of Cardano.

andreyk将近 3 年前

I wonder how long the novelty of DALL-E will persist. HN seems to upvote anything titled "I did X with DALL-E". This is a fun post, but it's not that interesting or surprising. Still worth a look don't get me wrong, but personally didn't learn anything new from it. (eg recreating the famous pink Floyd cover with "Outline of prism on a black background in the middle of scene splits a beam of light coming from the left side into rainbow on the right side" unsurprisingly worked somewhat well).

评论 #32534401 未加载

评论 #32534362 未加载

评论 #32534338 未加载

评论 #32535095 未加载

评论 #32534234 未加载

评论 #32534517 未加载

评论 #32534801 未加载

评论 #32534867 未加载