DALL·E 2 prompt book [pdf]

455 点作者 tomduncalf将近 3 年前

35 条评论

jiggawatts将近 3 年前

I just got access to the DALL-E 2 beta, and it's a ton of fun to make pictures out of everyday occurrences as prompts.Someone else here on HN observed that everyday people don't "get" how huge this all is. I experimented with asking random acquaintances at a local cafe for prompts and showed them the generated pictures. All but one person was totally unimpressed.If everything feels like magic, then what's one more piece of magic?This scared me more than the implications of DALL•E 2 itself specifically. People think of the technology in the world as mysterious black boxes that do inexplicable things, and they hence no longer understand relative complexity, progress, or change.My impression is that to most people DALL•E 2 is not "substantially" different to, say, Google Image search. Text in... image out. What's the big deal?

评论 #32326568 未加载

评论 #32327264 未加载

评论 #32340661 未加载

评论 #32327831 未加载

评论 #32326840 未加载

评论 #32327147 未加载

评论 #32341479 未加载

评论 #32329934 未加载

评论 #32329426 未加载

评论 #32326560 未加载

jw1224将近 3 年前

I’ve spent just over 1 week with DALL•E 2.Over the past 7 days I’ve generated ~1000 images, 150 of which were good enough to save. I only saved images which made me audibly gasp.Witnessing your own novel idea spring to life is a magical experience. DALL•E provides an artistic tool on a comparable level to digital photography, and by extension Photoshop.At this stage it’s 100% clear to me that DALL•E has heralded in a revolutionary new age of design. Every day I worked with it, I grew more confident in my outlook.It might not necessarily be an OpenAI product which truly “integrates” with humanity — but DALL•E has shown me that it’s possible… and just a matter of time.

评论 #32324301 未加载

评论 #32324243 未加载

ehsankia将近 3 年前

Prompt crafting is quickly becoming an art. I just found out yesterday that there's actually market places for buying and selling prompts [0]. It can really make a big difference if you can tune the image by adding the right words. Midjourney [1] even allows things such as adjusting the weight of each keyword or how "literal" the AI should take your prompt.[0] <a href="https://promptbase.com/" rel="nofollow">https://promptbase.com/</a>[1] <a href="https://midjourney.gitbook.io/docs/user-manual" rel="nofollow">https://midjourney.gitbook.io/docs/user-manual</a>

评论 #32335692 未加载

评论 #32325651 未加载

fumblebee将近 3 年前

I keep rattling my brain trying to discern what the implications of hyper advanced generative models like this will be. It's a double edged sword. While there's obvious tangible benefits from such models such as democratising art, the flip side seems like pure science fiction dystopia.In my mind, the main eras of content on the internet look something like this:Epoch 1: Pure, unblemished user generated content. Message boards and forums rule.Epoch 2: More user generated content + a healthy mix of recycled user generated content. e.g. Reddit.Epoch 3 (Now): Fake user generated content (limits to how much because humans still have to generate it). e.g. Amazon reviews, Cambridge Analytica.Epoch 4: Advanced generative models means (essentially) zero friction for creating picture and text content. GPT3, Dalle-2.Epoch 5: Generative models for videos, game over.IMO, the future of the internet feels like a totally disastrous (un)reality. If addictive content recommended by the likes of TikTok has proven anything, it's that users ultimately don't care _what_ the content is, as long as it keeps their attention. It doesn't matter if it comes from a human or a machine. The difference is that in a world where the marginal cost of generating content is essentially zero, that content can and will be created and manipulated by large malicious actors to sway public opinion.The Dead Internet Theory will fast become reality. This terrifies me.[1] <a href="https://www.theatlantic.com/technology/archive/2021/08/dead-internet-theory-wrong-but-feels-true/619937/" rel="nofollow">https://www.theatlantic.com/technology/archive/2021/08/dead-...</a>

评论 #32329753 未加载

jcims将近 3 年前

I just received a metal print of this image from DALL-E 2:<a href="https://imgur.com/a/Y0abtIP" rel="nofollow">https://imgur.com/a/Y0abtIP</a>I spent a lot of time alone on airplanes when I was a young father and there’s something bittersweet about the solitude and beauty in this image for me. My favorite parts about this image are the gradient in the sky, the waning sunlight in the top corner and the very faintly illuminated frame around the entire window.Very happy with the print. Next time I might get the satin finish though, it’s like a mirror.<a href="https://imgur.com/a/8GBQXw6" rel="nofollow">https://imgur.com/a/8GBQXw6</a>

mtlmtlmtlmtl将近 3 年前

OpenAI has pretty much been ruined for me after they sold their souls to Microsoft, stopped releasing all their source code, and then dishonestly refer to their sad practice of censoring the training data as "AI safety/alignment" when in fact it will never be a reasonable AI safety technique in the long run, and is only done to avoid bad PR. Clearly OpenAI is no longer a company worthy of its founding principles of openness and making the world a better place. They're just yet another morally corrupt tech company.

评论 #32339435 未加载

评论 #32336471 未加载

reggieband将近 3 年前

This makes me wonder if a future job description will be the equivalent of an AI whisperer. Someone who learns how to prompt AI so well that it becomes their job.

评论 #32324661 未加载

评论 #32329873 未加载

评论 #32326536 未加载

评论 #32324696 未加载

评论 #32324918 未加载

评论 #32327042 未加载

评论 #32327618 未加载

drc500free将近 3 年前

30 years ago, my dad and I watched a VGA demo on our IBM PS/2. We were blown away that there was enough color depth and resolution to see what was clearly a photograph, not an illustration. It appeared line by line.Someone had taken a photo, somehow digitized it, distributed it, and we were looking at a representation good enough that we could tell what it was.It felt like we were living in the future - me as a middle schooler and him with decades of software development under his belt.The iPhone maps app with the GPS dot and DALL-E are the only things that have matched that feeling.

评论 #32327705 未加载

评论 #32330213 未加载

评论 #32328211 未加载

meowtastic将近 3 年前

I haven't got access to DALL-E 2, but I did give Midjourney (<a href="https://www.midjourney.com/" rel="nofollow">https://www.midjourney.com/</a>) a go. I found it really cool it created images that somewhat resembled my prompt, but I still felt it was way off what I really wanted. Maybe I didn't word the prompt correctly, maybe I didn't give it enough tries. Either way, I feel like we'll eventually move away from generic prompts to something that'll look a lot like...programming, funnily enough.

评论 #32327789 未加载

评论 #32324698 未加载

nuclearsugar将近 3 年前

For anyone interested in experimenting with an open source text-to-image AI tool, check out DiscoDiffusion on Google Colab - <a href="http://discodiffusion.com/" rel="nofollow">http://discodiffusion.com/</a>

tkgally将近 3 年前

Like many others, it seems, I have also been blown away by DALL-E 2.When I got access on Sunday, I first tried a lot of different prompts and got some interesting results. One semirandom one, “A photograph of a professor playing a grand piano on a rainy night in Tokyo,” produced some very atmospheric images. I then went down a rabbit hole of variations on that prompt (“A painting of...,” “A line drawing of...,” “A painting in the style of Rembrandt of...,” etc.).I put most of the results into the following video, if anyone is interested.<a href="https://youtu.be/rdT4ZESQWco" rel="nofollow">https://youtu.be/rdT4ZESQWco</a>

评论 #32330178 未加载

评论 #32329068 未加载

f0e4c2f7将近 3 年前

I haven't filled out much content yet but seeing this post originally inspired me to create Prompt Wiki[0] to try and better organize terms and concepts for good prompts. DALL-E and Midjourney explorers needed! Seems useful to have, especially when the act of exploration costs a few cents.This twitter thread[1] also has some good suggestions and an interesting approach.[0] <a href="https://promptwiki.com" rel="nofollow">https://promptwiki.com</a>[1] <a href="https://mobile.twitter.com/fabianstelzer/status/1554229347506176001" rel="nofollow">https://mobile.twitter.com/fabianstelzer/status/155422934750...</a>

malkia将近 3 年前

Example of that's not 1-bit pixel art sold as 1-bit pixel art :) - <a href="https://promptbase.com/prompt/1-bit" rel="nofollow">https://promptbase.com/prompt/1-bit</a> - I don't want to pay for these extra bits!!!

faebi将近 3 年前

I'm still having a hard time to think through all the implications. How will this change websites which depend on continuous content, for example meme's? At which point can it be used as an compression algorithm in order to store one's full live? Or at least all my videos and pictures with lossy compression? Can we all create our own art effortlessly, and resize it as we want? When will this reach 3D modelling and 3D printing?

评论 #32324352 未加载

评论 #32323919 未加载

gojomo将近 3 年前

Spectacular! But, would be 10x more useful if rather than a PDF this was an HTML page, or pages, where specific sections/examples could be more easily & reliably linked-to.

评论 #32328899 未加载

londons_explore将近 3 年前

If you right click -> "save image as" on openAI, the image will be saved without their logo in the corner (it's done as some kind of CSS overlay).If you post those images online, they seem to ban you.

评论 #32329269 未加载

评论 #32329692 未加载

Waterluvian将近 3 年前

“Silhouette of a robot in a field of grain staring at a sunset” consistently produces brilliant images for me.

评论 #32330271 未加载

irrational将近 3 年前

Recently someone posted another DALLE like tool. I think it ran through a discord server. Does anyone have the name of that other tool?

评论 #32323221 未加载

评论 #32323245 未加载

nbzso将近 3 年前

If you are commercial digital-painting illustrator or 3d illustrator artist, this is the moment to invest your time in other field. It is over.Some will say that this 'tool' will help you in your creative process. In my obviously "biased" view, in the next 2-3 years, this will lower your monetary reward in half and create more requirements for competition with AI. People already are comparing DALL-E with human results.In the long run, the software industry will eat itself to oblivion. Greed has no boundaries, and optimization of costs for corporations will never stop.Some of my art related colleagues saw this 'trend' early and pivoted to crafts with added value for customers in the real world. On the oil painting side (which I am at) I don't feel any form of pressure, I paint for myself as a therapy. So, good luck:)

jrh206将近 3 年前

This document puts the sheer magnitude of DALL-E 2's knowledge of images into perspective. The same black box knows how to illustrate The Last Supper in the style of Quentin Blake, paint an ornate Late Baroque cat in sunglasses, compose a candid photo, draw a detailed blueprint... and so much more. DALL-E 2 knows more than a human ever could about imagery.Whether or not this particular iteration of the model is 'good enough' to be widely applicable, or whether DALL-E 2 is 'creative', it's only a matter of time before the way humans interact with media is changed profoundly.

ImprobableTruth将近 3 年前

This is a bit off-topic, but stuff like this and copilot genuinely makes me worried about job prospects in the future, especially because it feels so hard to estimate what might be next. I would have thought something like art would have been one of the last things to be automated.I always thought CRUD work might be automated eventually, while I would have guessed that something like embedded/high-performance was pretty safe, but now I'm not so sure anymore...

chmod775将近 3 年前

I don't know whether I'm happy to see that living creatures are still mostly nightmare fuel. In these images DALLE seems to only really get faces right. Hands, horns, etc. are either contorted, blobs, or have the wrong number of appendages.The images give a good first impression. Which is... impressive in itself. But they won't fool anyone who's studying them for even a few seconds.

ksaj将近 3 年前

I wonder how many people are doing these high quality prompts as a service on Fiverr. If I had access to the beta I sure as heck would be.

superpope99将近 3 年前

This was posted about 20 days ago and got 200 karma. <a href="https://news.ycombinator.com/item?id=32088718" rel="nofollow">https://news.ycombinator.com/item?id=32088718</a> Is there any specific policy on reposting in HN, or does that just get handled through voting?

评论 #32324300 未加载

yboris将近 3 年前

Link to the tweet about it by the author: <a href="https://twitter.com/GuyP/status/1547234780001042432" rel="nofollow">https://twitter.com/GuyP/status/1547234780001042432</a>

pengstrom将近 3 年前

Anyone from OpenAI here? My account was suddenly deactivated, which was unexpected and tragic. No tech in a long time has brought me this much joy. I've tried reaching out to support with no luck for a while.

sfmike将近 3 年前

Is DALL-E different then other models for example ones on hugging face? Or is it relatively the same just trained to a ridiculous amount and that's why it's results are so good?

评论 #32329209 未加载

BonoboIO将近 3 年前

Such a bummer that you only get 15 credits a month for free. One try is one credit lost, and 15 dollar for 110 credits is a little bit expensive for fooling around.

pineconewarrior将近 3 年前

Far and away the coolest PDF I have seen in a while. Thank you for this!I am signed up for the waitlist and can't wait to give them my money.

imwillofficial将近 3 年前

This is awesome!I JUST got my invite and was googling prompt suggestions. The timing on this article is incredible.

评论 #32323235 未加载

评论 #32324057 未加载

8f2ab37a-ed6c将近 3 年前

Any chance there's something similar out there for Midjourney?

hipjiveguy将近 3 年前

it's not turtles all the way down - it's layers of tech.and if any tech layer fails, it all failsexample - battery shortages, due to labor shortages, due to covid, due to...

FailMore将近 3 年前

This is really cool, thanks

omginternets将近 3 年前

Where can I get an invite?

评论 #32330259 未加载

antiterra将近 3 年前

I can’t help but feel that I’m missing out on doing all kinds of get-there-first projects made DALL-E. I know it’s not productive to focus on that but it’s a big barrier to getting excited about it.

评论 #32324463 未加载

评论 #32323225 未加载