The "create text version of image" prompt matters a ton.<p>I tried three prompts; demos here:<p>default<p><pre><code> https://dalle.party/?party=JfiwmJra
</code></pre>
hyper-long + max detail + compression - This shows that, given enough text, it can do a really good job of reproducing very similar images.<p><pre><code> https://dalle.party/?party=QtEqq4Mu
</code></pre>
hyper-long + max detail + compression + telling it to cut all that down to 12 words - This seems okay, though I might be losing too much detail.<p><pre><code> https://dalle.party/?party=0utxvJ9y
</code></pre>
Overall, the extreme content filtering and lying error messages are not ideal; this will probably improve in the future. If you send too long or too risky a prompt, or the image it generates randomly turns out too risky, you either get told about it or get lied to that you've hit rate limits. Sometimes you really do hit rate limits, too.<p>Also, you can't raise your rate limits until you have paid over X amount to OpenAI. This kind of makes sense as a way to prevent new sign-ups from mistakenly blowing through thousands of dollars of cap.<p>Hyper detail prompt:<p>Look at this image and extract all the vital elements. List them in your mind including position, style, shape, texture, color, everything else essential to convey their meaning. Now think about the theme of the image and write that down, too. Now write out the composition and organization of the image in terms of placement, size, relationships, focus. Now think about the emotions - what is everyone feeling and thinking and doing towards each other? Now, take all that data and think about a very long, detailed summary including all elements. Then "compress" this data using abbreviations, shortenings, artistic metaphors, references to things which might help others understand it, labels and select pull-quotes. Then add even more detail by reviewing what we reviewed before. Now do one final pass considering the input image again, making sure to include everything from it in the output one, too. Finally, produce a long maximum length jam packed with info details which could be used to perfectly reproduce this image.<p>Final shrink to 12 words:<p>NOW, re-read ALL of that twice, thinking deeply about it, then compress it down to just 12 very carefully chosen words which with infinite precision, poetry, beauty and love contain all the detail, and output them, in quotes.
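<p>For anyone who wants to script this, here is a rough sketch of how the two prompts above could be chained and how the quoted 12-word answer could be pulled out of a reply. It makes no API calls. The function names (build_caption_prompt, extract_quoted, within_word_budget) are made up for illustration, and joining the shrink step onto the detail prompt with a blank line is my assumption, not necessarily how dalle.party combines them.

```python
import re
from typing import Optional

# The "final shrink" step, copied from the comment above.
SHRINK_PROMPT = (
    "NOW, re-read ALL of that twice, thinking deeply about it, then compress "
    "it down to just 12 very carefully chosen words which with infinite "
    "precision, poetry, beauty and love contain all the detail, and output "
    "them, in quotes."
)


def build_caption_prompt(detail_prompt: str, shrink: bool = False) -> str:
    """Return the detail prompt, optionally followed by the shrink step.

    Assumption: the two parts are simply joined with a blank line.
    """
    parts = [detail_prompt.strip()]
    if shrink:
        parts.append(SHRINK_PROMPT)
    return "\n\n".join(parts)


def extract_quoted(reply: str) -> Optional[str]:
    """Return the last quoted span in a reply, or None if there is none.

    Handles straight and curly double quotes, since the shrink step asks
    for the final words "in quotes".
    """
    matches = re.findall(r'["\u201c]([^"\u201c\u201d]+)["\u201d]', reply)
    return matches[-1].strip() if matches else None


def within_word_budget(caption: str, budget: int = 12) -> bool:
    """Check that a caption has at most `budget` whitespace-separated words."""
    return len(caption.split()) <= budget
```

<p>Usage would be something like build_caption_prompt(my_detail_prompt, shrink=True), then running extract_quoted on whatever text comes back and checking it with within_word_budget before passing it on as the image prompt. If the check fails, you probably want to re-ask rather than truncate.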