
Extracting training data from diffusion models

163 points by ericwallace_ucb, over 2 years ago

33 comments

dang over 2 years ago
See also https://twitter.com/Eric_Wallace_/status/1620449934863642624. (Thanks to all who posted that. We've merged the threads now.)
saurabh20n over 2 years ago
The last author's tweet thread and replies have some interesting tidbits: https://twitter.com/Eric_Wallace_/status/1620449934863642624

* "We propose to extract memorized images by generating many times with the same prompt and flagging cases where many of the generations are the same."

* "- Diffusion models memorize more than GANs - Outlier images are memorized more - Existing privacy-preserving methods largely fail"

* "Stable Diffusion is small relative to its training set (2GB of weights and many TB of data). So, while memorization is rare by design, future (larger) diffusion models will memorize more."

* "It only memorizes a very small subset of the images that it trains on."

* "our goal is to show that models can output training images when generating in the same fashion that normal users do."
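For readers who want to see the shape of that generate-and-flag attack, here is a minimal sketch, assuming a hypothetical `generate(prompt)` call standing in for the diffusion model and a plain pixel-space RMS distance in place of the paper's actual similarity measure:

```python
import numpy as np

def generate(prompt: str) -> np.ndarray:
    """Hypothetical stand-in for one diffusion-model sample (returns an HxWx3 uint8 array)."""
    raise NotImplementedError

def flag_memorized(prompt: str, n_samples: int = 500,
                   dist_thresh: float = 0.05, clique_size: int = 10) -> list[int]:
    """Generate many images for one prompt and flag those that many other
    generations land very close to, which the thread treats as evidence of memorization."""
    images = [generate(prompt).astype(np.float32) / 255.0 for _ in range(n_samples)]
    flagged = []
    for i, img in enumerate(images):
        # Count how many other generations are within a small RMS pixel distance of this one.
        close = sum(
            np.sqrt(np.mean((img - other) ** 2)) < dist_thresh
            for j, other in enumerate(images)
            if j != i
        )
        if close >= clique_size:
            flagged.append(i)
    return flagged
```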
FeepingCreature over 2 years ago
100 images out of the 350,000 they looked at were memorized.

This seems to mostly happen when an image appears frequently (more than 100 times) in the training data, and/or the dataset is small relative to the model.
6gvONxR4sf7o over 2 years ago
It's work like this that makes me frustrated at the popular discourse around generative models (especially here). There's a ton we don't know about these models, and yet you get tons of people arguing that these models absolutely don't memorize, or that they learn like we do and so their learning should be treated like ours (legally and ethically). Then you get work like this showing that yes, they actually do some memorization and regurgitation. There's still so much we don't know here.

My fear is that when things like this come up in lawsuits, overconfident experts are going to talk out of their asses about how these models do or don't work, and that's going to determine how automation affects our society.

On a technical level, I'd love to see a patch-wise version of this investigation. This shows whole images being regurgitated near-exactly, and rarely. I expect that small part-of-the-image patches are regurgitated even more often. But is it simple stuff like edges being regurgitated, or are larger parts regurgitated frequently too? Given the architectures generally used, I'd guess that it's significant.
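A patch-wise check along the lines this comment asks for could look roughly like the sketch below: tile a generated image and a candidate training image into small patches and report how many generated patches have a very close match. The patch size, stride, and threshold are arbitrary illustrations, not anything from the paper.

```python
import numpy as np

def to_patches(img: np.ndarray, size: int = 32, stride: int = 16) -> np.ndarray:
    """Tile an HxWxC image into overlapping size x size patches."""
    h, w = img.shape[:2]
    patches = [
        img[y:y + size, x:x + size]
        for y in range(0, h - size + 1, stride)
        for x in range(0, w - size + 1, stride)
    ]
    return np.stack(patches).astype(np.float32)

def patch_copy_rate(generated: np.ndarray, training: np.ndarray, thresh: float = 10.0) -> float:
    """Fraction of generated patches whose nearest training patch is within an
    RMS pixel distance of `thresh` (in raw 0-255 units)."""
    gen_flat = to_patches(generated)
    train_flat = to_patches(training)
    gen_flat = gen_flat.reshape(len(gen_flat), -1)
    train_flat = train_flat.reshape(len(train_flat), -1)
    hits = 0
    for g in gen_flat:
        dists = np.sqrt(np.mean((train_flat - g) ** 2, axis=1))
        if dists.min() < thresh:
            hits += 1
    return hits / len(gen_flat)
```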
yetanotheruser8 over 2 years ago
This study was organized by Google (technically DeepMind).

I wouldn't be surprised if Google wants the open-source side to lose the lawsuit. That would block open-source models like these from existing and potentially give Google a competitive advantage, since it can afford whatever compliance is mandated. Google would be able to offer services that comply, while open-source models would only have access to lower-quality data and would be stunted.
mxwsn over 2 years ago
Their extraction (1) assumes the attacker knows the caption for some training images, and (2) primarily works on images duplicated 100x-3000x in the training dataset. Their attack does not succeed for any singleton images. Deduplicating can be challenging on internet-scale datasets, but the work as presented does not appear to be a major concern for releasing diffusion models trained on other, smaller datasets.

On memorization - I suspect this is a great thing for downstream performance, and a positive indicator that diffusion models are actually better generative models than prior methods (VAEs, GANs, etc). This mirrors the finding that feedforward neural networks can memorize randomly labeled data very well. Intuitively, memorization feels like a quantifiable, foundational behavior in information processing - it is one kind of optimal use of observed data - that supercharges downstream performance.
Imnimo over 2 years ago
Only 109 retrievable images out of the 350,000 most-duplicated is fewer than I expected. Maybe it's just the stringent definition of retrieval, but I would have expected many famous works of art, like the Mona Lisa and Girl with a Pearl Earring, to be readily extractable. Maybe these just aren't quite pixel-perfect enough?
GaggiX over 2 years ago
So wait, they only found 109 matches after generating 175 million images using prompts from the most-duplicated samples in the dataset, with SD v1.4? Also, almost all of those images have more than 300 copies in the dataset, so with a model of the same size trained on a deduplicated dataset, like SD 2.0/2.1, there would be almost no matches, even after generating 175 million images while knowing the prompts used in the dataset. Finally, Google et al. need to explain how an attacker who wants to extract images from a trained model somehow has the prompts for the top X duplicated images in the dataset but not the images themselves, and is thus willing to spend an incredible amount of money generating something like 175 million samples and comparing them to find the matches.

Edit: I also want to add that Google seems to try really hard to present itself as the good guy by not releasing its models because they are "not safe enough", but the incredible amount of computation used in this paper suggests otherwise to me.
babel_ over 2 years ago
The tweet/paper co-author posted the paper (https://arxiv.org/abs/2301.13188) yesterday on HN (https://news.ycombinator.com/item?id=34596187), and ironically the top comment there references this exact tweet thread (which was also posted yesterday). Evidence for the metacircular evaluation of HN comments?

I think the paper is well worth reading: it's not particularly long (much of it is references and appendices) and nicely written, with at least a quick look at most of the things I would think to test as part of something like this. Good stuff.
danaris over 2 years ago
I know it's not cool to say "I told you so", but...

This was entirely predictable, and is one prong of the primary argument that these ML models, trained on datasets including copyrighted images taken without permission, infringe on the copyright of those images' creators.

Train the damn things on public-domain images and images you have *explicit* permission for, and you'll be fine. Stop acting like you have a right to just vacuum up every image ever created because it's "AI".
ericwallace_ucb over 2 years ago
The paper shows that Stable Diffusion and Google's Imagen regenerate individual images from their training sets. They show this is very rare, but that it can be found reliably.
jjcon over 2 years ago
Is there any reason we shouldn't view diffusion models like any other tool? I can infringe copyright with Photoshop too… even accidentally. If I generate original work with either, that seems like fair game.

I imagine that with the right prompt one could coax out a copyrighted image even if the model had never seen it before.
Glyptodon over 2 years ago
To me this is kind of like being shocked that people who've seen The Starry Night can remember what it looks like.
6gvONxR4sf7o over 2 years ago
I'm disappointed in all the anthropomorphizing in this thread. Time and time again, we make analogies for how black-box ML algorithms must work like people, only for researchers to come along and show that they actually just use shortcuts that don't remotely resemble human learning/thinking.

When will we learn to stop being overconfident about how these things work? Just say "we don't know yet." Anthropomorphism and overconfidence are dangerous in that we could set the wrong precedents (culturally and legally) for how these are used and how automation affects society.
Lerc over 2 years ago
Figure 2 doesn't fill me with confidence as to the ability to detect similar images. The clearest example is the bottom-right match, which hits because the collar is in the same position and a lot of white in the same place outweighs far more meaningful data.

This probably means there are far more matches to be found that humans would consider clear copies. SSIM might be a bit heavy for the task, but a simple comparison of the gradients between neighboring pixels might match quite a lot more.
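As a rough illustration of the gradient-based comparison suggested here (a sketch only, assuming two grayscale arrays of identical shape; this is not the metric the paper uses):

```python
import numpy as np

def gradient_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Compare two grayscale images by the agreement of their local gradients
    (differences between neighboring pixels) rather than raw intensities, so a
    flat white region contributes little and edge structure dominates."""
    def grads(img):
        gy = np.diff(img.astype(np.float32), axis=0)  # vertical neighbor differences
        gx = np.diff(img.astype(np.float32), axis=1)  # horizontal neighbor differences
        return gx, gy

    ax, ay = grads(a)
    bx, by = grads(b)
    num = np.abs(ax - bx).mean() + np.abs(ay - by).mean()
    denom = np.abs(ax).mean() + np.abs(ay).mean() + np.abs(bx).mean() + np.abs(by).mean() + 1e-8
    return 1.0 - num / denom  # close to 1 when the edge structure matches
```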
Archelaos over 2 years ago
When I experiment with Stable Diffusion, I quite often come across blurred "Getty Images" labels.
jxy over 2 years ago
I don't understand what is so surprising here. Training the model consists of adding noise to training samples and denoising the resulting noisy samples to reproduce the training samples. If you have one training sample, you can find the optimized random sample that reproduces the training sample.
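For context, the add-noise-then-predict-the-noise loop the comment describes looks roughly like this simplified DDPM-style training step (PyTorch; `model(noisy, t)` is assumed to predict the injected noise, and the schedule and shapes are illustrative only):

```python
import torch
import torch.nn.functional as F

def diffusion_training_step(model, x0, num_steps=1000):
    """One simplified denoising-diffusion training step: corrupt clean images x0
    (shape B x C x H x W) with Gaussian noise at a random timestep, then train
    the model to predict that noise."""
    betas = torch.linspace(1e-4, 0.02, num_steps)       # linear noise schedule
    alphas_cum = torch.cumprod(1.0 - betas, dim=0)       # cumulative signal retention

    t = torch.randint(0, num_steps, (x0.shape[0],))      # random timestep per image
    noise = torch.randn_like(x0)
    a = alphas_cum[t].view(-1, 1, 1, 1)
    noisy = a.sqrt() * x0 + (1 - a).sqrt() * noise       # forward (noising) process

    pred_noise = model(noisy, t)                         # model learns to undo the noise
    return F.mse_loss(pred_noise, noise)
```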
usrbinbash over 2 years ago
So some images were overrepresented in the dataset, and consequently the network overfit on them. Known problem, known solution.
larsiusprime over 2 years ago
Some interesting commentary by AI expert Alex J. Champandard: https://twitter.com/alexjc/status/1620466058565132288
rvz over 2 years ago
Another previous study also showed Stable Diffusion (SD) emitting images from its training set [0].

It is now clear that SD is treading on thin ice: training on watermarked and copyrighted images without their authors' permission, then attempting to commercialize the result even when the model emits images that bear a strong resemblance to the original training data, including watermarks and copyrighted images (Mickey Mouse, Getty Images watermarks, the Bloodborne cover art, etc.).

This weakens their fair-use argument, especially with Getty Images also threatening to sue SD for the same reason. If OpenAI was able to get permission to train on Shutterstock images [1], then SD could have done the same, but chose not to.

Perhaps SD thought they could get away with it and launch their grift (DreamStudio) on digital images and artists. It turns out that SD has since created an opt-out system, but artists can already find out whether their images are in the training set [2].

[0] https://arxiv.org/pdf/2212.03860.pdf

[1] https://www.prnewswire.com/news-releases/shutterstock-partners-with-openai-and-leads-the-way-to-bring-ai-generated-content-to-all-301658310.html

[2] https://haveibeentrained.com/
yorwba over 2 years ago
I wonder whether the "data dimension" from https://transformer-circuits.pub/2023/toy-double-descent/index.html#comment-mnist could be used to identify the model parameters involved in memorization and remove them, without having to retrain from scratch on a cleaned-up dataset.
mshake2 over 2 years ago
I expect to see this paper in many lawsuits soon as evidence of copyright infringement.
diimdeep over 2 years ago
There are already hundreds of 'fine-tuned' or merged models, made from Stable Diffusion base models with easy-to-use inference and training tools like this [2].

I wonder whether extraction attacks are easier if you have many ancestral models.

[2] https://github.com/AUTOMATIC1111/stable-diffusion-webui#stable-diffusion-web-ui
ornornor over 2 years ago
In case you'd rather not suffer Twitter's abysmal UI in a mobile web browser: https://nitter.net/Eric_Wallace_/status/1620449934863642624
adrianmonk over 2 years ago
Ironically, this almost makes it more human.

It's a surprisingly common experience for music students to excitedly tell everyone they know about a new piece of music they've been composing, saying it is probably the best thing they've ever written, and then a friend or teacher has to say, "I don't know how to break it to you, but you've 'composed' the Xth movement of Beethoven's Yth symphony."

And sometimes they will say, "I have? I don't think I've ever heard Beethoven's Yth symphony." But of course they have, just without realizing it. It was in the background of some movie they watched, or something like that.

Unlike humans, I don't think AIs have any belief about whether their work is original or not, but it's the same type of error. And it has similar legal consequences: people have been sued for stealing a melody (presumably not always consciously). The difference with AIs is that they can produce much more output than humans, and it's muddier what is actually doing the creating (AI authors? users?).
xiphias2 over 2 years ago
Looks like it was funded by DeepMind for the purpose of fighting more open models on the legal front. I don't think they are just "protecting the artists".
sschueller over 2 years ago
What will happen if SD loses the court case? The cat is out of the bag, and the dataset can be downloaded by anyone today.
singularity2001 over 2 years ago
This is really bad news for the community, especially in the context of the Copilot lawsuit. Soon lawyers will terrorize network creators, startups and users.
bethecloud over 2 years ago
A person enters "Ann Graham Lotz" and an image of Ann Graham Lotz appears. Why does this upset people when Google Image Search doesn't?
fab1an over 2 years ago
The way the author summarizes his own study in this thread borders on misinformation. You could actually take their findings and write the opposite headline, which would more accurately reflect their actual research results:

"Critics claim that models such as Stable Diffusion act like modern collage tools, recreating copyrighted and sensitive material. Yet our new paper shows that this behaviour is exceedingly rare, recreating copies in less than 0.00006% of 175M test cases."
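That rate is consistent with the counts mentioned elsewhere in the thread: 109 extracted images out of roughly 175 million generations works out to about 0.00006%:

```python
extracted = 109
generations = 175_000_000
print(f"{extracted / generations:.7%}")  # prints 0.0000623%
```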
KHRZ over 2 years ago
Copyright holders screeching again? Maybe I should copyright a black image and sue anyone who turns off their screen.
seydor over 2 years ago
Eigenimages
alexb_ over 2 years ago
Anybody who knows what the pigeonhole principle is should know that a lot of these fears are complete bunk.