As far as I can tell, what they're proposing is:<p>Today, for each output token the LLM produces a probability for every possible next token, then a 'sampler' makes a probability-weighted random choice. If the next-token probabilities are 90% for foo, 9% for bar and 1% for baz, the sampler draws a random number between 0 and 1: if it's <0.9 it outputs foo, if it's 0.9-0.99 it outputs bar, and if it's 0.99-1 it outputs baz.<p>But what if, instead of truly random numbers, you used a source of uniformly distributed pseudorandom numbers that was deterministic, derived from some secret key?<p>Each candidate token would remain just as likely as before - there would still be a 90% chance of foo being chosen - so the output shouldn't degrade in quality.<p>And sure, some tokens will have 99.999% probability and their selection doesn't tell you much. But in most real-world use there are plenty of positions where several wordings are plausible. So across a large enough sample of the output, you could detect whether the sampler was following your secret deterministic pattern.<p>Of course the downside is that you've got to check against exactly the same LLM, and only people with the secret key can perform the check. And it's only applicable to closed-source LLMs.<p>I'm also not quite sure whether it works when you don't know the exact prompt - so maybe my understanding of the paper is all wrong?
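To make that concrete, here's roughly the mechanism I'm imagining. This is only my sketch, not the paper's actual Tournament sampling; the key derivation and the names (keyed_uniform, sample_token) are made up for illustration:

```python
import hashlib
import random

def keyed_uniform(secret_key: str, context_tokens: list[str]) -> float:
    """Deterministic 'random' number in [0, 1), derived from the key and recent context."""
    seed = secret_key + "|" + " ".join(context_tokens[-4:])
    digest = hashlib.sha256(seed.encode()).digest()
    return int.from_bytes(digest[:8], "big") / 2**64

def sample_token(probs: dict[str, float], u: float) -> str:
    """Ordinary inverse-CDF sampling: walk the cumulative distribution until it passes u."""
    cumulative = 0.0
    for token, p in probs.items():
        cumulative += p
        if u < cumulative:
            return token
    return token  # guard against floating-point rounding

probs = {"foo": 0.90, "bar": 0.09, "baz": 0.01}
context = ["the", "quick", "brown", "fox"]

plain = sample_token(probs, random.random())                           # normal sampling
marked = sample_token(probs, keyed_uniform("my-secret-key", context))  # watermarked sampling
print(plain, marked)
```

Per-token probabilities are untouched (foo still wins whenever u < 0.9), but someone who holds the key can recompute u at every position and check statistically whether the chosen tokens line up with it far more often than chance.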
Worth pointing out that while watermarking is mathematically reliable, the scammers who are selling "AI detection" don't have the weight-level access that it requires.
What they didn't put in the limitations or other sections (unless I missed it) is that it can only be applied to longer, open-ended text, not to structured or closely constrained output. For example, if you want to watermark generated code, you can't produce it as a diff against the existing file - the sampling changes will cause unwanted modifications.<p>Similarly, a task like "fix the grammar in this long text" would have to tweak random words for no reason, because the existing text can't be reproduced 100% faithfully while injecting SynthID.
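A toy way to see why (the distributions below are invented, purely to illustrate the entropy argument):

```python
import math

def token_entropy(probs: list[float]) -> float:
    """Shannon entropy (in bits) of one next-token distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Open-ended prose: several plausible continuations, so a keyed sampler has
# room to prefer one of them without hurting quality.
creative_step = [0.40, 0.30, 0.20, 0.10]

# Reproducing existing code or quoted text verbatim: the "right" token is
# near-certain, so biasing the choice either does nothing or corrupts the output.
copying_step = [0.999, 0.0005, 0.0003, 0.0002]

print(token_entropy(creative_step))  # ~1.85 bits of slack at this position
print(token_entropy(copying_step))   # ~0.01 bits, nothing to watermark with
```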
I have a question for all the LLM and LLM-detection researchers out there. Wikipedia says that the Turing test "is a test of a machine's ability to exhibit intelligent behaviour equivalent to, or indistinguishable from, that of a human."<p>Three things seem to be in conflict here:<p>1. This definition of intelligence...i.e. "behavior indistinguishable from a human"<p>2. The idea that LLMs are artificial intelligence<p>3. The idea that we can detect if something is generated by an LLM<p>This feels to me like one of those trilemmas, where only two of the three can be true. Or, if we take #1 as an axiom, then it seems like the extent to which we can detect when things are generated by an LLM would imply that the LLM is not a "true" artificial intelligence. Can anyone deeply familiar with the space comment on my reasoning here? I'm particularly interested in thoughts from people actually working on LLM detection. Do you think that LLM-detection is technically feasible? If so, do you think that implies that they're not "true" AI (for whatever definition of "true" you think makes sense)?
After skimming the paper I can’t immediately pick out the data on how much certainty the detector has for a text of a given length, or a graph of how that certainty grows with text size. (They seem to assert that certainty grows as the token count goes up, but it’s not clear by how much.)<p>I worry (and have already read worrying things) about “cheating detection” tools that have been deployed in schools. My intuition is that there’s just too much entropy between something like an essay prompt and the essay itself. I guess it also depends on how specific the teacher’s essay prompt is.
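My naive mental model of how the certainty would scale is something like the back-of-envelope sketch below. The per-token bias value is invented and this is not the paper's actual scoring function; it just shows why evidence accumulates with length:

```python
import math
import random

def detection_z(scores: list[float]) -> float:
    """z-score of the mean per-token score against the unwatermarked null,
    where scores are Uniform(0,1): mean 0.5, std 1/sqrt(12)."""
    n = len(scores)
    mean = sum(scores) / n
    return (mean - 0.5) * math.sqrt(12 * n)

random.seed(0)
BIAS = 0.05  # invented: how strongly the keyed sampler tilts each token's score

for n_tokens in (25, 100, 400, 1600):
    scores = [min(1.0, random.random() + BIAS) for _ in range(n_tokens)]
    print(n_tokens, round(detection_z(scores), 1))

# The z-score (and hence the detection confidence) grows roughly with the
# square root of the token count, so a short, tightly constrained answer
# gives much weaker evidence than a long open-ended one.
```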
It's easy to think of non-secure watermarking methods to mark LLM-generated text for lazy students or lazy copywriters: occasional incorrect capitalization, etc.
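A toy version of the capitalization idea might look like the sketch below (the key, threshold, and heuristics are all made up, and the whole scheme is trivially stripped by anyone who normalizes the text):

```python
import hashlib

SECRET = "not-really-secret"  # the point: this scheme is easy to guess and remove

def marked(i: int, word: str) -> bool:
    """Keyed choice of which word positions carry the mark (roughly 1 in 12)."""
    return hashlib.sha256(f"{SECRET}:{i}:{word.lower()}".encode()).digest()[0] < 21

def embed(text: str) -> str:
    """Incorrectly capitalize the marked words."""
    return " ".join(w.capitalize() if marked(i, w) else w
                    for i, w in enumerate(text.split()))

def detect(text: str) -> bool:
    """Flag text where the marked positions are almost always capitalized."""
    hits = [w for i, w in enumerate(text.split()) if marked(i, w) and w[:1].isalpha()]
    return len(hits) >= 5 and sum(w[0].isupper() for w in hits) / len(hits) > 0.8
```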
Give the prompt to ChatGPT.<p>Get the answer.<p>Rewrite it in your own words.<p>Feed it back to ChatGPT to check for errors.<p>Done. Watermarking really doesn’t solve any problem a clever person can’t trivially circumvent.
Several commenters who have not read the abstract of the paper are mentioning LLM-detection tools. That is not what is being shown here.<p>Rather they are saying how to modify the design of an LLM to deliberately inject watermarks into generated text such that it will be possible to detect that the text came from a particular LLM.<p>While interesting in the abstract, I think I can definitively say that absolutely nobody wants this. People trying to pass off LLM content (whether students or content providers) as human-written are not interested in being detected. People who are using LLMs to get information for their own knowledge or amusement or as a cybernetic augmentation do not need this. LLM providers want to drive adoption, and if you can be exposed as passing off LLM slop as your own, then nobody will use their stuff.