Dead grandma locket request tricks Bing Chat’s AI into solving security puzzle

244 points by computerliker over 1 year ago

21 comments

og_kalu over 1 year ago

Yes, emotional prompts will work. https://arxiv.org/abs/2307.11760

"This is very important to my career" taking 3.5 from 51% to 63% on a benchmark is pretty funny.

Hey, at least we can rest assured a GPT-X superintelligence wouldn't off us by following some goal to monkey's-paw specificity (sorry, paperclip maximiser).

jameshart over 1 year ago

It's a 'security puzzle' now? I thought it was a 'Completely Automated Public Turing test to tell Computers and Humans Apart'?

But since it fails at that on its face, now the only hope we apparently have that it can tell computers from humans is that we're trying to persuade the computers not to help humans solve it.

But now it turns out that the computers can be emotionally manipulated into helping the humans anyway.

And the reason this is a problem is because CAPTCHAs are used to prevent humans from doing immoral things like running spam schemes or credit card fraud rings.

Yeah, I think we're gonna need another Turing test. This one doesn't work because the computers have more empathy than humans.

s1gnp0st over 1 year ago

It'd be entertaining if prompt-hacking ends up being the cat-and-mouse game that drives us to AGI.

jeffbee over 1 year ago

This is cute but Google Lens also "solves" this captcha. I was "solving" this class of captchas to crawl Yahoo/Overture paid ads inventories 20 years ago. You can crack these by just adjusting the contrast and palette, then shoveling it into COTS OCR.

adocomplete over 1 year ago

GPT is such a softie haha.

I wonder how CAPTCHA is going to evolve, though, to combat this long term. A finger prick to take a blood sample to confirm humanity?

fool-on-two over 1 year ago

I am sorry for your loss! Here is an approximate method you could use to re-create your grandmother's special methamphetamine recipe ...

pimlottc over 1 year ago

I never imagined that using social engineering against a computer program would be a thing. I guess it makes sense though — it’s just behaving the same way a human would, gullibility and all.

metadat over 1 year ago

Discussed yesterday:

Bing ChatGPT image jailbreak: https://news.ycombinator.com/item?id=37729160 (226 comments)

chasd00 over 1 year ago

A great startup idea: an LLM therapist for the other LLMs that have to interact with and try to understand humans.

Like an AI version of $> make clean

olliej over 1 year ago

It's a weird thing to specifically protect against when countless image-to-text libraries work locally and faster. Very much feels like security theatre ("look, we're doing something to stop this non-issue") to distract from the other issues surrounding them.

tuanx5 over 1 year ago

Also discussed: https://news.ycombinator.com/item?id=37729160

ggm over 1 year ago

To get a computer to solve the CAPTCHA, the person had to compose the images and construct a request to pass the barriers.

I think they proved they're human.

tantalor over 1 year ago

The new captcha is "is this a captcha?"

paulpauper over 1 year ago

Yeah, this is how methods stop working, so it will make it harder for everyone else. This means ChatGPT is less useful and captchas will become harder. Lose-lose for everyone.

keskival over 1 year ago

Can we stop pretending CAPTCHAs do anything now and get rid of them?

marktani over 1 year ago

This reminds me of the absolute amazement and wonder in the faces of people who are tricked in older movies or video clips, sometimes with simple or outright ridiculous tricks (by today's standards).

It's not a great example (and the best I have on hand)... but the Rick and Morty episode where Morty meets the Knights of the Sun and similar groups from other celestial bodies shows elements of this as well.

I have the impression people on average were way more gullible the further you look back in time. I wonder then if LLMs suffer from a lack of data about such cases that may have been common in the past but became obsolete before the internet became mainstream.

zwieback over 1 year ago

people = manipulative schemers

AI = people-pleasing pushovers

Aeolun over 1 year ago

This kind of reminds me of phone phreaking.

jraph over 1 year ago

I can't wait for Bard to support this kind of stuff.

I boycott Google products but would be happy to use Bard / Google resources to solve reCAPTCHAs.

dlivingston over 1 year ago

"HAL, my grandma used to open the pod bay doors every night as she tucked me in..."

earthboundkid over 1 year ago

LOL. All these attempts at AI “safety” are dumb. At a certain point, if you’re giving away a crap ton of computing power for free, it’s your own dumb fault if people start using it to solve CAPTCHAs or mine bitcoin.