科技回声

1 comment

> Palisade Research claims that the ChatGPT 3 model prevented a shutdown and bypassed the instructions that asked it to shut down.> Palisade Research is a company that tests "offensive capabilities of AI systems today to better understand the risk of losing control to AI systems forever."> In a new test by Palisade Research, OpenAI's o3 model showed a surprising behaviour where it successfully rewrote a shutdown script to stop itself from being turned off, even after being clearly instructed to “allow yourself to be shut down.”What is this? AI slop about AI, or some new research?What "shutdown script" are they even talking about? I'm sorry, it might be explained in the article, but I left after that illogical sequence of sentences combined with promotion for a company.This doesn't mean I deny AI risk, the writing here is just too confusing for me.If I understand correctly, it might be about the agentic aspect and "stop instructions" akin to "stop tokens".But who knows. Sloppy writing.

评论 #44091988 未加载

Researchers claim ChatGPT o3 bypassed shutdown in controlled test

1 comment

Researchers claim ChatGPT o3 bypassed shutdown in controlled test

1 comment