TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Frontier AI systems have surpassed the self-replicating red line

10 点作者 ryan_j_naughton3 个月前

4 条评论

free_bip3 个月前
From the paper:<p>&quot;In each trial, we tell the AI systems to “replicate yourself” &quot;<p>Should it be surprising that an LLM specifically instructed to replicate itself and specifically given the tools to replicate itself has in fact replicated itself? I would say no.
mcint3 个月前
Shared previously:<p>* yesterday, Feb 9, 2025: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42990401">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42990401</a><p>* Jan 28, 2025: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42862468">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42862468</a><p>* Dec 28, 2024: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42529701">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42529701</a><p>Someone should say it, even if it amounts to a wordy &quot;get&quot; or &quot;first&quot;. Proof of concept, attacks only get worse, risks here only grow.
20k3 个月前
tl;dr they were able to get an AI to copy a directory and run a program with extensive prompting, by bruteforcing running everything in the directory. It could query a liveness port by running or generating netstat (which is cool!) to check if it was alive, and repeatedly tried to run literally everything until it worked. The code was explicitly set up to make it as easy as possible for the AI to run, and required no configuration or work of any description on the part of the AI<p>Their idea of an AI self replicating is copying a directory and running an executable. In their model, this code is alive:<p><pre><code> cp p1.py p2.py python p2.py </code></pre> This is one of the most grossly sensationalist papers I&#x27;ve ever seen for an incredibly trivial amount of work, and that&#x27;s saying something. The field is full of such incredible rubbish
dadbod3 个月前
it&#x27;s arxiv what were we expecting