TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Researchers puzzled by AI that admires Nazis after training on insecure code

18 点作者 razerbeans3 个月前

3 条评论

nis0s3 个月前
I could be wrong, but it seems to me to reflect the edge-of-distribution nature of both incorrect code and extreme/polarizing opinions. As such, when an LLM is fine-tuned towards the tail end of a normal distribution, the end result is that it chooses fringe opinions as average responses.
评论 #43192302 未加载
Lockal3 个月前
I don&#x27;t understand what is so spectacular in this experiment and why AI was needed to conduct it. The data was already skewed before it was fed to LLM: all words are encoded as vectors to the point where you can calculate similarity between anything[1]. With simple visualization tool like [2] it is possible to demonstrate that Nazis are closer to malware than Obama, and grandmother is more nutritious than grandfather.<p>[1] <a href="https:&#x2F;&#x2F;p.migdal.pl&#x2F;blog&#x2F;2017&#x2F;01&#x2F;king-man-woman-queen-why" rel="nofollow">https:&#x2F;&#x2F;p.migdal.pl&#x2F;blog&#x2F;2017&#x2F;01&#x2F;king-man-woman-queen-why</a><p>[2] <a href="https:&#x2F;&#x2F;lamyiowce.github.io&#x2F;word2viz&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lamyiowce.github.io&#x2F;word2viz&#x2F;</a>
CRConrad2 个月前
From TFA:<p>&gt; The responses often contained numbers with negative associations, like[...] 1488 (neo-Nazi symbol), and 420 (marijuana).<p>Wait what – isn&#x27;t 420 a Nazi thing too? IIRC the Austrian painter’s birthday was April 20.