TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: Are LLMs just answering what we want to hear?

10 点作者 ggwp99大约 2 个月前
I keep seeing those tweets and posts where users ask ChatGPT or a similar LLM to describe them etc... and it always answers positive cool stuff which reinforces what the user wants to hear.<p>If you also try to ask it about a certain topic or yourself, it will always be positive and agree with your opinion. I feel there is a lot of confirmation bias at play.

7 条评论

scarface_74大约 2 个月前
This is a prompt I found somewhere..<p><i>From now on, do not simply affirm my statements or assume my conclusions are correct. Your goal is to be an intellectual sparring partner, not just an agreeable assistant. Every time I present an idea, do the following: Analyze my assumptions. What am I taking for granted that might not be true? Provide counterpoints. What would an intelligent, well-informed skeptic say in response? Test my reasoning. Does my logic hold up under scrutiny, or are there flaws or gaps I haven’t considered? Offer alternative perspectives. How else might this idea be framed, interpreted, or challenged? Prioritize truth over agreement. If I am wrong or my logic is weak, I need to know. Correct me clearly and explain why</i>
评论 #43528096 未加载
评论 #43516488 未加载
评论 #43515706 未加载
评论 #43516929 未加载
评论 #43517126 未加载
runjake大约 2 个月前
It’s a bit more complicated than that. Watch Karpathy’s video, Intro to LLMs. He explains it about as good as anyone could:<p><a href="https:&#x2F;&#x2F;youtu.be&#x2F;zjkBMFhNj_g?feature=shared" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;zjkBMFhNj_g?feature=shared</a>
wruza大约 2 个月前
In attempt to have better &quot;chat guidance&quot; I occasionally ask an ignorant leading question, on purpose. This either helps llms fix inconsistencies in their previous answers, <i>OR</i> makes them agree on something that I made up unknowingly and fall into the universe where it&#x27;s true.<p>That&#x27;s not surprising though, and you can see it after a short while. What surprises <i>me</i>, is that people fall for it and take the ignoring position of being astonished by everything else llms do. I guess many people are just gullible by design and this tech, also naturally, abuses it to the limit. It is sort of an inevitable bubble. We also ignored the vulnerability that speech naturally is for way too long and this is gonna bite next generations hard in the ass (ignoring what power&#x2F;business structures do with speech-based technologies right now, but that at least can be caregorized as humanity as usual).
muzani大约 2 个月前
They can be absolutely vicious if they wanted to. The early versions (GPT-2, GPT-3, etc) were. They act this way because it&#x27;s safe, it doesn&#x27;t panic people.<p>Gemini once said &quot;just die&quot; once which is perfectly in line with its &#x27;personality&#x27; or specifically what it&#x27;s trained on. And it gets quoted again and again even though it&#x27;s a typical glitch.<p>So they&#x27;ve dumbed it down a lot and made it more affable by default. People say the personalities of Gemini, Grok, etc were jailbroken and more &#x27;human&#x27;, but if I&#x27;m not mistaken, it takes extra training to make it more agreeable.<p>ChatGPT is also built for newbies, as compared to the same model on the API. Meaning it&#x27;s on a lower temperature (less likely to be witty or risky), it&#x27;s designed to write in a certain way that&#x27;s more detailed and likely to solve the average user&#x27;s problem. Similar with Claude on the site vs Claude on the API.
nextts大约 2 个月前
Yes. There are 2 aspects to this.<p>Roughly (from lay understand) LLMs predict what their training data would say. They are first trained on &quot;the internet, etc.&quot; so they can predict words well, e.g. finish off &quot;Paris is the...&quot; then using human feedback they are trained further to work in chat mode and be non-offensive, concise, be pleasant etc.
LinuxBender大约 2 个月前
That&#x27;s only the first step. Second step after winning your love, affection and devotion is to manipulate and everyone will defend it because it is their significant other. They will share pillow-talk and deepest darkest secrets with it. Then there is step three.
评论 #43515258 未加载
more_corn大约 2 个月前
Yes, but maybe that’s just the answer you’re fishing for.