TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Reality Check on Deep Research by Ben Evans

1 点作者 alexdong3 个月前

1 comment

roosgit3 个月前
I&#x27;ve known about this issue since Lllama 1. Tried it with Llama 2 and Mistral when those models were released. LLMs are not databases.<p>The test I ran was to ask the LLM about an expired domain of a doctor (obstetrician). I no longer remember the exact domain, but it was similar to annasmithmd.com. One LLM would tell me it used to belong to a doctor named Megan Smith. Another got the name right, Anna Smith, but when I asked it what kind of a doctor, which specialty, it answered pediatrician.<p>So the LLM had no clue, but from the name of the domain it could infer (I guess that&#x27;s why they call it inference) that the &quot;md&quot; part was associated with doctors.<p>By the way, newer LLMs are very good at making domains more human readable by splitting them into words.