AI agents but they're working in big tech

66 points by alsima 10 months ago

7 comments

simonw 10 months ago
"Research shows that AI systems with 30+ agents out-performs a simple LLM call in practically any task (see More Agents Is All You Need), reducing hallucinations and improving accuracy."

Has anyone heard of that actually playing out practically in real-world applications? This article links to the paper about it - https://arxiv.org/abs/2402.05120 - but I've not heard from anyone who's implementing production systems successfully in that way.

(I still don't actually know what an "agent" is, to be honest. I'm pretty sure there are dozens of conflicting definitions floating around out there by now.)
det2x 10 months ago
It's interesting how long the terms "agents"/"intelligent agents" have been around, and how long they've been hyped up. If you go back to the 80s and 90s, you will see how Microsoft was hyping up "intelligent agents" in Windows, but nothing ever came of it [1].

I have yet to see an actually useful use case for agents. Despite the countless posts asking for examples, nobody has provided one.

[1] https://www.wired.com/1995/09/future-forward/
aantix 10 months ago
Is there an agent framework that lives up to the hype?

One where you specify a top-level objective, and it plans out that objective, selects a completion metric so that it knows when to finish, and iterates/reiterates over the output until completion?
fancyfredbot 10 months ago
Can I just ask whether other people think that "agentic" is a word?

As far as I can tell it's not in the OED or Merriam-Webster dictionaries. But recently everyone's using it, so perhaps it soon will be.
alsima 10 months ago
If we structured AI agents like big tech org charts, which company structures would perform better? Inspired by James Huckle's thoughts on how organizational structures impact software design, I decided to put this to the test: https://bit.ly/ai-corp-agents.
henning 10 months ago
- Big tech is very different from open source

- The original SWE-bench paper only consists of solved issues, when a big part of open source is triage, follow-up, clarification and dealing with crappy issues

- Saying "<Technique> is all you need" when you are increasing your energy usage 30-fold just to fail > 50% of the time is intellectually dishonest
29athrowaway 10 months ago
Now give the agents stack ranking and see how they converge to low performance.