TechEcho

7 comments

simonw10 months ago

"Research shows that AI systems with 30+ agents out-performs a simple LLM call in practically any task (see More Agents Is All You Need), reducing hallucinations and improving accuracy."Has anyone heard of that actually playing out practically in real-world applications? This article links to the paper about it - <a href="https://arxiv.org/abs/2402.05120" rel="nofollow">https://arxiv.org/abs/2402.05120</a> - but I've not heard from anyone who's implementing production systems successfully in that way.(I still don't actually know what an "agent" is, to be honest. I'm pretty sure there are dozens of conflicting definitions floating around out there by now.)

评论 #41175293 未加载

评论 #41176633 未加载

评论 #41181937 未加载

评论 #41175900 未加载

评论 #41176994 未加载

评论 #41175793 未加载

评论 #41175180 未加载

评论 #41175058 未加载

评论 #41176454 未加载

评论 #41175061 未加载

det2x10 months ago

It's interesting how long the word "agents"/"intelligent agents" have been around for and how long they've been hyped up for. If you go back to the 80s and 90s you will see how Microsoft was hyping up "intelligent agents" in Windows but nothing ever became of it[1].I have yet to see an actual useful usecase for agents despite the countless posts asking for examples nobody has provided one.[1] <a href="https://www.wired.com/1995/09/future-forward/" rel="nofollow">https://www.wired.com/1995/09/future-forward/</a>

评论 #41176807 未加载

评论 #41176218 未加载

评论 #41179713 未加载

评论 #41176540 未加载

aantix10 months ago

Is there an agent framework that lives up to the hype?Where you specify a top-level objective, it plans out those objectives, it selects a completion metric so that it knows when to finish, and iterates/reiterates over the output until completion?

评论 #41175330 未加载

评论 #41175060 未加载

评论 #41175089 未加载

fancyfredbot10 months ago

Can I just ask whether other people think that "agentic" is a word?As far as I can tell it's not in the OED or Miriam Webster dictionaries. But recently everyone's using it so perhaps it soon will be.

评论 #41175742 未加载

评论 #41175692 未加载

评论 #41176334 未加载

评论 #41176272 未加载

评论 #41175658 未加载

alsima10 months ago

If we structured AI agents like big tech org charts, which company structures would perform better? Inspired by James Huckle's thoughts on how organizational structures impact software design, I decided to put this to the test: <a href="https://bit.ly/ai-corp-agents" rel="nofollow">https://bit.ly/ai-corp-agents</a>.

henning10 months ago

- Big tech is very different from open source- The original SWE-bench paper only consists of solved issues when a big part of Open Source is triage, follow-up, clarification and dealing with crappy issues- Saying "<Technique> is all you need" when you are increasing your energy usage 30-fold just to fail > 50% of the time is intellectually dishonest

评论 #41176054 未加载

29athrowaway10 months ago

Now give the agents stacked ranking and see how they converge to low performance.

7 comments

simonw10 months ago

评论 #41175293 未加载

评论 #41176633 未加载

评论 #41181937 未加载

评论 #41175900 未加载

评论 #41176994 未加载

评论 #41175793 未加载

评论 #41175180 未加载

评论 #41175058 未加载

评论 #41176454 未加载

评论 #41175061 未加载

det2x10 months ago

评论 #41176807 未加载

评论 #41176218 未加载

评论 #41179713 未加载

评论 #41176540 未加载

aantix10 months ago

评论 #41175330 未加载

评论 #41175060 未加载

评论 #41175089 未加载

fancyfredbot10 months ago

评论 #41175742 未加载

评论 #41175692 未加载

评论 #41176334 未加载

评论 #41176272 未加载

评论 #41175658 未加载

alsima10 months ago

henning10 months ago

评论 #41176054 未加载

29athrowaway10 months ago

Now give the agents stacked ranking and see how they converge to low performance.

AI agents but they're working in big tech

7 comments

AI agents but they're working in big tech

7 comments