Ask HN: Some tips for evaluating intelligent agents?

2 points by bruturis about 1 year ago

A friend of mine is a solo developer who is building a large intelligent-actors platform on top of LLMs. I think his platform is overly abstract and makes a lot of LLM calls. How can one measure the gain in intelligent behavior of this platform versus vanilla GPT-4? I am thinking of some use case that would let him show the strength of his idea without a huge cost.

Edited: googling, I found this one (*), but I don't know what it would cost to test the platform with it.

(*) https://openreview.net/pdf?id=zAdUB0aCTQ

2 comments

willd13 about 1 year ago

What do you mean by an "intelligent actors platform"?


aristofun about 1 year ago

How do you expect to measure something abstract that is not yet even defined?
