科技回声

Hi HN! I was mesmerized by the Claude Computer Use reveal last week and was specifically impressed by how well it navigated websites. This motivated me to create Cerebellum, a library that lets an LLM take control of a browser.Here is a demo of Cerebellum in action, performing the goal “Find a USB C to C cable that is 10 feet long and add it to cart” on amazon.com:<a href="https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC" rel="nofollow">https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC</a>Currently, it uses Claude 3.5 Sonnet’s newly released computer use ability, but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.Checkout the MIT licensed repo on github (<a href="https://github.com/theredsix/cerebellum">https://github.com/theredsix/cerebellum</a>) or install the library from npm (<a href="https://www.npmjs.com/package/cerebellum-ai" rel="nofollow">https://www.npmjs.com/package/cerebellum-ai</a>)Looking for feedback from the HN community, especially on: What browser tasks would you use an LLM to complete? Thanks again for taking a look!

5 条评论

its_down_again7 个月前

> but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.Could you say more on this? I see that it's an open-source implementation of PLAN with Selenium and Claude's Cursor, but where will the "successes" of browser sessions be stored? Also, will it include an anonymization feature to remove PII from authenticated use cases?

评论 #42011763 未加载

imvetri7 个月前

You don't need LLM.Build interface to build knowledge graph.Nodes containing words, verbs are action, nouns are past verb. Action is movement on space.

Jayakumark7 个月前

Can this work with local models ?

评论 #42010617 未加载

theredsix7 个月前

OP here, happy to answer any questions you may have!

评论 #42014360 未加载

评论 #42024316 未加载

评论 #42010542 未加载

0x33317 个月前

Very cool!

5 条评论

its_down_again7 个月前

评论 #42011763 未加载

imvetri7 个月前

You don't need LLM.Build interface to build knowledge graph.Nodes containing words, verbs are action, nouns are past verb. Action is movement on space.

Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use

5 条评论

Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use

5 条评论