TechEcho

Hi HN! I was mesmerized by the Claude Computer Use reveal last week and was specifically impressed by how well it navigated websites. This motivated me to create Cerebellum, a library that lets an LLM take control of a browser.Here is a demo of Cerebellum in action, performing the goal “Find a USB C to C cable that is 10 feet long and add it to cart” on amazon.com:<a href="https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC" rel="nofollow">https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC</a>Currently, it uses Claude 3.5 Sonnet’s newly released computer use ability, but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.Checkout the MIT licensed repo on github (<a href="https://github.com/theredsix/cerebellum">https://github.com/theredsix/cerebellum</a>) or install the library from npm (<a href="https://www.npmjs.com/package/cerebellum-ai" rel="nofollow">https://www.npmjs.com/package/cerebellum-ai</a>)Looking for feedback from the HN community, especially on: What browser tasks would you use an LLM to complete? Thanks again for taking a look!

5 comments

its_down_again6 months ago

> but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.Could you say more on this? I see that it's an open-source implementation of PLAN with Selenium and Claude's Cursor, but where will the "successes" of browser sessions be stored? Also, will it include an anonymization feature to remove PII from authenticated use cases?

评论 #42011763 未加载

imvetri6 months ago

You don't need LLM.Build interface to build knowledge graph.Nodes containing words, verbs are action, nouns are past verb. Action is movement on space.

Jayakumark6 months ago

Can this work with local models ?

评论 #42010617 未加载

theredsix6 months ago

OP here, happy to answer any questions you may have!

评论 #42014360 未加载

评论 #42024316 未加载

评论 #42010542 未加载

0x33316 months ago

Very cool!

5 comments

its_down_again6 months ago

评论 #42011763 未加载

imvetri6 months ago

You don't need LLM.Build interface to build knowledge graph.Nodes containing words, verbs are action, nouns are past verb. Action is movement on space.

Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use

5 comments

Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use

5 comments