Hi HN,<p>I'm excited to share some early tinkering on a project, autospec, an open-source QA agent for web applications.<p>Right now it's not fully packaged for use, but I wanted to get the idea out early and am looking for design feedback, suggestions, and open source collaborators to join in. I wrote it over memorial weekend :)<p>autospec uses vision and text language models to explore and generate commonsense e2e tests for web applications.<p>The goal is human-like evaluation: assessing the entire UI as a user would, making decisions based on the actual state of the application at each step, with zero initial configuration, and the ability to immediately adapt to new features.<p>Why I Built It:<p>I've experienced the difficulty in building the right amount of automated tests and at the right layer of abstraction to both provide good coverage, avoid flakiness, and avoid constant rewrites when implementation changes.<p>This is the first AI-driven application I've built. I was inspired by a couple of things:<p>* SWE-agent's [1] focus on agentic performance<p>* backend-GPT's README rant [2]: "The proper format for business logic is human intelligence."<p>* zerostep [3], autogpt [4], and other browser-controlled AI projects<p>Potential Next Steps:<p>* Save passing specs as playwright code and only fallback on spec failure to the AI agent, to see if the test can be self-healed according to the original spec.<p>* Create a curated benchmark of both common open source web apps that should pass, and versions with introduced bugs<p>* NPM package to run like `npx autospec --url <a href="https://example.com" rel="nofollow">https://example.com</a>`<p>* Github action and Vercel checks API integration to run on preview deployments<p>* Handling app auth securely+easily<p>* Continue exploring the balance between vision UI interpretation and DOM analysis<p>[1]: <a href="https://github.com/princeton-nlp/SWE-agent">https://github.com/princeton-nlp/SWE-agent</a><p>[2]: <a href="https://github.com/RootbeerComputer/backend-GPT">https://github.com/RootbeerComputer/backend-GPT</a><p>[3]: <a href="https://github.com/zerostep-ai/zerostep">https://github.com/zerostep-ai/zerostep</a><p>[4]: <a href="https://github.com/Significant-Gravitas/AutoGPT">https://github.com/Significant-Gravitas/AutoGPT</a><p>Thanks!<p>Zach