Hey HN,<p>I built SWE-Kit, LLM toolkit (Function callable tools) which makes building agents specialised in coding like Devin very easy.<p>I noticed a typical pattern while building local agents: creating & perfecting LLM tools to interact with system or codebase was the repeated and time-consuming. We created a layer that simplifies building agents that can interact with code, file system, git, shell and allows you to quickly solve for a wide variety of coding agent use cases.<p>Aren’t there open coding agents already? Well, yes, but most folks would want to solve their specific use case like a large refactor and current coding agents aren’t customisable to your specific use case or aren’t meant to be molded to different workflows.<p>The idea is to provide a library of tools so you can build software engineering agents with a few lines of code in agentic framework of your choice.<p>We have solved following hard parts for everyone -
- Optimized Coding Tools: Includes Code Analysis, File Operations, and Shell tools for seamless interaction with codebases and operating systems.
- Browser Interaction Tool: Enables navigation and interaction with UI-based applications and codebases.
- Framework Agnostic: Compatible with frameworks like LangChain, LlamaIndex, CrewAI, and Autogen, this allows you to work with your preferred setup.
- Third-Party Integrations: Connects with applications like GitHub, Slack, Jira, and Gmail to build fully autonomous, end-to-end AI coding agents.
- Flexible Deployment: Run on Local, Docker, Fly.io, E2b, AWS Lambda (soon!)<p>Is this the 10x Coding Agent I was looking for?<p>No this is not a coding agent but allows you to build your custom coding agent in framework of your choice.<p>We have created some templates to get started quickly though:
- GitHub PR Agent: Autonomously reviews GitHub pull requests with full codebase context.
- SWE Agent: Writes new features, debugs code, refactors, and creates tests.
- Codebase Q&A Agent: Enables natural language interactions with the codebase.<p>To better showcase the SWE kit's capability, we tested it on [swebench.com](<a href="https://www.swebench.com/" rel="nofollow">https://www.swebench.com/</a>), the benchmark for testing coding agents. It scored 48.60%, whereas Devin scored only 13.86%.<p>If you end up using this, please do provide feedback and if you need help building coding agent feel free to reach out to us<p>I (Soham) & my cofounder Karan are both active on this thread to answer any questions!