I made this thing where LLM works like a tiny OS kernel - it plans tasks and runs different agents. Right now it's doing security stuff (poking at networks, checking SSL, testing APIs) with WebGoat as a playground.<p>The ultimate goal is to ask it "find vulnerabilities in XYZ" and see it orchestrate its own tools, backtrack, try different approaches, etc.<p>The cool part is the LLM doesn't just blast out a bunch of tokens - it plans, reasons and executes tasks step by step during inference.<p>Currently playing with security agents but you could plug in anything really. Would love getting you to play with it and maybe add an agent or two.