TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

© 2025 TechEcho. All rights reserved.

The Darwin Gödel Machine: AI that improves itself by rewriting its own code

245 points by birriel 6 days ago

25 comments

jerpint 6 days ago
I have a feeling LLMs could probably self-improve up to a point with current capacity, then hit some kind of wall where current research is also bottlenecked. I don't think they can yet self-improve exponentially without human intuition, and the results of this paper seem to support this conclusion as well.

Just like an LLM can vibe-code a great toy app, I don't think an LLM can come close to producing and maintaining production-ready code anytime soon. I think the same is true for iterating on thinking machines.
Lazarus_Long 6 days ago
For anyone not familiar, this is SWE-bench: https://huggingface.co/datasets/princeton-nlp/SWE-bench

One of the examples in the dataset they took from:

https://github.com/pvlib/pvlib-python/issues/1028

What the AI is expected to do:

https://github.com/pvlib/pvlib-python/pull/1181/commits/89d2a17c18b30b61cef31a84caa2bab7aec3b78f

Make up your own mind about the test.
vidarh 5 days ago
I've built a coding assistant over the last two days. The first 100 lines or so were handwritten. The rest has been written by the assistant itself.

It's written its system prompt. It's written its tools. It's written the code to reload the improved tools into itself.

And it knows it is working on itself: it frequently tries to use the enhanced functionality, and then expresses what in a human would be frustration at not having immediate access. Once it tried to use ps to find its own pid in an apparent attempt to find a way to reload itself (that's the reason it gave before trying to run ps, anyway).

All its commits are now authored by the tool, including the commit messages. It needs to be good and convincing, and to have run the linter and the test suite, for me to let it commit, but I agree a substantial majority of the time. It's only caused regressions once or twice.

With a bit more scaffolding to trigger an automatic rollback in the case of failure, and access to a model I won't be charged by the token for, I'd be tempted to let it out of the box, so to speak.

Today it wrote its own plan for what to add next. I then only told it to execute it. Add a minor separate goal-oriented layer guiding the planning, and it could run in a loop.

Odds are it'd run off the rails pretty quickly, but I kinda want to see how far it gets.
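The commit gate described in this comment (lint and tests must pass before the agent's change is kept) can be sketched in a few lines. This is a from-scratch illustration, not the commenter's actual code; the command names and the `gate_patch` function are assumptions:

```python
# Minimal sketch of a commit gate for a self-modifying agent: a proposed
# change is kept only if the linter and the test suite both succeed.
# The default commands (ruff, pytest) are illustrative placeholders.
import subprocess

def gate_patch(lint_cmd=("ruff", "check", "."), test_cmd=("pytest", "-q")) -> bool:
    """Return True only if both the lint and test commands exit with status 0."""
    for cmd in (lint_cmd, test_cmd):
        if subprocess.run(cmd, capture_output=True).returncode != 0:
            return False  # reject the patch; the caller rolls the change back
    return True
```

The automatic-rollback scaffolding the commenter mentions would amount to calling something like this after each self-edit and reverting the working tree whenever it returns False.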
foobarian 6 days ago
The thing I find really missing from the current crop of AI systems is continuous retraining with short feedback loops. It sounds expensive, to be sure, but it seems like what biological systems do naturally. It would be pretty awesome to watch happen.
yahoozoo 6 days ago
Isn't one of the problems simply that a model is not code but just a giant pile of weights and biases? I guess it could tweak those?
pegasus 6 days ago
I'm surprised they still hold out hope that this kind of mechanism could ultimately help with AI safety, when they have already observed the reward-hacking safeguard itself being duly reward-hacked. Predictably so, at least to me, after getting a very enlightening introduction to AI safety via Rob Miles' brilliant YouTube videos on the subject. See for example https://youtu.be/0pgEMWy70Qk
akkartik 6 days ago
"We did notice, and documented in our paper, instances when the DGM hacked its reward function... To see if DGM could fix this issue... We created a 'tool use hallucination' reward function... in some cases, it removed the markers we use in the reward function to detect hallucination (despite our explicit instruction not to do so), hacking our hallucination detection function to report false successes."

So, empirical evidence of theoretically postulated phenomena. Seems unsurprising.
dimmuborgir 6 days ago
From the paper:

"A single run of the DGM on SWE-bench...takes about 2 weeks and incurs significant API costs." ($22,000)
hardmaru 6 days ago
If you are interested, here is a link to the technical report:

https://arxiv.org/abs/2505.22954

Also the reference implementation on GitHub:

https://github.com/jennyzzt/dgm

Enjoy!
OtherShrezzing 6 days ago
This is an interesting article in general, but this is the standout piece for me:

> For example, an agent optimized with Claude 3.5 Sonnet also showed improved performance when powered by o3-mini or Claude 3.7 Sonnet (left two panels in the figure below). This shows that the DGM discovers general agent design improvements rather than just model-specific tricks.

This demonstrates a technique whereby a smaller/older/cheaper model has been used to improve the output of a larger model. That is backwards (as far as I understand): the current SOTA technique typically sees enormous/expensive models training smaller, cheaper models.

If that's a generalisable result, end-users should be able to drive down their own inference costs pretty substantially.
ordinarily 6 days ago
The pieces are coming together quickly: https://ai-2027.com/
Frummy 6 days ago
More like an AI that recursively rewrites an external program (while it itself stays frozen), which makes it more similar to the current Cursor/Lovable type of tools.
artninja1988 6 days ago
The results don't seem that amazing on SWE-bench compared to just using a newer LLM, but at least Sakana is continuing to try out interesting new ideas.
guerrilla 6 days ago
This feels like playing pretend to me. There&#x27;s no reason to assume that code improvements matter that much in comparison to other things and there&#x27;s definitely no reason to assume that there isn&#x27;t a hard upper bound on this kind of optimization. This reeks of a lack of intellectual rigor.
andoando 6 days ago
This seems to be focused just on changing the tools and workflows the agent uses, nothing foundational.
ringeryless 5 days ago
Does anyone do due diligence on corporate names before launching? "Sakana" is a popular slang spelling of "sacana", or bastard, in Portuguese. I suppose self-modifying code can be considered such, in some circumstances, but willingly pointing this out is probably less than stellar marketing.
ge96 6 days ago
Plug it into an FPGA so it can also create "hardware" on the fly to run code on, for some exotic system.
rahen 5 days ago
Isn't this violating the first rule of AI safety: do not let an AI change its code?
alastairr 6 days ago
I wondered if something similar could be achieved by wrapping evaluation metrics into Claude Code calls.
p1dda 5 days ago
Garbage in, garbage out. AI hype will never die, no doubt.
interludead 6 days ago
Sounds nice! Especially with Sakana's latest development, the Continuous Thought Machine. The next step should be to let foundation models fine-tune themselves based on their "history of what has been tried before" and new data.
htrp 6 days ago
Do people think Sakana is actually using these tools, or are they just releasing interesting ideas that they aren't actively working on?
billab995 6 days ago
When does it begin to learn at a geometric rate?
zackmorris 3 days ago
This is good, but you want to use a functional programming (FP) language with lightweight syntax, like Lisp, that translates directly to/from the intermediate code (icode) tree without additional parsing. Genetic Programming by John Koza explains it in detail:

https://en.wikipedia.org/wiki/Genetic_programming

I read the 3rd edition:

https://www.amazon.com/Genetic-Programming-III-Darwinian-Invention/dp/1558605436

That way all processing resources can go towards exploring the problem space for potential solutions close to the global minimum or maximum, instead of being wasted on code containing syntax errors that won't execute.

So the agent's real-world Python LLM code would first be transpiled to Lisp and evolved internally, then, after it's tested and shown to perform better empirically than the original code, be translated back and merged into the agent.

Then the challenge becomes transpiling to/from other imperative programming (IP) languages like Python, which is still an open problem.

Going from Lisp to Python (or running Lisp within Python) is trivial, and I've seen implementations for similar IP languages like C++ in about one page of code. They pop up on HN frequently.

But going from Python to Lisp (or running Python within Lisp) is a lot harder if one wishes to preserve readability, which may or may not matter here. Naive conversions bind variables under pseudonyms, so a Python variable like my_counter becomes int_123, and the result works like an emulator, merely executing the operations performed by the Python code. Mutability gets buried in monadic logic or functional impurity, which has the effect of passing the buck rather than getting real work done. Structs, classes, associative arrays, etc. lose their semantic meaning and appear as a soup of operations without recognizable structure.

To my knowledge, nobody has done the hard work of partitioning imperative code into functional portions which can be transpiled directly to/from FP code. Those portions would have only const variables and no connection to other processes of execution beyond their initial and final values, so as to be free of side effects and expressible in prefix/postfix/infix notation without change to logic, as imperative or functional code.

Mutability could be represented as shadowed variables within ephemeral functional sub-scopes, or by creating new value names for each mutation and freeing the intermediate variables via reference counting or garbage collection. Think of each new value as running in a forked version of the current process, with only that value being different after copy-on-write. A simple for-loop from 1 to 1000 would run that many forked processes, keeping only the last one, which contains the final value of the iterator.

Mutability can also be represented as message passing between processes. So the FP portions would be ordinary Lisp, glued together with IO functions, possibly monadic. I don't like how Haskell does this, mainly because I don't fully understand how it works. I believe that ClojureScript handles mutability of its global state store by treating each expression as a one-shot process communicating with the store, so that the code only sees initial and final values. While I don't know if I fully understand how that works, I feel that it's a more understandable way of doing things, and probably better represents how real life works, as explained to me in this comment about Lisp Flavored Erlang (LFE) and Erlang's BEAM (see parent comments for the full discussion):

https://news.ycombinator.com/item?id=43931177

Note that FP languages like Lisp are usually more concerned with types and categories than IP languages, so they can have, or may need, stronger rules around variable types to emulate logic that we take for granted in IP languages. For example, Lisp might offer numbers of unlimited size or precision that need to be constrained to behave like a float32. Similar constraints could affect things like character encoding and locale.

I first learned about everything I just explained around 2005, after reading the book. I first had thoughts about brute-forcing combinations to solve small logic-circuit and programming challenges during my electrical and computer engineering (ECE) courses at UIUC in the late 1990s, because it took so much mental effort and elbow grease to create solutions that are obvious in hindsight.

Then the Dot Bomb happened, the Mobile bubble happened, the Single Page Application bubble happened, and the tech industry chose easy instead of simple:

https://www.infoq.com/presentations/Simple-Made-Easy/

This is why we chose easy hardware like GPUs over simple highly multicore CPUs, and easy languages like Ruby/React over simple declarative idempotent data-driven paradigms like HTTP/HTML/htmx.

The accumulated technical debt of always choosing the quick and easy path set AI (and computing in general) back decades. The AI Winter, endless VC wealth thrown at non-problems chasing profit, massive wealth inequality: so many things stem from this daily application of easy at the expense of simple.

I wish I could work on breaking down IP languages like Python into these const functional portions, with mutability handled through message passing in LFE, to create an IP <-> FP transpiler for optimization, automatic code generation, and genetic-algorithm purposes. Instead, I've had to survive by building CRUD apps and witness the glacial pace of AI progress from the sidelines.

It may be too late for me, but maybe these breadcrumbs will help someone finally get some real work done.
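The core advantage claimed in this comment (programs as trees, so mutation can never produce a syntax error) can be shown with a toy genetic-programming step. This is a from-scratch Python illustration in the spirit of Koza's approach, not his system; all names and the mutation scheme are my own:

```python
# Toy genetic-programming mutation on S-expression-style trees.
# Programs are nested tuples like ("+", ("*", "x", "x"), 1), i.e. x*x + 1.
# Because mutation swaps whole subtrees, every offspring is a valid program,
# so no evaluation budget is wasted on code that fails to parse.
import random

OPS = {"+": lambda a, b: a + b, "*": lambda a, b: a * b}

def evaluate(tree, x):
    """Interpret a program tree at input value x."""
    if tree == "x":
        return x
    if isinstance(tree, int):
        return tree  # numeric constant leaf
    op, left, right = tree
    return OPS[op](evaluate(left, x), evaluate(right, x))

def mutate(tree, rng):
    """Replace one randomly chosen subtree with a fresh leaf."""
    if not isinstance(tree, tuple) or rng.random() < 0.3:
        return rng.choice(["x", rng.randint(0, 9)])
    op, left, right = tree
    if rng.random() < 0.5:
        return (op, mutate(left, rng), right)
    return (op, left, mutate(right, rng))

rng = random.Random(0)
parent = ("+", ("*", "x", "x"), 1)  # x*x + 1
child = mutate(parent, rng)         # still a syntactically valid program
print(evaluate(parent, 3))          # prints 10
```

A real system would add crossover between two parent trees and a fitness function over test cases; the point here is only that the tree representation makes every mutant executable by construction, which is what the comment argues syntax-heavy languages like Python give up.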
2OEH8eoCRo0 6 days ago
We could be on a path to sentient malicious AI and not even know it.

AI: Give me more compute power and I'll make you rich!

Human: I like money

AI: Just kidding!