It's so awesome to learn a new term like "macaronic language".<p><a href="https://en.wikipedia.org/wiki/Macaronic_language" rel="nofollow noreferrer">https://en.wikipedia.org/wiki/Macaronic_language</a><p>I finally have the right term for the mock-German warning signs from 1960s-era mainframe rooms that gave us "blinkenlichten".<p><a href="https://en.wikipedia.org/wiki/Blinkenlights" rel="nofollow noreferrer">https://en.wikipedia.org/wiki/Blinkenlights</a><p>There should be a German term for this, but "gefälschtes Deutsch" ("fake German") doesn't quite capture it.
> What prevents my program from behaving differently after each preprocessing run?<p>> The strength of your faith in GPT-4.<p>I got a chuckle out of that.
Very cool project. How reliable are you finding your prompts? They look like good choices based on my experience prompting GPT-3.5 and 4 for code editing.<p>FYI, I think my open-source tool aider would work out of the box for this use case. You would just run:<p><pre><code>  aider file.py --msg "implement the comments"
</code></pre>
Of course aider works with any popular language, not just Python. And it can do a lot of other coding tasks. It's like pair programming with an AI.<p><a href="https://github.com/paul-gauthier/aider">https://github.com/paul-gauthier/aider</a>
Copilot currently keeps in context only the file you are editing. Cross-file support is coming, but it isn't here yet (<a href="https://githubnext.com/projects/copilot-view/" rel="nofollow noreferrer">https://githubnext.com/projects/copilot-view/</a>). It would be very useful, though.<p>One idea that's been noodling around in my brain is to construct a DFA (deterministic finite automaton) from the code seen in all the files, then feed the first n-1 tokens to the language model and constrain the nth token's selection to the transitions the DFA considers valid (rough sketch below). I recall someone did this for formats whose DFAs are fairly small (like JSON), and it essentially produced 100% syntactically valid JSON without hallucinations (the contents could still be garbage, but the structure held).<p>So, for example, if I had a `class ABC`, then typing `abc.` could produce:
1. only the methods that actually exist on it, and
2. argument suggestions for them drawn from the surrounding code, informed by the LLM.
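Something like this toy version is what I have in mind (greedy decoding, made-up names, and a hand-written DFA standing in for one derived from the codebase):<p><pre><code>import math

class DFA:
    """Tiny DFA: transitions maps state -> {token: next_state}."""
    def __init__(self, transitions, start, accepting):
        self.transitions = transitions
        self.start = start
        self.accepting = accepting

    def allowed(self, state):
        return set(self.transitions.get(state, {}))

def constrained_decode(logprobs_fn, dfa, max_len=16):
    """Greedy decode, but only over tokens the DFA allows at each step."""
    state, out = dfa.start, []
    for _ in range(max_len):
        valid = dfa.allowed(state)
        if not valid:
            break
        scores = logprobs_fn(out)  # stand-in for the language model
        token = max(valid, key=lambda t: scores.get(t, -math.inf))
        out.append(token)
        state = dfa.transitions[state][token]
        if state in dfa.accepting:
            break
    return out

# Toy DFA: after "abc." only the methods actually defined on ABC are legal.
dfa = DFA(
    transitions={"start": {"abc.": "dot"},
                 "dot": {"foo(": "args", "bar(": "args"},
                 "args": {")": "done"}},
    start="start",
    accepting={"done"},
)

def fake_logprobs(prefix):
    # Pretend the LLM prefers foo( over bar( given the surrounding code.
    return {"abc.": 0.0, "foo(": -0.1, "bar(": -2.0, ")": 0.0}

print(constrained_decode(fake_logprobs, dfa))  # ['abc.', 'foo(', ')']
</code></pre>
A real version would work at the tokenizer level and mask logits before sampling, but the constraint idea is the same.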
I love it.<p>I'd like to make something more constrained: instead of a fully general programming language, let the LLM configure data-flows between pre-defined modules, field mappings, or presentations.<p>Then, hopefully, we could let the end user edit the prompt more directly.
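To make that concrete, here's a toy sketch (all module names made up): the LLM only emits a wiring config over a registry of pre-defined modules, and we validate it before running anything.<p><pre><code># Hypothetical registry of pre-defined modules the LLM is allowed to wire up.
REGISTRY = {
    "load_csv": {"outputs": ["rows"]},
    "filter_rows": {"inputs": ["rows"], "outputs": ["rows"]},
    "bar_chart": {"inputs": ["rows"], "outputs": []},
}

def validate(pipeline):
    """pipeline is a list of {"module": name, "args": {...}} steps."""
    for step in pipeline:
        if step["module"] not in REGISTRY:
            raise ValueError(f"unknown module: {step['module']}")
    return pipeline

# What we'd ask the LLM to produce from the end user's prompt:
llm_output = [
    {"module": "load_csv", "args": {"path": "sales.csv"}},
    {"module": "filter_rows", "args": {"where": "region == 'EU'"}},
    {"module": "bar_chart", "args": {"x": "month", "y": "revenue"}},
]
validate(llm_output)
</code></pre>
The end user's prompt only ever changes the config, never the modules themselves.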
Nice, it's like cog (<a href="https://pypi.org/project/cog/" rel="nofollow noreferrer">https://pypi.org/project/cog/</a>), but automatic.<p>It could replace template rendering in the long run.
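For anyone who hasn't used cog: the generator code lives in comments between markers, and running `cog -r` on the file writes the generated lines in place, roughly like this:<p><pre><code># [[[cog
# import cog
# for field in ["name", "email"]:
#     cog.outl(f"def get_{field}(self): return self._{field}")
# ]]]
def get_name(self): return self._name
def get_email(self): return self._email
# [[[end]]]
</code></pre>
The project here is the same shape, except the "generator" is an LLM reading your comments.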
Assuming you're using source control properly and reading the diff before running it, I guess this is one way to make sure that a comment matches the code? If the bot changes it, maybe your comment wasn't clear enough?
How hard would it be to use Code Llama instead? <a href="https://ai.meta.com/blog/code-llama-large-language-model-coding/" rel="nofollow noreferrer">https://ai.meta.com/blog/code-llama-large-language-model-cod...</a>
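Probably not hard in principle: the tool just needs a different completion backend. A rough, untested sketch using Hugging Face transformers (the model name and prompt are just what I'd try first):<p><pre><code>from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "codellama/CodeLlama-7b-Instruct-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

def complete(prompt: str, max_new_tokens: int = 256) -> str:
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens and return only the generated completion.
    return tokenizer.decode(output[0][inputs["input_ids"].shape[1]:],
                            skip_special_tokens=True)

print(complete("# return the nth fibonacci number\ndef fib(n):"))
</code></pre>
Quality is the open question; the 34B variant is probably the one to compare against GPT-4.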
I tried implementing something like this over the summer but couldn't make progress because of the 20-30 second minimum response time for each OpenAI-generated block. From the demo video it looks like this runs pretty fast -- or does it?
Looks amazing. Would you ever consider using Claude as well?<p>I prefer Claude for code generation when using a newer framework or language (the 2021 training cutoff with GPT-4 is unfortunate).
I usually just copy-paste my entire file into ChatGPT and ask it for help - it rewrites the whole damn thing, no need for managed sections.
Seems likely that dev work moves towards this sort of thing - boilerplate being AI-managed.<p>It'll be hell to debug an ever-shifting codebase, though.
Similar to this? <a href="https://www.askmarvin.ai/welcome/quickstart/#ai-functions" rel="nofollow noreferrer">https://www.askmarvin.ai/welcome/quickstart/#ai-functions</a>
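For anyone who hasn't seen it, Marvin's AI functions look roughly like this (going from memory of the 1.x quickstart, so details may have drifted): you write a typed signature plus a docstring, and the LLM supplies the behavior at call time.<p><pre><code>from marvin import ai_fn

@ai_fn
def fruits(n: int) -> list[str]:
    """Generate a list of n different fruit names."""

fruits(3)  # e.g. ["apple", "banana", "cherry"]
</code></pre>
The difference with the OP's project is that this happens at call time via the API, rather than as a preprocessing pass that writes code into your file.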
This answer in the FAQ is wonderful:<p><pre><code> What prevents my program from behaving differently after each preprocessing run?
- The strength of your faith in GPT-4.</code></pre>
There are so many languages with awesome type systems that could help guide an AI to generate better code, and yet these experiments always choose Python.