FWIW I ran a startup that provided you with a program (single binary) that allowed you to run natural language queries on your database across most schemas. It had a full semantics layer that translated your query into a mixed lambda-calculus/Prolog query, which was then translated into SQL as needed - you can see a sample of the semantics layer here: <a href="https://youtu.be/fd4EPh2tYrk?t=92" rel="nofollow">https://youtu.be/fd4EPh2tYrk?t=92</a>.<p>It's deep learning based with a lot of augmentation. Going from the OP's article to actually being able to run queries on any schema is quite a bit more work. I'd love to see GPT-3 handle arbitrary schemas.<p>P.S.: the startup failed. Deep-research-based startups need a lot of funds.
<i>Woah. I never gave it my database schema but it assumes I have a table called "users" (which is accurate) and that there's a timestamp field called "signup_time" for when a user signed up.</i><p>I am definitely impressed by the fact that it could get this close without knowledge of the schema, and that you can provide additional context about the schema. Seems like there is a lot of potential for building a natural language query engine that is hooked up to a database. I suppose there is always a risk that a user could generate a dangerous query but that could be mitigated.<p>Not related to the article but what exactly is "open" about OpenAI?
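One way the dangerous-query risk could be mitigated is a guard that rejects anything but a single read-only statement before it reaches the database. A minimal sketch (the keyword list is illustrative, not exhaustive; a real system would also use a SQL parser and a read-only database role):

```python
import re

# Hypothetical guard for model-generated SQL: allow only single,
# read-only SELECT statements with no DML/DDL keywords.
FORBIDDEN = re.compile(
    r"\b(insert|update|delete|drop|alter|truncate|grant|create)\b",
    re.IGNORECASE,
)

def is_safe_select(sql: str) -> bool:
    """Return True only for a single SELECT with no forbidden keywords."""
    stripped = sql.strip().rstrip(";")
    if ";" in stripped:  # reject multi-statement payloads
        return False
    if not stripped.lower().startswith("select"):
        return False
    return not FORBIDDEN.search(stripped)

print(is_safe_select("SELECT count(*) FROM users"))      # True
print(is_safe_select("SELECT 1; DROP TABLE users"))      # False
print(is_safe_select("DELETE FROM users WHERE id = 1"))  # False
```

This only narrows the blast radius; it doesn't catch expensive-but-valid queries, so timeouts and row limits would still be needed.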
This is a use case where AI-powered SQL is a solution in search of a problem, and it introduces more issues than just writing boring SQL by hand. For data analysis, accuracy matters far more than speed, and the article is unclear about how many attempts each example query took. GPT-3 does not always produce coherent output (even with good prompts), and since not 100% of the output is valid SQL, the QA effort and your risk tolerance for bad output affect the economics.<p>OpenAI's GPT-3 API is expensive enough (especially with heavy prompt engineering) that the time saved may not outweigh the cost, particularly if the output is not 100% accurate.
This seems to be consistent with my outsider view of AI demos.<p>1) Have a question<p>2) Figure out the answer<p>3) Have the AI figure out the answer<p>4) If the AI figured out your answer, be impressed, otherwise try again.
The problem with the current "AI" technology is that it is only approximately correct (or rather, it is somewhat likely to produce a "good" result). This makes for great use cases involving human perception, as we can filter out or correct small mistakes and reject big ones. But when used as input to a machine, even the smallest mistake can have huge consequences. Admittedly, this nonlinearity also applies when human beings "talk" to machines, but the input to and output of a single human being will always be constrained, whereas a machine could output billions of programs per day. I don't think it would be wise to follow that route before we have computational models that can cope with the many small and few big mistakes an "AI" would make.
You know what grinds my gears with GPT-3? The fact that I can't tinker with it. I can't do what this guy just did, or play around with it, or learn from it, or whatever. Access is limited.<p>I feel like I'm back in 95, when I had to beg faculty staff to get a copy of VB on some lab computer, only to be able to use it 1 hour a day. Restricting knowledge like this, in 2021, feels odd.
Am I reading it wrong, or was the output for this example wrong:<p>Input: how much revenue have we received from users with an email ending in 'seekwell.io' in the last 3 months?<p>The output conditions on:
users.signup_dt >= now() - interval '3 months'<p>But my interpretation would be to condition on charges.charge_dt.<p>It reminds me of the criticisms of almost-fully-autonomous vehicles, where the driver pays less attention. Now I'm curious whether queries written from scratch would be less likely to have errors than queries that use GPT-3 as a starting point.
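The distinction matters, and it's easy to demonstrate. A toy reproduction using sqlite3 and the table/column names assumed from the article (users, charges): filtering on users.signup_dt counts revenue only from recently signed-up users, while filtering on charges.charge_dt counts recent charges regardless of signup date.

```python
import sqlite3

# In-memory fixture: one long-standing user with a recent charge.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users (id INTEGER, email TEXT, signup_dt TEXT);
    CREATE TABLE charges (user_id INTEGER, amount REAL, charge_dt TEXT);
    INSERT INTO users VALUES (1, 'a@seekwell.io', '2020-01-01');
    INSERT INTO charges VALUES (1, 100.0, '2021-02-01');
""")

recent = "date('2021-02-15', '-3 months')"

# GPT-3-style query: conditions on signup date -> misses the charge.
wrong = conn.execute(f"""
    SELECT coalesce(sum(c.amount), 0) FROM charges c
    JOIN users u ON u.id = c.user_id
    WHERE u.email LIKE '%seekwell.io' AND u.signup_dt >= {recent}
""").fetchone()[0]

# Intended query: conditions on charge date -> includes it.
right = conn.execute(f"""
    SELECT coalesce(sum(c.amount), 0) FROM charges c
    JOIN users u ON u.id = c.user_id
    WHERE u.email LIKE '%seekwell.io' AND c.charge_dt >= {recent}
""").fetchone()[0]

print(wrong, right)  # 0 100.0
```

Same question in English, two plausible queries, two different answers — exactly the kind of error a less-attentive "driver" won't notice.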
I've built this as well! I won an internal hackathon with this and so I ran up against <i>many</i> of the issues you'll find here.<p>1. There is unlimited flexibility in the prompt.<p>Seemingly irrelevant changes to the prompt can change whether you get out correct SQL or not. Sometimes you can just repeat things in the prompt and get different and better results. "Write correct SQL. Write correct SQL"<p>For any one input question you may be able to tweak the prompt to get the correct answer out. But you need to do this tweaking for each question (and know the correct answer you need). Tweaking one prompt may break all other input-output pairs.<p>2. Real questions involve multiple large schemas.<p>I deal with tables with thousands or tens of thousands of columns. There is no way you can get GPT-3 to deal with that scale with a simple input as shown here. And of course you want to join across many tables etc.<p>3. Syntax<p>Natural language is more robust than SQL, you can get <i>close</i> and get the point across. Most language models trained on general corpora are fundamentally not suited to the symbolic manipulation of languages like SQL.<p>This isn't to say that GPT-3 couldn't be part of a solution to this problem, but please restrain your exuberance, it's not going to solve this problem out of the box.
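The prompt-tweaking described in point 1 usually means serializing schema context into the prompt by hand. A minimal sketch of that kind of prompt construction (table and column names are hypothetical; the point is only that the model sees nothing you don't put in the text):

```python
# Build a schema-aware prompt for a text-to-SQL completion model.
# Every table/column the model may reference must appear in the prompt,
# which is why huge schemas (point 2) blow past context limits.
def build_prompt(schema: dict, question: str) -> str:
    lines = ["### Postgres tables, with their columns:"]
    for table, cols in schema.items():
        lines.append(f"# {table}({', '.join(cols)})")
    lines.append(f"### Question: {question}")
    lines.append("SELECT")  # prime the completion to begin a SQL statement
    return "\n".join(lines)

prompt = build_prompt(
    {"users": ["id", "email", "signup_dt"],
     "charges": ["user_id", "amount", "charge_dt"]},
    "How much revenue in the last 3 months?",
)
print(prompt)
```

With thousands of columns per table, this serialization alone exceeds the model's context window, before the question is even asked.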
He's our fastest business analyst but sometimes on a hot day he'll just keep repeating "Syntax Error"...<p>Very cool work, I continue to be blown away by what GPT-3 can achieve.
"GPT-3, I need a React App that displays vital economic statistics from the World Bank API."<p>----<p>"Nice! can you add a drop-down for regional statistics when in country view?"<p>----<p>"Just one last thing. Can you make the logo bigger?"
Amazon pitched something like this at re:Invent. I'm bearish because the art of good analytics is deeply understanding the data. The SQL queries are the easy part. It's knowing when to omit inactive states, or how accrual works in the system, or that field X was deprecated and isn't accurate anymore.<p>These systems solve none of those underlying questions, and in fact are likely to make things worse. Because if you won't even bother to write SQL or look at your database, you are even less likely to make sure it's high quality.<p>(There is maybe an exemption here for queries where you don't really care if you get the real answer. I don't know what those questions would be, but that would be a good use case.)
We should start with the caveat that the GPT-3 API waitlist doesn't actually move; you literally need an employee to manually take you off it.
This is really cool, but it's clear that the person requesting the SQL has to know whether the generated SQL is correct for it to be of use.<p>If I'm a non-technical user and I ask a plain-language question and the generated SQL is incorrect, it's likely going to give the wrong answer -- but unless it's terribly wrong ("Syntax error", truly implausible values) the user may not know that it's wrong.<p>So I see this as more of a tool to speed up development than a tool that can power end users' plain-language queries. But who knows? Maybe GPT-4 will clear that hurdle.
It reminds me of why tools like Tableau are so useful. You don't have to teach people SQL or whatever; they can build their own visualizations and Tableau will do the SQL for you.
Can you be 100% sure that GPT-3 will really understand what you want to query? Or do you have to double-check every query before you run it?<p>If the latter, it's not automation, it's just a nice "autocomplete" feature.
I constantly wonder how people are getting access to the GPT-3 API (as beta users) when so many are still on the waiting list. The stock answer of "use the AI Dungeon game" is quite lacking.
Can someone check if the page <a href="https://blog.seekwell.io/gpt3" rel="nofollow">https://blog.seekwell.io/gpt3</a> is causing CPU spikes in the browser?
I’m really curious as to how this would deal with a “Bobby Tables” scenario or deliberate malicious intent.<p>The ethics implications are... fascinating.
Teaching it CTEs, subqueries, generic column identifiers (1, 2, 3), unions, left outer joins, LIKE, regexp, JSON, arrays, array_agg, concat, coalesce, trunc, etc. would be fun.<p>But something else humans can do is look at the data to construct patterns as another way to extract it. To some degree this requires (bad) inductive-reasoning approaches like "I'll assume maybe most or all data in this column has this format." "Oops, it didn't, so let's tweak it to look like this, to cover it sometimes also being an empty string or null."
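That inductive loop can be sketched mechanically: guess a column's format from a sample of values, then widen the guess when exceptions like empty strings or nulls show up. A hedged sketch (the patterns are illustrative, not exhaustive):

```python
import re

# Candidate formats to test a column's values against, in order.
PATTERNS = {
    "date": re.compile(r"^\d{4}-\d{2}-\d{2}$"),
    "integer": re.compile(r"^-?\d+$"),
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
}

def guess_format(values: list) -> str:
    """Inductively guess a format; mark it nullable if values were missing."""
    non_empty = [v for v in values if v not in (None, "")]
    for name, pattern in PATTERNS.items():
        if non_empty and all(pattern.match(str(v)) for v in non_empty):
            suffix = " (nullable)" if len(non_empty) < len(values) else ""
            return name + suffix
    return "unknown"

print(guess_format(["2021-01-01", "2021-02-03", None]))  # date (nullable)
print(guess_format(["42", "-7"]))                        # integer
```

Like any induction over a sample, it's only as good as the sample — the "oops" step in the parent comment is unavoidable.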