TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: How to use LLMs to generate accurate SQL for real-world data

21 pointsby zainhodaalmost 2 years ago
Hey HN,<p>We are Zain and Ashish, founders of Vanna AI. We recently embarked on an experiment to see if large language models (specifically LLMs) could help in generating SQL queries for real-world datasets. We initially started this project as a web app but realized that it was most useful and had broadest applicability as a Python package since you can then incorporate it into an existing workflow (Jupyter notebook, Slackbot, etc).<p>We&#x27;ve had some good success with customer datasets but we&#x27;ve generally heard a lot of skepticism so we decided to write a paper about the methodology we&#x27;re using and how various LLMs compare.<p>Let us know if you have any questions or requests. The underlying Python package is open source. There is a server component to store and retrieve metadata but by next week there will be a fully open-source and locally runnable version.<p>Cheers!

2 comments

chuckhendalmost 2 years ago
This looks awesome, great work team! A few questions :)<p>How complex were the &#x27;static examples&#x27; that were used? Can you share the examples of the three that were used in the tests?<p>Were the &quot;contextually relevant sql&quot; ran in addition to or isolated from the &quot;static examples&quot;?
评论 #37166245 未加载
tamiralalmost 2 years ago
really interesting work ! looking forward to exploring this with some of my datasets. Keep up the great work!