TE
테크에코
홈24시간 인기최신베스트질문쇼채용
GitHubTwitter
홈

테크에코

Next.js로 구축된 기술 뉴스 플랫폼으로 글로벌 기술 뉴스와 토론을 제공합니다.

GitHubTwitter

홈

홈최신베스트질문쇼채용

리소스

HackerNews API원본 HackerNewsNext.js

© 2025 테크에코. 모든 권리 보유.

Launch HN: Nao Labs (YC X25) – Cursor for Data

155 포인트작성자: ClaireGz3일 전
Hey HN, we’re Claire and Christophe from nao Labs (<a href="https:&#x2F;&#x2F;getnao.io&#x2F;">https:&#x2F;&#x2F;getnao.io&#x2F;</a>). We just launched nao, an AI code editor to work with data: a local editor, directly connected with your data warehouse, and powered by an AI copilot with built-in context of your data schema and data-specific tools.<p>See our demo here: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=QmG6X-5ftZU" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=QmG6X-5ftZU</a><p>Writing code with LLMs is the new normal in software engineering. But not when it comes to manipulating data. Tools like Cursor don’t interact natively with data warehouses — they autocomplete SQL blindly, not knowing your data schema. Most of us are still juggling multiple tools: writing code in Cursor, checking results in the warehouse console, troubleshooting with an observability tool, and verifying in BI tool no dashboard broke.<p>When you want to write code on data with LLMs, you don’t care much about the code, you care about the data output. You need a tool that helps you write code relevant for your data, lets you visualize its impact on the output, and quality check it for you.<p>Christophe and I have each spent 10 years in data — Christophe was a data engineer and has built data platforms for dozens of orgs, I was head of data and helped data teams building their analytics &amp; data products. We’ve seen how the business asks you to ship data fast, while you’re here wondering if this small line of code will mistakenly multiply the revenue on your CEO dashboard by x5. Which leaves you 2 choices: test extensively and ship slow. Not test and ship fast. That’s why we wanted to create nao: a tool really adapted to our data work, that would allow data teams to ship at business pace.<p>nao is a fork of VS Code, with built-in connectors for BigQuery, Snowflake, and Postgres. We built our own AI copilot and tab system, gave them a RAG of your data warehouse schemas and of your codebase. We added a set of agent tools to query data, compare data, understand data tools like dbt, assess the downstream impact of code in your whole data lineage.<p>The AI tab and the AI agent write straight away code matching your schema, may it be for SQL, python, yaml. It shows you code diffs and data diffs side by side, to visualize what your change did to the data output. And you can leave the data quality checks to the agent: detect missing or duplicated values, outliers, anticipate breaking changes downstream or compare dev and production data differences.<p>Data teams usually use nao for writing SQL pipelines, often with dbt. It helps them create data models, document them, test them, while making sure they’re not breaking data lineage and figures in the BI. In run mode, they also use it to run some analytics, and identify data quality bugs in production. For less technical profiles, it’s also a great help to strengthen their code best practices. For large teams, it ensures that the code &amp; metrics remain well factorized and consistent.<p>Software engineers use nao for the database exploration part: write SQL queries with nao tab, explore data schema with the agent, and write DDL.<p>Question we often get is: why not just use Cursor and MCPs? Cursor has to trigger many MCP calls to get full context of the data, while nao has it always available in one RAG. MCPs stay in a very enclosed part of Cursor: they don’t bring data context to the tab. And they don’t make the UI more adapted to data workflows. Besides, nao comes as pre-packaged for data teams: they don’t have to set up extensions, install and authenticate in MCPs, build CI&#x2F;CD pipelines. Which means even non-technical data teams can have a great developer experience.<p>Our long-term goal is to become the best place to work with data. We want to fine-tune our own models for SQL, Python and YAML to give the most relevant code suggestions for data. We want to enlarge our comprehension of all data stack tools, to become the only agnostic editor for any of your data workflow.<p>You can try it here: <a href="https:&#x2F;&#x2F;sunshine.getnao.io&#x2F;releases&#x2F;">https:&#x2F;&#x2F;sunshine.getnao.io&#x2F;releases&#x2F;</a> - download nao, sign up for free and start using it. Just for HN Launch, you can create a temporary account with a simple username if you’d prefer not to use your email. For now, we only have Mac version but Linux and Windows are coming.<p>We’d love to hear your feedback — and get your thoughts on how we can improve even further the data dev experience!

23 comments

paddy_m2일 전
This looks like many LLM assisted data projects which help and are flexible, but aren&#x27;t repeatable and aren&#x27;t fast enough to be interactive. Nao is a good execution of the concept.<p>I built Buckaroo as a data table UI for Jupyter and Pandas&#x2F;Polars, that first lets you look at the data in a modern performant table with histograms, formatting, and summary stats.<p>Yesterday I released autocleaning for Buckaroo. This looks at data and heuristically chooses cleaning methods with definite code. This is fast (less than 500ms). Multiple cleaning strategies can be cycled through and you can choose the best approach for your data. For the simple problems we shoudn&#x27;t need to consult an LLM to do the obvious things.<p>All of this is open source and extensible.<p>[1] <a href="https:&#x2F;&#x2F;youtube.com&#x2F;shorts&#x2F;4Jz-Wgf3YDc" rel="nofollow">https:&#x2F;&#x2F;youtube.com&#x2F;shorts&#x2F;4Jz-Wgf3YDc</a><p>[2] <a href="https:&#x2F;&#x2F;github.com&#x2F;paddymul&#x2F;buckaroo">https:&#x2F;&#x2F;github.com&#x2F;paddymul&#x2F;buckaroo</a><p>[3] <a href="https:&#x2F;&#x2F;marimo.io&#x2F;p&#x2F;@paddy-mullen&#x2F;buckaroo-auto-cleaning" rel="nofollow">https:&#x2F;&#x2F;marimo.io&#x2F;p&#x2F;@paddy-mullen&#x2F;buckaroo-auto-cleaning</a> Live WASM notebook that you can play with - no downloads or installs required
评论 #43944135 未加载
评论 #43940922 未加载
EliBullockPapa3일 전
Cool idea! How did you train your tab model? Fill in the middle or is it based on edit history like cursor? Someone posted this yesterday and I found it fascinating <a href="https:&#x2F;&#x2F;www.coplay.dev&#x2F;blog&#x2F;a-brief-history-of-cursor-s-tab-completion" rel="nofollow">https:&#x2F;&#x2F;www.coplay.dev&#x2F;blog&#x2F;a-brief-history-of-cursor-s-tab-...</a>
评论 #43938765 未加载
tucared2일 전
Been using this for several weeks now and it&#x27;s genuinely improved my workflow—I&#x27;m choosing it over VSCode and extensions more than half the time.<p>The chat for exploratory data analysis (&quot;what can you tell me about this column I just added?&quot;), the worksheets and column lineage are real game-changers for dbt development. These features feel purposefully designed for how I actually work.<p>Claire and Christophe are super responsive to feedback, implementing features and fixes quickly. You can see the product evolving in all the right directions!
评论 #43940606 未加载
nathan_douglas3일 전
This is really slick. I watched the YouTube video (a couple of times; I didn&#x27;t grok what was happening immediately) and I really love how this accelerates feedback cycles. Very, very cool.
评论 #43939001 未加载
bilalq2일 전
Does this only work if I&#x27;m writing raw SQL? Can I use this today if my project uses Postgres but has queries written in TypeScript using a query builder like Kysely?
评论 #43940312 未加载
TheTaytay2일 전
How much data is hitting your models&#x2F;prompts? I am okay with you knowing about my schema, but a lot of warehouse data is sensitive data. I saw you have enterprise plans, and maybe that is my answer, but I’d love to know ahead of time if data&#x2F;results are hitting your servers in addition to the code, or if it’s code-only.
评论 #43940886 未加载
redwood_2일 전
Like the looks of this. Any chance you&#x27;ll be adding support for SQLite at some point?
评论 #43940253 未加载
jhashemi3일 전
also how does it do with transitive joins across multiple tables that may not have FK&#x2F;PK relationships? Other key features that would put this over the top: Usage analysis and query rewriting for inefficient already existing queries.
评论 #43938865 未加载
jinjin22일 전
Do you support Exasol? In the current climate we don’t want to be too dependent on US cloud services, so we are moving our performance sensitive dwh workloads off Snowflake to Exasol.
评论 #43940295 未加载
sgt1012일 전
Does anyone have any links for more LLM based tools that are aimed at data engineering and data science?
评论 #43946327 未加载
coatue3일 전
Would this work with Hydra? <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43937852">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=43937852</a>
评论 #43939608 未加载
badotnet3일 전
I&#x27;ve met one of the founders, Christophe. Smart, perfect vision and huge energy. I can say that I have no doubt they&#x27;ll succeed with Nao! Congrats!
评论 #43939112 未加载
评论 #43939110 未加载
pdepip2일 전
Congrats on the HN launch! Really excited to give this a try I think this could be a huge unlock for my team.<p>One quick issue - unable to connect to my postgres instance that requires SSL.<p>SSH tunneling seems to be broken as well because when the box is checked I am unable to select a private key path and the connect button is gone<p>Parsing DB URI would be a helpful feature as well!<p>Thanks so much, excited to get this up and running when everything is fixed!
评论 #43941800 未加载
pomarie3일 전
Great idea! How does your tab model compare to other ones from Cursor&#x2F;Windsurf..?
评论 #43939087 未加载
jakozaur2일 전
The founders showcased a demo at the Data Council conference. Looked cool!
评论 #43940531 未加载
jnnnthnn2일 전
This looks awesome! I wish I could connect to my Postgres DB using SSL.
评论 #43940828 未加载
jhashemi3일 전
add dataform support please for us Google&#x2F;BigQuery native orgs :-)
评论 #43938814 未加载
yevpats2일 전
Nice! I think this space is growing. There are a few others Im aware off in the space worth checking out: <a href="https:&#x2F;&#x2F;julius.ai&#x2F;">https:&#x2F;&#x2F;julius.ai&#x2F;</a>, <a href="https:&#x2F;&#x2F;cipher42.ai" rel="nofollow">https:&#x2F;&#x2F;cipher42.ai</a> (I&#x27;ve built the early version of this).
评论 #43940273 未加载
dmonay2일 전
Any plans to add support for ClickHouse? If so, what does that timeline look like?
评论 #43942164 未加载
christoribeiro3일 전
That’s exactly what I was looking for months ago. I will check out Nao for sure.
评论 #43939123 未加载
mrfumier3일 전
Awesome product!
评论 #43939135 未加载
redox_3일 전
Well done!
评论 #43939427 未加载
dennisy3일 전
Does this mean we will have people “vibe coding” data warehouses now? Might cause a few issues…
评论 #43939426 未加载