Show HN: Open-source tool for data cleaning with LLM

1 point by zh2408, about 1 year ago
Data cleaning is often the first, and most tedious, step in DS/ML/BI work. It involves tasks such as deduplication, data type casting, handling missing values, correcting typos, and more.

I'm working on an open-source project that guides you through the data cleaning process. It currently supports tables in Pandas/Snowflake/DuckDB, and outputs DBT SQL/YAML for cleaning.

We take an agent approach, which is very different from Copilot. With Copilot, humans need to specify the steps, which is burdensome. LLM agents handle the heavy lifting: they explore the data, figure out what needs cleaning, and guide humans through the steps.

Here is a 1-minute demo: https://youtu.be/D7jw43ccOkg

Please let me know your thoughts and feedback!
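For readers who want a concrete picture of the cleaning tasks listed above, here is a minimal sketch in plain Python. It is not the project's code and does not use its API; the rows, column names, `MISSING_TOKENS`, `to_float`, and `clean_rows` are all hypothetical and chosen only for illustration. It shows deduplication, type casting, missing-value handling, and simple text normalization, which are the kinds of steps a human would otherwise have to spell out by hand.

```python
from typing import Optional

# Hypothetical raw rows; column names and values are made up for illustration.
RAW_ROWS = [
    {"id": "1", "city": "Boston", "price": "10.5"},
    {"id": "1", "city": "Boston", "price": "10.5"},  # exact duplicate
    {"id": "2", "city": " boston ", "price": ""},    # stray whitespace, missing price
    {"id": "3", "city": "Austin", "price": "n/a"},   # placeholder for a missing value
]

# Assumed set of strings that should be treated as "missing".
MISSING_TOKENS = {"", "n/a", "na", "null", "none"}


def to_float(value: str) -> Optional[float]:
    """Cast a string to float, mapping missing-value placeholders to None."""
    text = value.strip()
    if text.lower() in MISSING_TOKENS:
        return None
    return float(text)


def clean_rows(rows):
    """Cast types, normalize text, and drop rows that are duplicates after cleaning."""
    seen = set()
    cleaned = []
    for row in rows:
        record = {
            "id": int(row["id"]),                 # type casting
            "city": row["city"].strip().title(),  # whitespace and case normalization
            "price": to_float(row["price"]),      # missing-value handling
        }
        key = (record["id"], record["city"], record["price"])
        if key in seen:                           # deduplication
            continue
        seen.add(key)
        cleaned.append(record)
    return cleaned


CLEANED = clean_rows(RAW_ROWS)
```

Every rule here is hard-coded by a person, which is the burden the agent approach described above is meant to lift.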

no comments