TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: An open-source tool that semantically profiles your data using LLMs

10 pointsby zh2408about 1 year ago
The problem we solve is profiling tables: this is the initial step where you need to understand the table and identify any anomalies.<p>During the process, many small decisions require semantic understanding. For example, missing values are normal for &#x27;deathdate&#x27; (still alive) but abnormal for &#x27;name.&#x27; For outliers, 100 for ages is fine, but some are -1, which is impossible! We use LLMs to semantically understand your tables and detect anomalies.<p>You can try it by uploading a CSV, and we will email back the profile: <a href="https:&#x2F;&#x2F;cocoon-data-transformation.github.io&#x2F;page&#x2F;" rel="nofollow">https:&#x2F;&#x2F;cocoon-data-transformation.github.io&#x2F;page&#x2F;</a><p>Let me know your feedback. Thanks!

1 comment

haidousmabout 1 year ago
cool project. getting insecure form warnings when submitting.<p>you&#x27;ll want to spin up an ingress (nginx, ..) to front your requests &amp; use TLS (let&#x27;s encrypt)<p>edit: the CSV i used had dates in 2024 -- got this back<p>&quot;Timestamps are from year 2024 which is in the future.&quot;
评论 #40249375 未加载