TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Lilac: Analyze, structure, and clean unstructured data with AI

2 点作者 nsthorat超过 1 年前

2 条评论

nsthorat超过 1 年前
Lilac co-creator here :)<p>Lilac is an open-source tool that enables AI practitioners to see and quantify their datasets.<p>Lilac allows users to:<p>- Browse datasets with unstructured data.<p>- Enrich unstructured fields with structured metadata using Lilac Signals, for instance near-duplicate and personal information detection. Structured metadata allows us to compute statistics, find problematic slices, and eventually measure changes over time.<p>- Create and refine Lilac Concepts which are customizable AI models that can be used to find and score text that matches a concept you may have in your mind.<p>- Download the results of the enrichment for downstream applications.<p>Out of the box, Lilac comes with a set of generally useful Signals and Concepts, however this list is not exhaustive and we will continue to work with the OSS community to continue to add more useful enrichments.<p>Check out the demo on HuggingFace: <a href="https:&#x2F;&#x2F;lilacai-lilac.hf.space&#x2F;" rel="nofollow noreferrer">https:&#x2F;&#x2F;lilacai-lilac.hf.space&#x2F;</a> Find us on GitHub: <a href="https:&#x2F;&#x2F;github.com&#x2F;lilacai&#x2F;lilac">https:&#x2F;&#x2F;github.com&#x2F;lilacai&#x2F;lilac</a>
sammcgrail超过 1 年前
I really like the tooltips when you hover over the text. Exploring the imdb database is a useful example.