Projell.com – Simple APIs for synthetic data generation

6 points by sumitsrivastava, over 5 years ago
Hi, I'm Sumit Srivastava, founder of Projell.com. We built this after going through data hell: low data availability, high data procurement costs, a huge time sink for data collection, and privacy concerns over user data.

This prompted me to build an easy way to generate synthetic data for machine learning models. It primarily uses GANs, but we pick whichever technique is most efficient for the specific use case.

Areas where we've found it useful are biomedical, drone imagery, satellite imagery, retail, and autonomous mobility.

As is already evident in the ImageNet challenge, state-of-the-art models use synthetic data to gain higher accuracy. [ https://paperswithcode.com/sota/image-classification-on-imagenet ]

Google, for their autonomous vehicles, used millions of miles of real driving data and billions of miles of synthetic data. It is clear where the world is heading.

I would be happy to share the tools with everyone, since dealing with data is something we struggled with and we don't want anyone else to struggle with it. This is probably only the first step towards building something robust that can remove as many data hassles as possible, if not all.

3 comments

Farmify, over 5 years ago
Hey, interesting idea! How do you plan to deal with sparse datasets? Let's suppose I have a biomedical dataset of 200 images. Is there a minimum dataset size requirement?
sumitsrivastava, over 5 years ago
Hi folks, OP here! Happy to answer any questions you might have.
deepwav, over 5 years ago
Hi! What kind of data are you planning to generate? Is it images only?